Project [CASE WHEN ((fractile#94223236 = NA) OR (fractile#94223236 = null)) THEN null ELSE cast(fractile#94223236 as int) END AS fractile#94223284, CASE WHEN (cap#94223237 = null) THEN null ELSE cap#94223237 END AS cap#94223285, CASE WHEN ((size#94223238 = NA) OR (size#94223238 = null)) THEN null ELSE cast(size#94223238 as float) END AS size#94223286, CASE WHEN ((value#94223239 = NA) OR (value#94223239 = null)) THEN null ELSE cast(value#94223239 as float) END AS value#94223287, CASE WHEN ((growth#94223240 = NA) OR (growth#94223240 = null)) THEN null ELSE cast(growth#94223240 as float) END AS growth#94223288, CASE WHEN ((leverage#94223241 = NA) OR (leverage#94223241 = null)) THEN null ELSE cast(leverage#94223241 as float) END AS leverage#94223289, CASE WHEN ((volatility#94223242 = NA) OR (volatility#94223242 = null)) THEN null ELSE cast(volatility#94223242 as float) END AS volatility#94223290, CASE WHEN ((momentum#94223243 = NA) OR (momentum#94223243 = null)) THEN null ELSE cast(momentum#94223243 as float) END AS momentum#94223291, CASE WHEN ((yield#94223244 = NA) OR (yield#94223244 = null)) THEN null ELSE cast(yield#94223244 as float) END AS yield#94223292, CASE WHEN ((##94223245 = NA) OR (##94223245 = null)) THEN null ELSE cast(##94223245 as float) END AS ##94223293, CASE WHEN (min_date#94223246 = null) THEN null ELSE cast(min_date#94223246 as date) END AS min_date#94223294, CASE WHEN (max_date#94223247 = null) THEN null ELSE cast(max_date#94223247 as date) END AS max_date#94223295]
Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397]
== Physical Plan ==
CollectLimit (20)
+- InMemoryTableScan (1)
+- InMemoryRelation (2)
+- * Sort (19)
+- Exchange (18)
+- * Project (17)
+- * BroadcastHashJoin Inner BuildRight (16)
:- * Project (8)
: +- * Filter (7)
: +- InMemoryTableScan (3)
: +- InMemoryRelation (4)
: +- * Project (6)
: +- Scan csv (5)
+- BroadcastExchange (15)
+- * Project (14)
+- * Filter (13)
+- InMemoryTableScan (9)
+- InMemoryRelation (10)
+- * Project (12)
+- Scan csv (11)
(1) InMemoryTableScan
Output [11]: [fractile#94223284, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295]
Arguments: [fractile#94223284, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295]
(2) InMemoryRelation
Arguments: [fractile#94223284, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(3) Sort [fractile#94223284 ASC NULLS FIRST], true, 0
+- Exchange rangepartitioning(fractile#94223284 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7522821]
+- *(2) Project [fractile#94223284, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295]
+- *(2) BroadcastHashJoin [cap_description#94223308], [description#94160396], Inner, BuildRight, false
:- *(2) Project [fractile#94223284, cap#94223285 AS cap_description#94223308, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295]
: +- *(2) Filter ((isnotnull(fractile#94223284) AND NOT (fractile#94223284 = -1)) AND isnotnull(cap#94223285))
: +- InMemoryTableScan [##94223293, cap#94223285, fractile#94223284, growth#94223288, leverage#94223289, max_date#94223295, min_date#94223294, momentum#94223291, size#94223286, value#94223287, volatility#94223290, yield#94223292], [isnotnull(fractile#94223284), NOT (fractile#94223284 = -1), isnotnull(cap#94223285)]
: +- InMemoryRelation [fractile#94223284, cap#94223285, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295], StorageLevel(disk, memory, deserialized, 1 replicas)
: +- *(1) Project [CASE WHEN ((fractile#94223236 = NA) OR (fractile#94223236 = null)) THEN null ELSE cast(fractile#94223236 as int) END AS fractile#94223284, CASE WHEN (cap#94223237 = null) THEN null ELSE cap#94223237 END AS cap#94223285, CASE WHEN ((size#94223238 = NA) OR (size#94223238 = null)) THEN null ELSE cast(size#94223238 as float) END AS size#94223286, CASE WHEN ((value#94223239 = NA) OR (value#94223239 = null)) THEN null ELSE cast(value#94223239 as float) END AS value#94223287, CASE WHEN ((growth#94223240 = NA) OR (growth#94223240 = null)) THEN null ELSE cast(growth#94223240 as float) END AS growth#94223288, CASE WHEN ((leverage#94223241 = NA) OR (leverage#94223241 = null)) THEN null ELSE cast(leverage#94223241 as float) END AS leverage#94223289, CASE WHEN ((volatility#94223242 = NA) OR (volatility#94223242 = null)) THEN null ELSE cast(volatility#94223242 as float) END AS volatility#94223290, CASE WHEN ((momentum#94223243 = NA) OR (momentum#94223243 = null)) THEN null ELSE cast(momentum#94223243 as float) END AS momentum#94223291, CASE WHEN ((yield#94223244 = NA) OR (yield#94223244 = null)) THEN null ELSE cast(yield#94223244 as float) END AS yield#94223292, CASE WHEN ((##94223245 = NA) OR (##94223245 = null)) THEN null ELSE cast(##94223245 as float) END AS ##94223293, CASE WHEN (min_date#94223246 = null) THEN null ELSE cast(min_date#94223246 as date) END AS min_date#94223294, CASE WHEN (max_date#94223247 = null) THEN null ELSE cast(max_date#94223247 as date) END AS max_date#94223295]
: +- FileScan csv [fractile#94223236,cap#94223237,size#94223238,value#94223239,growth#94223240,leverage#94223241,volatility#94223242,momentum#94223243,yield#94223244,##94223245,min_date#94223246,max_date#94223247] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/tm1/eatm1_score/REPORT..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<fractile:string,cap:string,size:string,value:string,growth:string,leverage:string,volatili...
+- BroadcastExchange HashedRelationBroadcastMode(List(input[0, string, true]),false), [id=#7522816]
+- *(1) Project [description#94160396]
+- *(1) Filter ((isnotnull(cap#94160394) AND (cap#94160394 = 0)) AND isnotnull(description#94160396))
+- InMemoryTableScan [cap#94160394, description#94160396], [isnotnull(cap#94160394), (cap#94160394 = 0), isnotnull(description#94160396)]
+- InMemoryRelation [cap#94160394, sort#94160395, description#94160396, universe#94160397], StorageLevel(disk, memory, deserialized, 1 replicas)
+- *(1) Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397]
+- FileScan csv [cap#94160377,sort#94160378,description#94160379,universe#94160380] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
,None), [fractile#94223284 ASC NULLS FIRST]
(3) InMemoryTableScan
Output [12]: [##94223293, cap#94223285, fractile#94223284, growth#94223288, leverage#94223289, max_date#94223295, min_date#94223294, momentum#94223291, size#94223286, value#94223287, volatility#94223290, yield#94223292]
Arguments: [##94223293, cap#94223285, fractile#94223284, growth#94223288, leverage#94223289, max_date#94223295, min_date#94223294, momentum#94223291, size#94223286, value#94223287, volatility#94223290, yield#94223292], [isnotnull(fractile#94223284), NOT (fractile#94223284 = -1), isnotnull(cap#94223285)]
(4) InMemoryRelation
Arguments: [fractile#94223284, cap#94223285, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((fractile#94223236 = NA) OR (fractile#94223236 = null)) THEN null ELSE cast(fractile#94223236 as int) END AS fractile#94223284, CASE WHEN (cap#94223237 = null) THEN null ELSE cap#94223237 END AS cap#94223285, CASE WHEN ((size#94223238 = NA) OR (size#94223238 = null)) THEN null ELSE cast(size#94223238 as float) END AS size#94223286, CASE WHEN ((value#94223239 = NA) OR (value#94223239 = null)) THEN null ELSE cast(value#94223239 as float) END AS value#94223287, CASE WHEN ((growth#94223240 = NA) OR (growth#94223240 = null)) THEN null ELSE cast(growth#94223240 as float) END AS growth#94223288, CASE WHEN ((leverage#94223241 = NA) OR (leverage#94223241 = null)) THEN null ELSE cast(leverage#94223241 as float) END AS leverage#94223289, CASE WHEN ((volatility#94223242 = NA) OR (volatility#94223242 = null)) THEN null ELSE cast(volatility#94223242 as float) END AS volatility#94223290, CASE WHEN ((momentum#94223243 = NA) OR (momentum#94223243 = null)) THEN null ELSE cast(momentum#94223243 as float) END AS momentum#94223291, CASE WHEN ((yield#94223244 = NA) OR (yield#94223244 = null)) THEN null ELSE cast(yield#94223244 as float) END AS yield#94223292, CASE WHEN ((##94223245 = NA) OR (##94223245 = null)) THEN null ELSE cast(##94223245 as float) END AS ##94223293, CASE WHEN (min_date#94223246 = null) THEN null ELSE cast(min_date#94223246 as date) END AS min_date#94223294, CASE WHEN (max_date#94223247 = null) THEN null ELSE cast(max_date#94223247 as date) END AS max_date#94223295]
+- FileScan csv [fractile#94223236,cap#94223237,size#94223238,value#94223239,growth#94223240,leverage#94223241,volatility#94223242,momentum#94223243,yield#94223244,##94223245,min_date#94223246,max_date#94223247] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/tm1/eatm1_score/REPORT..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<fractile:string,cap:string,size:string,value:string,growth:string,leverage:string,volatili...
,None)
(5) Scan csv
Output [12]: [fractile#94223236, cap#94223237, size#94223238, value#94223239, growth#94223240, leverage#94223241, volatility#94223242, momentum#94223243, yield#94223244, ##94223245, min_date#94223246, max_date#94223247]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/tm1/eatm1_score/REPORT_exposures.csv]
ReadSchema: struct<fractile:string,cap:string,size:string,value:string,growth:string,leverage:string,volatility:string,momentum:string,yield:string,#:string,min_date:string,max_date:string>
(6) Project [codegen id : 1]
Output [12]: [CASE WHEN ((fractile#94223236 = NA) OR (fractile#94223236 = null)) THEN null ELSE cast(fractile#94223236 as int) END AS fractile#94223284, CASE WHEN (cap#94223237 = null) THEN null ELSE cap#94223237 END AS cap#94223285, CASE WHEN ((size#94223238 = NA) OR (size#94223238 = null)) THEN null ELSE cast(size#94223238 as float) END AS size#94223286, CASE WHEN ((value#94223239 = NA) OR (value#94223239 = null)) THEN null ELSE cast(value#94223239 as float) END AS value#94223287, CASE WHEN ((growth#94223240 = NA) OR (growth#94223240 = null)) THEN null ELSE cast(growth#94223240 as float) END AS growth#94223288, CASE WHEN ((leverage#94223241 = NA) OR (leverage#94223241 = null)) THEN null ELSE cast(leverage#94223241 as float) END AS leverage#94223289, CASE WHEN ((volatility#94223242 = NA) OR (volatility#94223242 = null)) THEN null ELSE cast(volatility#94223242 as float) END AS volatility#94223290, CASE WHEN ((momentum#94223243 = NA) OR (momentum#94223243 = null)) THEN null ELSE cast(momentum#94223243 as float) END AS momentum#94223291, CASE WHEN ((yield#94223244 = NA) OR (yield#94223244 = null)) THEN null ELSE cast(yield#94223244 as float) END AS yield#94223292, CASE WHEN ((##94223245 = NA) OR (##94223245 = null)) THEN null ELSE cast(##94223245 as float) END AS ##94223293, CASE WHEN (min_date#94223246 = null) THEN null ELSE cast(min_date#94223246 as date) END AS min_date#94223294, CASE WHEN (max_date#94223247 = null) THEN null ELSE cast(max_date#94223247 as date) END AS max_date#94223295]
Input [12]: [fractile#94223236, cap#94223237, size#94223238, value#94223239, growth#94223240, leverage#94223241, volatility#94223242, momentum#94223243, yield#94223244, ##94223245, min_date#94223246, max_date#94223247]
(7) Filter [codegen id : 2]
Input [12]: [##94223293, cap#94223285, fractile#94223284, growth#94223288, leverage#94223289, max_date#94223295, min_date#94223294, momentum#94223291, size#94223286, value#94223287, volatility#94223290, yield#94223292]
Condition : ((isnotnull(fractile#94223284) AND NOT (fractile#94223284 = -1)) AND isnotnull(cap#94223285))
(8) Project [codegen id : 2]
Output [12]: [fractile#94223284, cap#94223285 AS cap_description#94223308, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295]
Input [12]: [##94223293, cap#94223285, fractile#94223284, growth#94223288, leverage#94223289, max_date#94223295, min_date#94223294, momentum#94223291, size#94223286, value#94223287, volatility#94223290, yield#94223292]
(9) InMemoryTableScan
Output [2]: [cap#94160394, description#94160396]
Arguments: [cap#94160394, description#94160396], [isnotnull(cap#94160394), (cap#94160394 = 0), isnotnull(description#94160396)]
(10) InMemoryRelation
Arguments: [cap#94160394, sort#94160395, description#94160396, universe#94160397], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397]
+- FileScan csv [cap#94160377,sort#94160378,description#94160379,universe#94160380] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
,None)
(11) Scan csv
Output [4]: [cap#94160377, sort#94160378, description#94160379, universe#94160380]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv]
ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
(12) Project [codegen id : 1]
Output [4]: [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397]
Input [4]: [cap#94160377, sort#94160378, description#94160379, universe#94160380]
(13) Filter [codegen id : 1]
Input [2]: [cap#94160394, description#94160396]
Condition : ((isnotnull(cap#94160394) AND (cap#94160394 = 0)) AND isnotnull(description#94160396))
(14) Project [codegen id : 1]
Output [1]: [description#94160396]
Input [2]: [cap#94160394, description#94160396]
(15) BroadcastExchange
Input [1]: [description#94160396]
Arguments: HashedRelationBroadcastMode(List(input[0, string, true]),false), [id=#7522816]
(16) BroadcastHashJoin [codegen id : 2]
Left keys [1]: [cap_description#94223308]
Right keys [1]: [description#94160396]
Join condition: None
(17) Project [codegen id : 2]
Output [11]: [fractile#94223284, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295]
Input [13]: [fractile#94223284, cap_description#94223308, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295, description#94160396]
(18) Exchange
Input [11]: [fractile#94223284, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295]
Arguments: rangepartitioning(fractile#94223284 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7522821]
(19) Sort [codegen id : 3]
Input [11]: [fractile#94223284, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295]
Arguments: [fractile#94223284 ASC NULLS FIRST], true, 0
(20) CollectLimit
Input [11]: [fractile#94223284, size#94223286, value#94223287, growth#94223288, leverage#94223289, volatility#94223290, momentum#94223291, yield#94223292, ##94223293, min_date#94223294, max_date#94223295]
Arguments: 1000000