Project [CASE WHEN ((fractile#94117538 = NA) OR (fractile#94117538 = null)) THEN null ELSE cast(fractile#94117538 as int) END AS fractile#94117586, CASE WHEN (cap#94117539 = null) THEN null ELSE cap#94117539 END AS cap#94117587, CASE WHEN ((size#94117540 = NA) OR (size#94117540 = null)) THEN null ELSE cast(size#94117540 as float) END AS size#94117588, CASE WHEN ((value#94117541 = NA) OR (value#94117541 = null)) THEN null ELSE cast(value#94117541 as float) END AS value#94117589, CASE WHEN ((growth#94117542 = NA) OR (growth#94117542 = null)) THEN null ELSE cast(growth#94117542 as float) END AS growth#94117590, CASE WHEN ((leverage#94117543 = NA) OR (leverage#94117543 = null)) THEN null ELSE cast(leverage#94117543 as float) END AS leverage#94117591, CASE WHEN ((volatility#94117544 = NA) OR (volatility#94117544 = null)) THEN null ELSE cast(volatility#94117544 as float) END AS volatility#94117592, CASE WHEN ((momentum#94117545 = NA) OR (momentum#94117545 = null)) THEN null ELSE cast(momentum#94117545 as float) END AS momentum#94117593, CASE WHEN ((yield#94117546 = NA) OR (yield#94117546 = null)) THEN null ELSE cast(yield#94117546 as float) END AS yield#94117594, CASE WHEN ((##94117547 = NA) OR (##94117547 = null)) THEN null ELSE cast(##94117547 as float) END AS ##94117595, CASE WHEN (min_date#94117548 = null) THEN null ELSE cast(min_date#94117548 as date) END AS min_date#94117596, CASE WHEN (max_date#94117549 = null) THEN null ELSE cast(max_date#94117549 as date) END AS max_date#94117597]
Project [CASE WHEN ((cap#93880496 = NA) OR (cap#93880496 = null)) THEN null ELSE cast(cap#93880496 as int) END AS cap#93880528, CASE WHEN (sort#93880498 = null) THEN null ELSE sort#93880498 END AS sort#93880531, CASE WHEN (description#93880500 = null) THEN null ELSE description#93880500 END AS description#93880533, CASE WHEN ((universe#93880502 = NA) OR (universe#93880502 = null)) THEN null ELSE cast(universe#93880502 as int) END AS universe#93880535]
== Physical Plan ==
CollectLimit (20)
+- InMemoryTableScan (1)
+- InMemoryRelation (2)
+- * Sort (19)
+- Exchange (18)
+- * Project (17)
+- * BroadcastHashJoin Inner BuildRight (16)
:- * Project (8)
: +- * Filter (7)
: +- InMemoryTableScan (3)
: +- InMemoryRelation (4)
: +- * Project (6)
: +- Scan csv (5)
+- BroadcastExchange (15)
+- * Project (14)
+- * Filter (13)
+- InMemoryTableScan (9)
+- InMemoryRelation (10)
+- * Project (12)
+- Scan csv (11)
(1) InMemoryTableScan
Output [11]: [fractile#94117586, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597]
Arguments: [fractile#94117586, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597]
(2) InMemoryRelation
Arguments: [fractile#94117586, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(3) Sort [fractile#94117586 ASC NULLS FIRST], true, 0
+- Exchange rangepartitioning(fractile#94117586 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7514548]
+- *(2) Project [fractile#94117586, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597]
+- *(2) BroadcastHashJoin [cap_description#94117610], [description#93880533], Inner, BuildRight, false
:- *(2) Project [fractile#94117586, cap#94117587 AS cap_description#94117610, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597]
: +- *(2) Filter ((isnotnull(fractile#94117586) AND NOT (fractile#94117586 = -1)) AND isnotnull(cap#94117587))
: +- InMemoryTableScan [##94117595, cap#94117587, fractile#94117586, growth#94117590, leverage#94117591, max_date#94117597, min_date#94117596, momentum#94117593, size#94117588, value#94117589, volatility#94117592, yield#94117594], [isnotnull(fractile#94117586), NOT (fractile#94117586 = -1), isnotnull(cap#94117587)]
: +- InMemoryRelation [fractile#94117586, cap#94117587, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597], StorageLevel(disk, memory, deserialized, 1 replicas)
: +- *(1) Project [CASE WHEN ((fractile#94117538 = NA) OR (fractile#94117538 = null)) THEN null ELSE cast(fractile#94117538 as int) END AS fractile#94117586, CASE WHEN (cap#94117539 = null) THEN null ELSE cap#94117539 END AS cap#94117587, CASE WHEN ((size#94117540 = NA) OR (size#94117540 = null)) THEN null ELSE cast(size#94117540 as float) END AS size#94117588, CASE WHEN ((value#94117541 = NA) OR (value#94117541 = null)) THEN null ELSE cast(value#94117541 as float) END AS value#94117589, CASE WHEN ((growth#94117542 = NA) OR (growth#94117542 = null)) THEN null ELSE cast(growth#94117542 as float) END AS growth#94117590, CASE WHEN ((leverage#94117543 = NA) OR (leverage#94117543 = null)) THEN null ELSE cast(leverage#94117543 as float) END AS leverage#94117591, CASE WHEN ((volatility#94117544 = NA) OR (volatility#94117544 = null)) THEN null ELSE cast(volatility#94117544 as float) END AS volatility#94117592, CASE WHEN ((momentum#94117545 = NA) OR (momentum#94117545 = null)) THEN null ELSE cast(momentum#94117545 as float) END AS momentum#94117593, CASE WHEN ((yield#94117546 = NA) OR (yield#94117546 = null)) THEN null ELSE cast(yield#94117546 as float) END AS yield#94117594, CASE WHEN ((##94117547 = NA) OR (##94117547 = null)) THEN null ELSE cast(##94117547 as float) END AS ##94117595, CASE WHEN (min_date#94117548 = null) THEN null ELSE cast(min_date#94117548 as date) END AS min_date#94117596, CASE WHEN (max_date#94117549 = null) THEN null ELSE cast(max_date#94117549 as date) END AS max_date#94117597]
: +- FileScan csv [fractile#94117538,cap#94117539,size#94117540,value#94117541,growth#94117542,leverage#94117543,volatility#94117544,momentum#94117545,yield#94117546,##94117547,min_date#94117548,max_date#94117549] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/tm1/eatm1_score/REPORT..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<fractile:string,cap:string,size:string,value:string,growth:string,leverage:string,volatili...
+- BroadcastExchange HashedRelationBroadcastMode(List(input[0, string, true]),false), [id=#7514543]
+- *(1) Project [description#93880533]
+- *(1) Filter ((isnotnull(cap#93880528) AND (cap#93880528 = 0)) AND isnotnull(description#93880533))
+- InMemoryTableScan [cap#93880528, description#93880533], [isnotnull(cap#93880528), (cap#93880528 = 0), isnotnull(description#93880533)]
+- InMemoryRelation [cap#93880528, sort#93880531, description#93880533, universe#93880535], StorageLevel(disk, memory, deserialized, 1 replicas)
+- *(1) Project [CASE WHEN ((cap#93880496 = NA) OR (cap#93880496 = null)) THEN null ELSE cast(cap#93880496 as int) END AS cap#93880528, CASE WHEN (sort#93880498 = null) THEN null ELSE sort#93880498 END AS sort#93880531, CASE WHEN (description#93880500 = null) THEN null ELSE description#93880500 END AS description#93880533, CASE WHEN ((universe#93880502 = NA) OR (universe#93880502 = null)) THEN null ELSE cast(universe#93880502 as int) END AS universe#93880535]
+- FileScan csv [cap#93880496,sort#93880498,description#93880500,universe#93880502] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
,None), [fractile#94117586 ASC NULLS FIRST]
(3) InMemoryTableScan
Output [12]: [##94117595, cap#94117587, fractile#94117586, growth#94117590, leverage#94117591, max_date#94117597, min_date#94117596, momentum#94117593, size#94117588, value#94117589, volatility#94117592, yield#94117594]
Arguments: [##94117595, cap#94117587, fractile#94117586, growth#94117590, leverage#94117591, max_date#94117597, min_date#94117596, momentum#94117593, size#94117588, value#94117589, volatility#94117592, yield#94117594], [isnotnull(fractile#94117586), NOT (fractile#94117586 = -1), isnotnull(cap#94117587)]
(4) InMemoryRelation
Arguments: [fractile#94117586, cap#94117587, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((fractile#94117538 = NA) OR (fractile#94117538 = null)) THEN null ELSE cast(fractile#94117538 as int) END AS fractile#94117586, CASE WHEN (cap#94117539 = null) THEN null ELSE cap#94117539 END AS cap#94117587, CASE WHEN ((size#94117540 = NA) OR (size#94117540 = null)) THEN null ELSE cast(size#94117540 as float) END AS size#94117588, CASE WHEN ((value#94117541 = NA) OR (value#94117541 = null)) THEN null ELSE cast(value#94117541 as float) END AS value#94117589, CASE WHEN ((growth#94117542 = NA) OR (growth#94117542 = null)) THEN null ELSE cast(growth#94117542 as float) END AS growth#94117590, CASE WHEN ((leverage#94117543 = NA) OR (leverage#94117543 = null)) THEN null ELSE cast(leverage#94117543 as float) END AS leverage#94117591, CASE WHEN ((volatility#94117544 = NA) OR (volatility#94117544 = null)) THEN null ELSE cast(volatility#94117544 as float) END AS volatility#94117592, CASE WHEN ((momentum#94117545 = NA) OR (momentum#94117545 = null)) THEN null ELSE cast(momentum#94117545 as float) END AS momentum#94117593, CASE WHEN ((yield#94117546 = NA) OR (yield#94117546 = null)) THEN null ELSE cast(yield#94117546 as float) END AS yield#94117594, CASE WHEN ((##94117547 = NA) OR (##94117547 = null)) THEN null ELSE cast(##94117547 as float) END AS ##94117595, CASE WHEN (min_date#94117548 = null) THEN null ELSE cast(min_date#94117548 as date) END AS min_date#94117596, CASE WHEN (max_date#94117549 = null) THEN null ELSE cast(max_date#94117549 as date) END AS max_date#94117597]
+- FileScan csv [fractile#94117538,cap#94117539,size#94117540,value#94117541,growth#94117542,leverage#94117543,volatility#94117544,momentum#94117545,yield#94117546,##94117547,min_date#94117548,max_date#94117549] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/tm1/eatm1_score/REPORT..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<fractile:string,cap:string,size:string,value:string,growth:string,leverage:string,volatili...
,None)
(5) Scan csv
Output [12]: [fractile#94117538, cap#94117539, size#94117540, value#94117541, growth#94117542, leverage#94117543, volatility#94117544, momentum#94117545, yield#94117546, ##94117547, min_date#94117548, max_date#94117549]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/tm1/eatm1_score/REPORT_exposures.csv]
ReadSchema: struct<fractile:string,cap:string,size:string,value:string,growth:string,leverage:string,volatility:string,momentum:string,yield:string,#:string,min_date:string,max_date:string>
(6) Project [codegen id : 1]
Output [12]: [CASE WHEN ((fractile#94117538 = NA) OR (fractile#94117538 = null)) THEN null ELSE cast(fractile#94117538 as int) END AS fractile#94117586, CASE WHEN (cap#94117539 = null) THEN null ELSE cap#94117539 END AS cap#94117587, CASE WHEN ((size#94117540 = NA) OR (size#94117540 = null)) THEN null ELSE cast(size#94117540 as float) END AS size#94117588, CASE WHEN ((value#94117541 = NA) OR (value#94117541 = null)) THEN null ELSE cast(value#94117541 as float) END AS value#94117589, CASE WHEN ((growth#94117542 = NA) OR (growth#94117542 = null)) THEN null ELSE cast(growth#94117542 as float) END AS growth#94117590, CASE WHEN ((leverage#94117543 = NA) OR (leverage#94117543 = null)) THEN null ELSE cast(leverage#94117543 as float) END AS leverage#94117591, CASE WHEN ((volatility#94117544 = NA) OR (volatility#94117544 = null)) THEN null ELSE cast(volatility#94117544 as float) END AS volatility#94117592, CASE WHEN ((momentum#94117545 = NA) OR (momentum#94117545 = null)) THEN null ELSE cast(momentum#94117545 as float) END AS momentum#94117593, CASE WHEN ((yield#94117546 = NA) OR (yield#94117546 = null)) THEN null ELSE cast(yield#94117546 as float) END AS yield#94117594, CASE WHEN ((##94117547 = NA) OR (##94117547 = null)) THEN null ELSE cast(##94117547 as float) END AS ##94117595, CASE WHEN (min_date#94117548 = null) THEN null ELSE cast(min_date#94117548 as date) END AS min_date#94117596, CASE WHEN (max_date#94117549 = null) THEN null ELSE cast(max_date#94117549 as date) END AS max_date#94117597]
Input [12]: [fractile#94117538, cap#94117539, size#94117540, value#94117541, growth#94117542, leverage#94117543, volatility#94117544, momentum#94117545, yield#94117546, ##94117547, min_date#94117548, max_date#94117549]
(7) Filter [codegen id : 2]
Input [12]: [##94117595, cap#94117587, fractile#94117586, growth#94117590, leverage#94117591, max_date#94117597, min_date#94117596, momentum#94117593, size#94117588, value#94117589, volatility#94117592, yield#94117594]
Condition : ((isnotnull(fractile#94117586) AND NOT (fractile#94117586 = -1)) AND isnotnull(cap#94117587))
(8) Project [codegen id : 2]
Output [12]: [fractile#94117586, cap#94117587 AS cap_description#94117610, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597]
Input [12]: [##94117595, cap#94117587, fractile#94117586, growth#94117590, leverage#94117591, max_date#94117597, min_date#94117596, momentum#94117593, size#94117588, value#94117589, volatility#94117592, yield#94117594]
(9) InMemoryTableScan
Output [2]: [cap#93880528, description#93880533]
Arguments: [cap#93880528, description#93880533], [isnotnull(cap#93880528), (cap#93880528 = 0), isnotnull(description#93880533)]
(10) InMemoryRelation
Arguments: [cap#93880528, sort#93880531, description#93880533, universe#93880535], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#93880496 = NA) OR (cap#93880496 = null)) THEN null ELSE cast(cap#93880496 as int) END AS cap#93880528, CASE WHEN (sort#93880498 = null) THEN null ELSE sort#93880498 END AS sort#93880531, CASE WHEN (description#93880500 = null) THEN null ELSE description#93880500 END AS description#93880533, CASE WHEN ((universe#93880502 = NA) OR (universe#93880502 = null)) THEN null ELSE cast(universe#93880502 as int) END AS universe#93880535]
+- FileScan csv [cap#93880496,sort#93880498,description#93880500,universe#93880502] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
,None)
(11) Scan csv
Output [4]: [cap#93880496, sort#93880498, description#93880500, universe#93880502]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv]
ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
(12) Project [codegen id : 1]
Output [4]: [CASE WHEN ((cap#93880496 = NA) OR (cap#93880496 = null)) THEN null ELSE cast(cap#93880496 as int) END AS cap#93880528, CASE WHEN (sort#93880498 = null) THEN null ELSE sort#93880498 END AS sort#93880531, CASE WHEN (description#93880500 = null) THEN null ELSE description#93880500 END AS description#93880533, CASE WHEN ((universe#93880502 = NA) OR (universe#93880502 = null)) THEN null ELSE cast(universe#93880502 as int) END AS universe#93880535]
Input [4]: [cap#93880496, sort#93880498, description#93880500, universe#93880502]
(13) Filter [codegen id : 1]
Input [2]: [cap#93880528, description#93880533]
Condition : ((isnotnull(cap#93880528) AND (cap#93880528 = 0)) AND isnotnull(description#93880533))
(14) Project [codegen id : 1]
Output [1]: [description#93880533]
Input [2]: [cap#93880528, description#93880533]
(15) BroadcastExchange
Input [1]: [description#93880533]
Arguments: HashedRelationBroadcastMode(List(input[0, string, true]),false), [id=#7514543]
(16) BroadcastHashJoin [codegen id : 2]
Left keys [1]: [cap_description#94117610]
Right keys [1]: [description#93880533]
Join condition: None
(17) Project [codegen id : 2]
Output [11]: [fractile#94117586, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597]
Input [13]: [fractile#94117586, cap_description#94117610, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597, description#93880533]
(18) Exchange
Input [11]: [fractile#94117586, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597]
Arguments: rangepartitioning(fractile#94117586 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7514548]
(19) Sort [codegen id : 3]
Input [11]: [fractile#94117586, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597]
Arguments: [fractile#94117586 ASC NULLS FIRST], true, 0
(20) CollectLimit
Input [11]: [fractile#94117586, size#94117588, value#94117589, growth#94117590, leverage#94117591, volatility#94117592, momentum#94117593, yield#94117594, ##94117595, min_date#94117596, max_date#94117597]
Arguments: 1000000