== Physical Plan ==
CollectLimit (21)
+- InMemoryTableScan (1)
      +- InMemoryRelation (2)
            +- * Project (20)
               +- * Sort (19)
                  +- Exchange (18)
                     +- * Project (17)
                        +- * BroadcastHashJoin Inner BuildLeft (16)
                           :- BroadcastExchange (9)
                           :  +- * Filter (8)
                           :     +- * ColumnarToRow (7)
                           :        +- InMemoryTableScan (3)
                           :              +- InMemoryRelation (4)
                           :                    +- * Project (6)
                           :                       +- Scan csv (5)
                           +- * Project (15)
                              +- * Filter (14)
                                 +- InMemoryTableScan (10)
                                       +- InMemoryRelation (11)
                                             +- * Project (13)
                                                +- Scan csv (12)


(1) InMemoryTableScan
Output [3]: [cap#94071090, turnover#94068591, days_hold#94071118]
Arguments: [cap#94071090, turnover#94068591, days_hold#94071118]

(2) InMemoryRelation
Arguments: [cap#94071090, turnover#94068591, days_hold#94071118], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(3) Project [cap#94071090, turnover#94068591, (1.0 / cast(turnover#94068591 as double)) AS days_hold#94071118]
+- *(3) Sort [cap_sort#94071001 ASC NULLS FIRST], true, 0
   +- Exchange rangepartitioning(cap_sort#94071001 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7510876]
      +- *(2) Project [turnover#94068591, cap_description#94071000 AS cap#94071090, cap_sort#94071001]
         +- *(2) BroadcastHashJoin [knownfloatingpointnormalized(normalizenanandzero(cap#94068413))], [knownfloatingpointnormalized(normalizenanandzero(cast(cap#93880528 as float)))], Inner, BuildLeft, false
            :- BroadcastExchange HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7510868]
            :  +- *(1) Filter isnotnull(cap#94068413)
            :     +- *(1) ColumnarToRow
            :        +- InMemoryTableScan [cap#94068413, turnover#94068591], [isnotnull(cap#94068413)]
            :              +- InMemoryRelation [cap#94068413, retIC#94068414, resretIC#94068415, numcos#94068416, numdates#94068417, annual_bmret#94068418, annual_ret#94068419, std_ret#94068420, Sharpe_ret#94068421, PctPos_ret#94068422, TR_ret#94068423, IR_ret#94068424, annual_resret#94068425, std_resret#94068426, Sharpe_resret#94068507, PctPos_resret#94068508, TR_resret#94068520, IR_resret#94068534, annual_retnet#94068547, std_retnet#94068548, Sharpe_retnet#94068549, PctPos_retnet#94068562, TR_retnet#94068563, IR_retnet#94068590, ... 2 more fields], StorageLevel(disk, memory, deserialized, 1 replicas)
            :                    +- *(1) Project [CASE WHEN ((cap#94068252 = NA) OR (cap#94068252 = null)) THEN null ELSE cast(cap#94068252 as float) END AS cap#94068413, CASE WHEN ((retIC#94068253 = NA) OR (retIC#94068253 = null)) THEN null ELSE cast(retIC#94068253 as float) END AS retIC#94068414, CASE WHEN ((resretIC#94068254 = NA) OR (resretIC#94068254 = null)) THEN null ELSE cast(resretIC#94068254 as float) END AS resretIC#94068415, CASE WHEN ((numcos#94068255 = NA) OR (numcos#94068255 = null)) THEN null ELSE cast(numcos#94068255 as float) END AS numcos#94068416, CASE WHEN ((numdates#94068256 = NA) OR (numdates#94068256 = null)) THEN null ELSE cast(numdates#94068256 as int) END AS numdates#94068417, CASE WHEN ((annual_bmret#94068257 = NA) OR (annual_bmret#94068257 = null)) THEN null ELSE cast(annual_bmret#94068257 as float) END AS annual_bmret#94068418, CASE WHEN ((annual_ret#94068258 = NA) OR (annual_ret#94068258 = null)) THEN null ELSE cast(annual_ret#94068258 as float) END AS annual_ret#94068419, CASE WHEN ((std_ret#94068259 = NA) OR (std_ret#94068259 = null)) THEN null ELSE cast(std_ret#94068259 as float) END AS std_ret#94068420, CASE WHEN ((Sharpe_ret#94068260 = NA) OR (Sharpe_ret#94068260 = null)) THEN null ELSE cast(Sharpe_ret#94068260 as float) END AS Sharpe_ret#94068421, CASE WHEN ((PctPos_ret#94068261 = NA) OR (PctPos_ret#94068261 = null)) THEN null ELSE cast(PctPos_ret#94068261 as float) END AS PctPos_ret#94068422, CASE WHEN ((TR_ret#94068262 = NA) OR (TR_ret#94068262 = null)) THEN null ELSE cast(TR_ret#94068262 as float) END AS TR_ret#94068423, CASE WHEN ((IR_ret#94068263 = NA) OR (IR_ret#94068263 = null)) THEN null ELSE cast(IR_ret#94068263 as
float) END AS IR_ret#94068424, CASE WHEN ((annual_resret#94068264 = NA) OR (annual_resret#94068264 = null)) THEN null ELSE cast(annual_resret#94068264 as float) END AS annual_resret#94068425, CASE WHEN ((std_resret#94068265 = NA) OR (std_resret#94068265 = null)) THEN null ELSE cast(std_resret#94068265 as float) END AS std_resret#94068426, CASE WHEN ((Sharpe_resret#94068266 = NA) OR (Sharpe_resret#94068266 = null)) THEN null ELSE cast(Sharpe_resret#94068266 as float) END AS Sharpe_resret#94068507, CASE WHEN ((PctPos_resret#94068267 = NA) OR (PctPos_resret#94068267 = null)) THEN null ELSE cast(PctPos_resret#94068267 as float) END AS PctPos_resret#94068508, CASE WHEN ((TR_resret#94068268 = NA) OR (TR_resret#94068268 = null)) THEN null ELSE cast(TR_resret#94068268 as float) END AS TR_resret#94068520, CASE WHEN ((IR_resret#94068269 = NA) OR (IR_resret#94068269 = null)) THEN null ELSE cast(IR_resret#94068269 as float) END AS IR_resret#94068534, CASE WHEN ((annual_retnet#94068270 = NA) OR (annual_retnet#94068270 = null)) THEN null ELSE cast(annual_retnet#94068270 as float) END AS annual_retnet#94068547, CASE WHEN ((std_retnet#94068271 = NA) OR (std_retnet#94068271 = null)) THEN null ELSE cast(std_retnet#94068271 as float) END AS std_retnet#94068548, CASE WHEN ((Sharpe_retnet#94068272 = NA) OR (Sharpe_retnet#94068272 = null)) THEN null ELSE cast(Sharpe_retnet#94068272 as float) END AS Sharpe_retnet#94068549, CASE WHEN ((PctPos_retnet#94068273 = NA) OR (PctPos_retnet#94068273 = null)) THEN null ELSE cast(PctPos_retnet#94068273 as float) END AS PctPos_retnet#94068562, CASE WHEN ((TR_retnet#94068274 = NA) OR (TR_retnet#94068274 = null)) THEN null ELSE cast(TR_retnet#94068274 as float) END AS TR_retnet#94068563, CASE WHEN ((IR_retnet#94068275 = NA) OR (IR_retnet#94068275 = null)) THEN null ELSE cast(IR_retnet#94068275 as float) END AS IR_retnet#94068590, ... 
2 more fields]
            :                       +- FileScan csv [cap#94068252,retIC#94068253,resretIC#94068254,numcos#94068255,numdates#94068256,annual_bmret#94068257,annual_ret#94068258,std_ret#94068259,Sharpe_ret#94068260,PctPos_ret#94068261,TR_ret#94068262,IR_ret#94068263,annual_resret#94068264,std_resret#94068265,Sharpe_resret#94068266,PctPos_resret#94068267,TR_resret#94068268,IR_resret#94068269,annual_retnet#94068270,std_retnet#94068271,Sharpe_retnet#94068272,PctPos_retnet#94068273,TR_retnet#94068274,IR_retnet#94068275,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/risk_factors/value/sta..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,...
            +- *(2) Project [cap#93880528, description#93880533 AS cap_description#94071000, sort#93880531 AS cap_sort#94071001]
               +- *(2) Filter isnotnull(cap#93880528)
                  +- InMemoryTableScan [cap#93880528, description#93880533, sort#93880531], [isnotnull(cap#93880528)]
                        +- InMemoryRelation [cap#93880528, sort#93880531, description#93880533, universe#93880535], StorageLevel(disk, memory, deserialized, 1 replicas)
                              +- *(1) Project [CASE WHEN ((cap#93880496 = NA) OR (cap#93880496 = null)) THEN null ELSE cast(cap#93880496 as int) END AS cap#93880528, CASE WHEN (sort#93880498 = null) THEN null ELSE sort#93880498 END AS sort#93880531, CASE WHEN (description#93880500 = null) THEN null ELSE description#93880500 END AS description#93880533, CASE WHEN ((universe#93880502 = NA) OR (universe#93880502 = null)) THEN null ELSE cast(universe#93880502 as int) END AS universe#93880535]
                                 +- FileScan csv [cap#93880496,sort#93880498,description#93880500,universe#93880502] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
,None), [cap_sort#94071001 ASC NULLS FIRST]

(3) InMemoryTableScan
Output [2]: [cap#94068413, turnover#94068591]
Arguments: [cap#94068413, turnover#94068591], [isnotnull(cap#94068413)]

(4) InMemoryRelation
Arguments: [cap#94068413, retIC#94068414, resretIC#94068415, numcos#94068416, numdates#94068417, annual_bmret#94068418, annual_ret#94068419, std_ret#94068420, Sharpe_ret#94068421, PctPos_ret#94068422, TR_ret#94068423, IR_ret#94068424, annual_resret#94068425, std_resret#94068426, Sharpe_resret#94068507, PctPos_resret#94068508, TR_resret#94068520, IR_resret#94068534, annual_retnet#94068547, std_retnet#94068548, Sharpe_retnet#94068549, PctPos_retnet#94068562, TR_retnet#94068563, IR_retnet#94068590, ... 2 more fields], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94068252 = NA) OR (cap#94068252 = null)) THEN null ELSE cast(cap#94068252 as float) END AS cap#94068413, CASE WHEN ((retIC#94068253 = NA) OR (retIC#94068253 = null)) THEN null ELSE cast(retIC#94068253 as float) END AS retIC#94068414, CASE WHEN ((resretIC#94068254 = NA) OR (resretIC#94068254 = null)) THEN null ELSE cast(resretIC#94068254 as float) END AS resretIC#94068415, CASE WHEN ((numcos#94068255 = NA) OR (numcos#94068255 = null)) THEN null ELSE cast(numcos#94068255 as float) END AS numcos#94068416, CASE WHEN ((numdates#94068256 = NA) OR (numdates#94068256 = null)) THEN null ELSE cast(numdates#94068256 as int) END AS numdates#94068417, CASE WHEN ((annual_bmret#94068257 = NA) OR (annual_bmret#94068257 = null)) THEN null ELSE cast(annual_bmret#94068257 as float) END AS annual_bmret#94068418, CASE WHEN ((annual_ret#94068258 = NA) OR (annual_ret#94068258 = null)) THEN null ELSE cast(annual_ret#94068258 as float) END AS annual_ret#94068419, CASE WHEN ((std_ret#94068259 = NA) OR (std_ret#94068259 = null)) THEN null
ELSE cast(std_ret#94068259 as float) END AS std_ret#94068420, CASE WHEN ((Sharpe_ret#94068260 = NA) OR (Sharpe_ret#94068260 = null)) THEN null ELSE cast(Sharpe_ret#94068260 as float) END AS Sharpe_ret#94068421, CASE WHEN ((PctPos_ret#94068261 = NA) OR (PctPos_ret#94068261 = null)) THEN null ELSE cast(PctPos_ret#94068261 as float) END AS PctPos_ret#94068422, CASE WHEN ((TR_ret#94068262 = NA) OR (TR_ret#94068262 = null)) THEN null ELSE cast(TR_ret#94068262 as float) END AS TR_ret#94068423, CASE WHEN ((IR_ret#94068263 = NA) OR (IR_ret#94068263 = null)) THEN null ELSE cast(IR_ret#94068263 as float) END AS IR_ret#94068424, CASE WHEN ((annual_resret#94068264 = NA) OR (annual_resret#94068264 = null)) THEN null ELSE cast(annual_resret#94068264 as float) END AS annual_resret#94068425, CASE WHEN ((std_resret#94068265 = NA) OR (std_resret#94068265 = null)) THEN null ELSE cast(std_resret#94068265 as float) END AS std_resret#94068426, CASE WHEN ((Sharpe_resret#94068266 = NA) OR (Sharpe_resret#94068266 = null)) THEN null ELSE cast(Sharpe_resret#94068266 as float) END AS Sharpe_resret#94068507, CASE WHEN ((PctPos_resret#94068267 = NA) OR (PctPos_resret#94068267 = null)) THEN null ELSE cast(PctPos_resret#94068267 as float) END AS PctPos_resret#94068508, CASE WHEN ((TR_resret#94068268 = NA) OR (TR_resret#94068268 = null)) THEN null ELSE cast(TR_resret#94068268 as float) END AS TR_resret#94068520, CASE WHEN ((IR_resret#94068269 = NA) OR (IR_resret#94068269 = null)) THEN null ELSE cast(IR_resret#94068269 as float) END AS IR_resret#94068534, CASE WHEN ((annual_retnet#94068270 = NA) OR (annual_retnet#94068270 = null)) THEN null ELSE cast(annual_retnet#94068270 as float) END AS annual_retnet#94068547, CASE WHEN ((std_retnet#94068271 = NA) OR (std_retnet#94068271 = null)) THEN null ELSE cast(std_retnet#94068271 as float) END AS std_retnet#94068548, CASE WHEN ((Sharpe_retnet#94068272 = NA) OR (Sharpe_retnet#94068272 = null)) THEN null ELSE cast(Sharpe_retnet#94068272 as float) END AS 
Sharpe_retnet#94068549, CASE WHEN ((PctPos_retnet#94068273 = NA) OR (PctPos_retnet#94068273 = null)) THEN null ELSE cast(PctPos_retnet#94068273 as float) END AS PctPos_retnet#94068562, CASE WHEN ((TR_retnet#94068274 = NA) OR (TR_retnet#94068274 = null)) THEN null ELSE cast(TR_retnet#94068274 as float) END AS TR_retnet#94068563, CASE WHEN ((IR_retnet#94068275 = NA) OR (IR_retnet#94068275 = null)) THEN null ELSE cast(IR_retnet#94068275 as float) END AS IR_retnet#94068590, ... 2 more fields]
+- FileScan csv [cap#94068252,retIC#94068253,resretIC#94068254,numcos#94068255,numdates#94068256,annual_bmret#94068257,annual_ret#94068258,std_ret#94068259,Sharpe_ret#94068260,PctPos_ret#94068261,TR_ret#94068262,IR_ret#94068263,annual_resret#94068264,std_resret#94068265,Sharpe_resret#94068266,PctPos_resret#94068267,TR_resret#94068268,IR_resret#94068269,annual_retnet#94068270,std_retnet#94068271,Sharpe_retnet#94068272,PctPos_retnet#94068273,TR_retnet#94068274,IR_retnet#94068275,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/risk_factors/value/sta..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,...
,None)

(5) Scan csv
Output [26]: [cap#94068252, retIC#94068253, resretIC#94068254, numcos#94068255, numdates#94068256, annual_bmret#94068257, annual_ret#94068258, std_ret#94068259, Sharpe_ret#94068260, PctPos_ret#94068261, TR_ret#94068262, IR_ret#94068263, annual_resret#94068264, std_resret#94068265, Sharpe_resret#94068266, PctPos_resret#94068267, TR_resret#94068268, IR_resret#94068269, annual_retnet#94068270, std_retnet#94068271, Sharpe_retnet#94068272, PctPos_retnet#94068273, TR_retnet#94068274, IR_retnet#94068275, turnover#94068276, coverage#94068277]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/risk_factors/value/stats_cap.csv]
ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,annual_ret:string,std_ret:string,Sharpe_ret:string,PctPos_ret:string,TR_ret:string,IR_ret:string,annual_resret:string,std_resret:string,Sharpe_resret:string,PctPos_resret:string,TR_resret:string,IR_resret:string,annual_retnet:string,std_retnet:string,Sharpe_retnet:string,PctPos_retnet:string,TR_retnet:string,IR_retnet:string,turnover:string,coverage:string>

(6) Project [codegen id : 1]
Output [26]: [CASE WHEN ((cap#94068252 = NA) OR (cap#94068252 = null)) THEN null ELSE cast(cap#94068252 as float) END AS cap#94068413, CASE WHEN ((retIC#94068253 = NA) OR (retIC#94068253 = null)) THEN null ELSE cast(retIC#94068253 as float) END AS retIC#94068414, CASE WHEN ((resretIC#94068254 = NA) OR (resretIC#94068254 = null)) THEN null ELSE cast(resretIC#94068254 as float) END AS resretIC#94068415, CASE WHEN ((numcos#94068255 = NA) OR (numcos#94068255 = null)) THEN null ELSE cast(numcos#94068255 as float) END AS numcos#94068416, CASE WHEN ((numdates#94068256 = NA) OR (numdates#94068256 = null)) THEN null ELSE cast(numdates#94068256 as int) END AS numdates#94068417, CASE WHEN ((annual_bmret#94068257 = NA) OR (annual_bmret#94068257 = null)) THEN null ELSE cast(annual_bmret#94068257 as float) END AS
annual_bmret#94068418, CASE WHEN ((annual_ret#94068258 = NA) OR (annual_ret#94068258 = null)) THEN null ELSE cast(annual_ret#94068258 as float) END AS annual_ret#94068419, CASE WHEN ((std_ret#94068259 = NA) OR (std_ret#94068259 = null)) THEN null ELSE cast(std_ret#94068259 as float) END AS std_ret#94068420, CASE WHEN ((Sharpe_ret#94068260 = NA) OR (Sharpe_ret#94068260 = null)) THEN null ELSE cast(Sharpe_ret#94068260 as float) END AS Sharpe_ret#94068421, CASE WHEN ((PctPos_ret#94068261 = NA) OR (PctPos_ret#94068261 = null)) THEN null ELSE cast(PctPos_ret#94068261 as float) END AS PctPos_ret#94068422, CASE WHEN ((TR_ret#94068262 = NA) OR (TR_ret#94068262 = null)) THEN null ELSE cast(TR_ret#94068262 as float) END AS TR_ret#94068423, CASE WHEN ((IR_ret#94068263 = NA) OR (IR_ret#94068263 = null)) THEN null ELSE cast(IR_ret#94068263 as float) END AS IR_ret#94068424, CASE WHEN ((annual_resret#94068264 = NA) OR (annual_resret#94068264 = null)) THEN null ELSE cast(annual_resret#94068264 as float) END AS annual_resret#94068425, CASE WHEN ((std_resret#94068265 = NA) OR (std_resret#94068265 = null)) THEN null ELSE cast(std_resret#94068265 as float) END AS std_resret#94068426, CASE WHEN ((Sharpe_resret#94068266 = NA) OR (Sharpe_resret#94068266 = null)) THEN null ELSE cast(Sharpe_resret#94068266 as float) END AS Sharpe_resret#94068507, CASE WHEN ((PctPos_resret#94068267 = NA) OR (PctPos_resret#94068267 = null)) THEN null ELSE cast(PctPos_resret#94068267 as float) END AS PctPos_resret#94068508, CASE WHEN ((TR_resret#94068268 = NA) OR (TR_resret#94068268 = null)) THEN null ELSE cast(TR_resret#94068268 as float) END AS TR_resret#94068520, CASE WHEN ((IR_resret#94068269 = NA) OR (IR_resret#94068269 = null)) THEN null ELSE cast(IR_resret#94068269 as float) END AS IR_resret#94068534, CASE WHEN ((annual_retnet#94068270 = NA) OR (annual_retnet#94068270 = null)) THEN null ELSE cast(annual_retnet#94068270 as float) END AS annual_retnet#94068547, CASE WHEN ((std_retnet#94068271 = NA) OR 
(std_retnet#94068271 = null)) THEN null ELSE cast(std_retnet#94068271 as float) END AS std_retnet#94068548, CASE WHEN ((Sharpe_retnet#94068272 = NA) OR (Sharpe_retnet#94068272 = null)) THEN null ELSE cast(Sharpe_retnet#94068272 as float) END AS Sharpe_retnet#94068549, CASE WHEN ((PctPos_retnet#94068273 = NA) OR (PctPos_retnet#94068273 = null)) THEN null ELSE cast(PctPos_retnet#94068273 as float) END AS PctPos_retnet#94068562, CASE WHEN ((TR_retnet#94068274 = NA) OR (TR_retnet#94068274 = null)) THEN null ELSE cast(TR_retnet#94068274 as float) END AS TR_retnet#94068563, CASE WHEN ((IR_retnet#94068275 = NA) OR (IR_retnet#94068275 = null)) THEN null ELSE cast(IR_retnet#94068275 as float) END AS IR_retnet#94068590, CASE WHEN ((turnover#94068276 = NA) OR (turnover#94068276 = null)) THEN null ELSE cast(turnover#94068276 as float) END AS turnover#94068591, CASE WHEN ((coverage#94068277 = NA) OR (coverage#94068277 = null)) THEN null ELSE cast(coverage#94068277 as float) END AS coverage#94068652]
Input [26]: [cap#94068252, retIC#94068253, resretIC#94068254, numcos#94068255, numdates#94068256, annual_bmret#94068257, annual_ret#94068258, std_ret#94068259, Sharpe_ret#94068260, PctPos_ret#94068261, TR_ret#94068262, IR_ret#94068263, annual_resret#94068264, std_resret#94068265, Sharpe_resret#94068266, PctPos_resret#94068267, TR_resret#94068268, IR_resret#94068269, annual_retnet#94068270, std_retnet#94068271, Sharpe_retnet#94068272, PctPos_retnet#94068273, TR_retnet#94068274, IR_retnet#94068275, turnover#94068276, coverage#94068277]

(7) ColumnarToRow [codegen id : 1]
Input [2]: [cap#94068413, turnover#94068591]

(8) Filter [codegen id : 1]
Input [2]: [cap#94068413, turnover#94068591]
Condition : isnotnull(cap#94068413)

(9) BroadcastExchange
Input [2]: [cap#94068413, turnover#94068591]
Arguments: HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7510868]

(10) InMemoryTableScan
Output [3]: [cap#93880528, description#93880533, sort#93880531]
Arguments: [cap#93880528, description#93880533, sort#93880531], [isnotnull(cap#93880528)]

(11) InMemoryRelation
Arguments: [cap#93880528, sort#93880531, description#93880533, universe#93880535], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#93880496 = NA) OR (cap#93880496 = null)) THEN null ELSE cast(cap#93880496 as int) END AS cap#93880528, CASE WHEN (sort#93880498 = null) THEN null ELSE sort#93880498 END AS sort#93880531, CASE WHEN (description#93880500 = null) THEN null ELSE description#93880500 END AS description#93880533, CASE WHEN ((universe#93880502 = NA) OR (universe#93880502 = null)) THEN null ELSE cast(universe#93880502 as int) END AS universe#93880535]
+- FileScan csv [cap#93880496,sort#93880498,description#93880500,universe#93880502] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
,None)

(12) Scan csv
Output [4]: [cap#93880496, sort#93880498, description#93880500, universe#93880502]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv]
ReadSchema: struct<cap:string,sort:string,description:string,universe:string>

(13) Project [codegen id : 1]
Output [4]: [CASE WHEN ((cap#93880496 = NA) OR (cap#93880496 = null)) THEN null ELSE cast(cap#93880496 as int) END AS cap#93880528, CASE WHEN (sort#93880498 = null) THEN null ELSE sort#93880498 END AS sort#93880531, CASE WHEN (description#93880500 = null) THEN null ELSE description#93880500 END AS description#93880533, CASE WHEN ((universe#93880502 = NA) OR (universe#93880502 = null)) THEN null ELSE cast(universe#93880502 as int) END AS universe#93880535]
Input [4]: [cap#93880496, sort#93880498, description#93880500, universe#93880502]

(14) Filter
Input [3]: [cap#93880528, description#93880533, sort#93880531]
Condition : isnotnull(cap#93880528)

(15) Project
Output [3]: [cap#93880528, description#93880533 AS cap_description#94071000, sort#93880531 AS cap_sort#94071001]
Input [3]: [cap#93880528, description#93880533, sort#93880531]

(16) BroadcastHashJoin [codegen id : 2]
Left keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cap#94068413))]
Right keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cast(cap#93880528 as float)))]
Join condition: None

(17) Project [codegen id : 2]
Output [3]: [turnover#94068591, cap_description#94071000 AS cap#94071090, cap_sort#94071001]
Input [5]: [cap#94068413, turnover#94068591, cap#93880528, cap_description#94071000, cap_sort#94071001]

(18) Exchange
Input [3]: [turnover#94068591, cap#94071090, cap_sort#94071001]
Arguments: rangepartitioning(cap_sort#94071001 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7510876]

(19) Sort [codegen id : 3]
Input [3]: [turnover#94068591, cap#94071090, cap_sort#94071001]
Arguments: [cap_sort#94071001 ASC NULLS FIRST], true, 0

(20) Project [codegen id : 3]
Output [3]: [cap#94071090, turnover#94068591, (1.0 / cast(turnover#94068591 as double)) AS days_hold#94071118]
Input [3]: [turnover#94068591, cap#94071090, cap_sort#94071001]

(21) CollectLimit
Input [3]: [cap#94071090, turnover#94068591, days_hold#94071118]
Arguments: 1000000