== Physical Plan == CollectLimit (21) +- InMemoryTableScan (1) +- InMemoryRelation (2) +- * Project (20) +- * Sort (19) +- Exchange (18) +- * Project (17) +- * BroadcastHashJoin Inner BuildLeft (16) :- BroadcastExchange (9) : +- * Filter (8) : +- * ColumnarToRow (7) : +- InMemoryTableScan (3) : +- InMemoryRelation (4) : +- * Project (6) : +- Scan csv (5) +- * Project (15) +- * Filter (14) +- InMemoryTableScan (10) +- InMemoryRelation (11) +- * Project (13) +- Scan csv (12) (1) InMemoryTableScan Output [3]: [cap#94769042, turnover#94766695, days_hold#94769097] Arguments: [cap#94769042, turnover#94766695, days_hold#94769097] (2) InMemoryRelation Arguments: [cap#94769042, turnover#94766695, days_hold#94769097], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(3) Project [cap#94769042, turnover#94766695, (1.0 / cast(turnover#94766695 as double)) AS days_hold#94769097] +- *(3) Sort [cap_sort#94768980 ASC NULLS FIRST], true, 0 +- Exchange rangepartitioning(cap_sort#94768980 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7566201] +- *(2) Project [turnover#94766695, cap_description#94768979 AS cap#94769042, cap_sort#94768980] +- *(2) BroadcastHashJoin [knownfloatingpointnormalized(normalizenanandzero(cap#94766441))], [knownfloatingpointnormalized(normalizenanandzero(cast(cap#94694039 as float)))], Inner, BuildLeft, false :- BroadcastExchange HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7566193] : +- *(1) Filter isnotnull(cap#94766441) : +- *(1) ColumnarToRow : +- InMemoryTableScan [cap#94766441, turnover#94766695], [isnotnull(cap#94766441)] : +- InMemoryRelation [cap#94766441, retIC#94766445, resretIC#94766450, numcos#94766453, numdates#94766455, annual_bmret#94766461, annual_ret#94766466, std_ret#94766470, Sharpe_ret#94766475, PctPos_ret#94766478, TR_ret#94766481, IR_ret#94766482, annual_resret#94766623, std_resret#94766636, Sharpe_resret#94766637, PctPos_resret#94766649, TR_resret#94766651, IR_resret#94766652, annual_retnet#94766665, std_retnet#94766666, Sharpe_retnet#94766667, PctPos_retnet#94766680, TR_retnet#94766681, IR_retnet#94766682, ... 2 more fields], StorageLevel(disk, memory, deserialized, 1 replicas) : +- *(1) Project [CASE WHEN ((cap#94766234 = NA) OR (cap#94766234 = null)) THEN null ELSE cast(cap#94766234 as float) END AS cap#94766441, CASE WHEN ((retIC#94766235 = NA) OR (retIC#94766235 = null)) THEN null ELSE cast(retIC#94766235 as float) END AS retIC#94766445, CASE WHEN ((resretIC#94766236 = NA) OR (resretIC#94766236 = null)) THEN null ELSE cast(resretIC#94766236 as float) END AS resretIC#94766450, CASE WHEN ((numcos#94766237 = NA) OR (numcos#94766237 = null)) THEN null ELSE cast(numcos#94766237 as float) END AS numcos#94766453, CASE WHEN ((numdates#94766238 = NA) OR (numdates#94766238 = null)) THEN null ELSE cast(numdates#94766238 as int) END AS numdates#94766455, CASE WHEN ((annual_bmret#94766239 = NA) OR (annual_bmret#94766239 = null)) THEN null ELSE cast(annual_bmret#94766239 as float) END AS annual_bmret#94766461, CASE WHEN ((annual_ret#94766240 = NA) OR (annual_ret#94766240 = null)) THEN null ELSE cast(annual_ret#94766240 as float) END AS annual_ret#94766466, CASE WHEN ((std_ret#94766241 = NA) OR (std_ret#94766241 = null)) THEN null ELSE cast(std_ret#94766241 as float) END AS std_ret#94766470, CASE WHEN ((Sharpe_ret#94766242 = NA) OR (Sharpe_ret#94766242 = null)) THEN null ELSE cast(Sharpe_ret#94766242 as float) END AS Sharpe_ret#94766475, CASE WHEN ((PctPos_ret#94766243 = NA) OR (PctPos_ret#94766243 = null)) THEN null ELSE cast(PctPos_ret#94766243 as float) END AS PctPos_ret#94766478, CASE WHEN ((TR_ret#94766244 = NA) OR (TR_ret#94766244 = null)) THEN null ELSE cast(TR_ret#94766244 as float) END AS TR_ret#94766481, CASE WHEN ((IR_ret#94766245 = NA) OR (IR_ret#94766245 = null)) THEN null ELSE cast(IR_ret#94766245 as float) END AS IR_ret#94766482, CASE WHEN ((annual_resret#94766246 = NA) OR (annual_resret#94766246 = null)) THEN null ELSE cast(annual_resret#94766246 as float) END AS annual_resret#94766623, CASE WHEN ((std_resret#94766247 = NA) OR (std_resret#94766247 = null)) THEN null ELSE cast(std_resret#94766247 as float) END AS std_resret#94766636, CASE WHEN ((Sharpe_resret#94766248 = NA) OR (Sharpe_resret#94766248 = null)) THEN null ELSE cast(Sharpe_resret#94766248 as float) END AS Sharpe_resret#94766637, CASE WHEN ((PctPos_resret#94766249 = NA) OR (PctPos_resret#94766249 = null)) THEN null ELSE cast(PctPos_resret#94766249 as float) END AS PctPos_resret#94766649, CASE WHEN ((TR_resret#94766250 = NA) OR (TR_resret#94766250 = null)) THEN null ELSE cast(TR_resret#94766250 as float) END AS TR_resret#94766651, CASE WHEN ((IR_resret#94766251 = NA) OR (IR_resret#94766251 = null)) THEN null ELSE cast(IR_resret#94766251 as float) END AS IR_resret#94766652, CASE WHEN ((annual_retnet#94766252 = NA) OR (annual_retnet#94766252 = null)) THEN null ELSE cast(annual_retnet#94766252 as float) END AS annual_retnet#94766665, CASE WHEN ((std_retnet#94766253 = NA) OR (std_retnet#94766253 = null)) THEN null ELSE cast(std_retnet#94766253 as float) END AS std_retnet#94766666, CASE WHEN ((Sharpe_retnet#94766254 = NA) OR (Sharpe_retnet#94766254 = null)) THEN null ELSE cast(Sharpe_retnet#94766254 as float) END AS Sharpe_retnet#94766667, CASE WHEN ((PctPos_retnet#94766255 = NA) OR (PctPos_retnet#94766255 = null)) THEN null ELSE cast(PctPos_retnet#94766255 as float) END AS PctPos_retnet#94766680, CASE WHEN ((TR_retnet#94766256 = NA) OR (TR_retnet#94766256 = null)) THEN null ELSE cast(TR_retnet#94766256 as float) END AS TR_retnet#94766681, CASE WHEN ((IR_retnet#94766257 = NA) OR (IR_retnet#94766257 = null)) THEN null ELSE cast(IR_retnet#94766257 as float) END AS IR_retnet#94766682, ... 2 more fields] : +- FileScan csv [cap#94766234,retIC#94766235,resretIC#94766236,numcos#94766237,numdates#94766238,annual_bmret#94766239,annual_ret#94766240,std_ret#94766241,Sharpe_ret#94766242,PctPos_ret#94766243,TR_ret#94766244,IR_ret#94766245,annual_resret#94766246,std_resret#94766247,Sharpe_resret#94766248,PctPos_resret#94766249,TR_resret#94766250,IR_resret#94766251,annual_retnet#94766252,std_retnet#94766253,Sharpe_retnet#94766254,PctPos_retnet#94766255,TR_retnet#94766256,IR_retnet#94766257,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/esg_innovation/innovat..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,... +- *(2) Project [cap#94694039, description#94694044 AS cap_description#94768979, sort#94694042 AS cap_sort#94768980] +- *(2) Filter isnotnull(cap#94694039) +- InMemoryTableScan [cap#94694039, description#94694044, sort#94694042], [isnotnull(cap#94694039)] +- InMemoryRelation [cap#94694039, sort#94694042, description#94694044, universe#94694045], StorageLevel(disk, memory, deserialized, 1 replicas) +- *(1) Project [CASE WHEN ((cap#94694007 = NA) OR (cap#94694007 = null)) THEN null ELSE cast(cap#94694007 as int) END AS cap#94694039, CASE WHEN (sort#94694010 = null) THEN null ELSE sort#94694010 END AS sort#94694042, CASE WHEN (description#94694012 = null) THEN null ELSE description#94694012 END AS description#94694044, CASE WHEN ((universe#94694014 = NA) OR (universe#94694014 = null)) THEN null ELSE cast(universe#94694014 as int) END AS universe#94694045] +- FileScan csv [cap#94694007,sort#94694010,description#94694012,universe#94694014] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string> ,None), [cap_sort#94768980 ASC NULLS FIRST] (3) InMemoryTableScan Output [2]: [cap#94766441, turnover#94766695] Arguments: [cap#94766441, turnover#94766695], [isnotnull(cap#94766441)] (4) InMemoryRelation Arguments: [cap#94766441, retIC#94766445, resretIC#94766450, numcos#94766453, numdates#94766455, annual_bmret#94766461, annual_ret#94766466, std_ret#94766470, Sharpe_ret#94766475, PctPos_ret#94766478, TR_ret#94766481, IR_ret#94766482, annual_resret#94766623, std_resret#94766636, Sharpe_resret#94766637, PctPos_resret#94766649, TR_resret#94766651, IR_resret#94766652, annual_retnet#94766665, std_retnet#94766666, Sharpe_retnet#94766667, PctPos_retnet#94766680, TR_retnet#94766681, IR_retnet#94766682, ... 2 more fields], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94766234 = NA) OR (cap#94766234 = null)) THEN null ELSE cast(cap#94766234 as float) END AS cap#94766441, CASE WHEN ((retIC#94766235 = NA) OR (retIC#94766235 = null)) THEN null ELSE cast(retIC#94766235 as float) END AS retIC#94766445, CASE WHEN ((resretIC#94766236 = NA) OR (resretIC#94766236 = null)) THEN null ELSE cast(resretIC#94766236 as float) END AS resretIC#94766450, CASE WHEN ((numcos#94766237 = NA) OR (numcos#94766237 = null)) THEN null ELSE cast(numcos#94766237 as float) END AS numcos#94766453, CASE WHEN ((numdates#94766238 = NA) OR (numdates#94766238 = null)) THEN null ELSE cast(numdates#94766238 as int) END AS numdates#94766455, CASE WHEN ((annual_bmret#94766239 = NA) OR (annual_bmret#94766239 = null)) THEN null ELSE cast(annual_bmret#94766239 as float) END AS annual_bmret#94766461, CASE WHEN ((annual_ret#94766240 = NA) OR (annual_ret#94766240 = null)) THEN null ELSE cast(annual_ret#94766240 as float) END AS annual_ret#94766466, CASE WHEN ((std_ret#94766241 = NA) OR (std_ret#94766241 = null)) THEN null ELSE cast(std_ret#94766241 as float) END AS std_ret#94766470, CASE WHEN ((Sharpe_ret#94766242 = NA) OR (Sharpe_ret#94766242 = null)) THEN null ELSE cast(Sharpe_ret#94766242 as float) END AS Sharpe_ret#94766475, CASE WHEN ((PctPos_ret#94766243 = NA) OR (PctPos_ret#94766243 = null)) THEN null ELSE cast(PctPos_ret#94766243 as float) END AS PctPos_ret#94766478, CASE WHEN ((TR_ret#94766244 = NA) OR (TR_ret#94766244 = null)) THEN null ELSE cast(TR_ret#94766244 as float) END AS TR_ret#94766481, CASE WHEN ((IR_ret#94766245 = NA) OR (IR_ret#94766245 = null)) THEN null ELSE cast(IR_ret#94766245 as float) END AS IR_ret#94766482, CASE WHEN ((annual_resret#94766246 = NA) OR (annual_resret#94766246 = null)) THEN null ELSE cast(annual_resret#94766246 as float) END AS annual_resret#94766623, CASE WHEN ((std_resret#94766247 = NA) OR (std_resret#94766247 = null)) THEN null ELSE cast(std_resret#94766247 as float) END AS std_resret#94766636, CASE WHEN ((Sharpe_resret#94766248 = NA) OR (Sharpe_resret#94766248 = null)) THEN null ELSE cast(Sharpe_resret#94766248 as float) END AS Sharpe_resret#94766637, CASE WHEN ((PctPos_resret#94766249 = NA) OR (PctPos_resret#94766249 = null)) THEN null ELSE cast(PctPos_resret#94766249 as float) END AS PctPos_resret#94766649, CASE WHEN ((TR_resret#94766250 = NA) OR (TR_resret#94766250 = null)) THEN null ELSE cast(TR_resret#94766250 as float) END AS TR_resret#94766651, CASE WHEN ((IR_resret#94766251 = NA) OR (IR_resret#94766251 = null)) THEN null ELSE cast(IR_resret#94766251 as float) END AS IR_resret#94766652, CASE WHEN ((annual_retnet#94766252 = NA) OR (annual_retnet#94766252 = null)) THEN null ELSE cast(annual_retnet#94766252 as float) END AS annual_retnet#94766665, CASE WHEN ((std_retnet#94766253 = NA) OR (std_retnet#94766253 = null)) THEN null ELSE cast(std_retnet#94766253 as float) END AS std_retnet#94766666, CASE WHEN ((Sharpe_retnet#94766254 = NA) OR (Sharpe_retnet#94766254 = null)) THEN null ELSE cast(Sharpe_retnet#94766254 as float) END AS Sharpe_retnet#94766667, CASE WHEN ((PctPos_retnet#94766255 = NA) OR (PctPos_retnet#94766255 = null)) THEN null ELSE cast(PctPos_retnet#94766255 as float) END AS PctPos_retnet#94766680, CASE WHEN ((TR_retnet#94766256 = NA) OR (TR_retnet#94766256 = null)) THEN null ELSE cast(TR_retnet#94766256 as float) END AS TR_retnet#94766681, CASE WHEN ((IR_retnet#94766257 = NA) OR (IR_retnet#94766257 = null)) THEN null ELSE cast(IR_retnet#94766257 as float) END AS IR_retnet#94766682, ... 2 more fields] +- FileScan csv [cap#94766234,retIC#94766235,resretIC#94766236,numcos#94766237,numdates#94766238,annual_bmret#94766239,annual_ret#94766240,std_ret#94766241,Sharpe_ret#94766242,PctPos_ret#94766243,TR_ret#94766244,IR_ret#94766245,annual_resret#94766246,std_resret#94766247,Sharpe_resret#94766248,PctPos_resret#94766249,TR_resret#94766250,IR_resret#94766251,annual_retnet#94766252,std_retnet#94766253,Sharpe_retnet#94766254,PctPos_retnet#94766255,TR_retnet#94766256,IR_retnet#94766257,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/esg_innovation/innovat..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,... ,None) (5) Scan csv Output [26]: [cap#94766234, retIC#94766235, resretIC#94766236, numcos#94766237, numdates#94766238, annual_bmret#94766239, annual_ret#94766240, std_ret#94766241, Sharpe_ret#94766242, PctPos_ret#94766243, TR_ret#94766244, IR_ret#94766245, annual_resret#94766246, std_resret#94766247, Sharpe_resret#94766248, PctPos_resret#94766249, TR_resret#94766250, IR_resret#94766251, annual_retnet#94766252, std_retnet#94766253, Sharpe_retnet#94766254, PctPos_retnet#94766255, TR_retnet#94766256, IR_retnet#94766257, turnover#94766258, coverage#94766259] Batched: false Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/esg_innovation/innovation/stats_cap.csv] ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,annual_ret:string,std_ret:string,Sharpe_ret:string,PctPos_ret:string,TR_ret:string,IR_ret:string,annual_resret:string,std_resret:string,Sharpe_resret:string,PctPos_resret:string,TR_resret:string,IR_resret:string,annual_retnet:string,std_retnet:string,Sharpe_retnet:string,PctPos_retnet:string,TR_retnet:string,IR_retnet:string,turnover:string,coverage:string> (6) Project [codegen id : 1] Output [26]: [CASE WHEN ((cap#94766234 = NA) OR (cap#94766234 = null)) THEN null ELSE cast(cap#94766234 as float) END AS cap#94766441, CASE WHEN ((retIC#94766235 = NA) OR (retIC#94766235 = null)) THEN null ELSE cast(retIC#94766235 as float) END AS retIC#94766445, CASE WHEN ((resretIC#94766236 = NA) OR (resretIC#94766236 = null)) THEN null ELSE cast(resretIC#94766236 as float) END AS resretIC#94766450, CASE WHEN ((numcos#94766237 = NA) OR (numcos#94766237 = null)) THEN null ELSE cast(numcos#94766237 as float) END AS numcos#94766453, CASE WHEN ((numdates#94766238 = NA) OR (numdates#94766238 = null)) THEN null ELSE cast(numdates#94766238 as int) END AS numdates#94766455, CASE WHEN ((annual_bmret#94766239 = NA) OR (annual_bmret#94766239 = null)) THEN null ELSE cast(annual_bmret#94766239 as float) END AS annual_bmret#94766461, CASE WHEN ((annual_ret#94766240 = NA) OR (annual_ret#94766240 = null)) THEN null ELSE cast(annual_ret#94766240 as float) END AS annual_ret#94766466, CASE WHEN ((std_ret#94766241 = NA) OR (std_ret#94766241 = null)) THEN null ELSE cast(std_ret#94766241 as float) END AS std_ret#94766470, CASE WHEN ((Sharpe_ret#94766242 = NA) OR (Sharpe_ret#94766242 = null)) THEN null ELSE cast(Sharpe_ret#94766242 as float) END AS Sharpe_ret#94766475, CASE WHEN ((PctPos_ret#94766243 = NA) OR (PctPos_ret#94766243 = null)) THEN null ELSE cast(PctPos_ret#94766243 as float) END AS PctPos_ret#94766478, CASE WHEN ((TR_ret#94766244 = NA) OR (TR_ret#94766244 = null)) THEN null ELSE cast(TR_ret#94766244 as float) END AS TR_ret#94766481, CASE WHEN ((IR_ret#94766245 = NA) OR (IR_ret#94766245 = null)) THEN null ELSE cast(IR_ret#94766245 as float) END AS IR_ret#94766482, CASE WHEN ((annual_resret#94766246 = NA) OR (annual_resret#94766246 = null)) THEN null ELSE cast(annual_resret#94766246 as float) END AS annual_resret#94766623, CASE WHEN ((std_resret#94766247 = NA) OR (std_resret#94766247 = null)) THEN null ELSE cast(std_resret#94766247 as float) END AS std_resret#94766636, CASE WHEN ((Sharpe_resret#94766248 = NA) OR (Sharpe_resret#94766248 = null)) THEN null ELSE cast(Sharpe_resret#94766248 as float) END AS Sharpe_resret#94766637, CASE WHEN ((PctPos_resret#94766249 = NA) OR (PctPos_resret#94766249 = null)) THEN null ELSE cast(PctPos_resret#94766249 as float) END AS PctPos_resret#94766649, CASE WHEN ((TR_resret#94766250 = NA) OR (TR_resret#94766250 = null)) THEN null ELSE cast(TR_resret#94766250 as float) END AS TR_resret#94766651, CASE WHEN ((IR_resret#94766251 = NA) OR (IR_resret#94766251 = null)) THEN null ELSE cast(IR_resret#94766251 as float) END AS IR_resret#94766652, CASE WHEN ((annual_retnet#94766252 = NA) OR (annual_retnet#94766252 = null)) THEN null ELSE cast(annual_retnet#94766252 as float) END AS annual_retnet#94766665, CASE WHEN ((std_retnet#94766253 = NA) OR (std_retnet#94766253 = null)) THEN null ELSE cast(std_retnet#94766253 as float) END AS std_retnet#94766666, CASE WHEN ((Sharpe_retnet#94766254 = NA) OR (Sharpe_retnet#94766254 = null)) THEN null ELSE cast(Sharpe_retnet#94766254 as float) END AS Sharpe_retnet#94766667, CASE WHEN ((PctPos_retnet#94766255 = NA) OR (PctPos_retnet#94766255 = null)) THEN null ELSE cast(PctPos_retnet#94766255 as float) END AS PctPos_retnet#94766680, CASE WHEN ((TR_retnet#94766256 = NA) OR (TR_retnet#94766256 = null)) THEN null ELSE cast(TR_retnet#94766256 as float) END AS TR_retnet#94766681, CASE WHEN ((IR_retnet#94766257 = NA) OR (IR_retnet#94766257 = null)) THEN null ELSE cast(IR_retnet#94766257 as float) END AS IR_retnet#94766682, CASE WHEN ((turnover#94766258 = NA) OR (turnover#94766258 = null)) THEN null ELSE cast(turnover#94766258 as float) END AS turnover#94766695, CASE WHEN ((coverage#94766259 = NA) OR (coverage#94766259 = null)) THEN null ELSE cast(coverage#94766259 as float) END AS coverage#94766696] Input [26]: [cap#94766234, retIC#94766235, resretIC#94766236, numcos#94766237, numdates#94766238, annual_bmret#94766239, annual_ret#94766240, std_ret#94766241, Sharpe_ret#94766242, PctPos_ret#94766243, TR_ret#94766244, IR_ret#94766245, annual_resret#94766246, std_resret#94766247, Sharpe_resret#94766248, PctPos_resret#94766249, TR_resret#94766250, IR_resret#94766251, annual_retnet#94766252, std_retnet#94766253, Sharpe_retnet#94766254, PctPos_retnet#94766255, TR_retnet#94766256, IR_retnet#94766257, turnover#94766258, coverage#94766259] (7) ColumnarToRow [codegen id : 1] Input [2]: [cap#94766441, turnover#94766695] (8) Filter [codegen id : 1] Input [2]: [cap#94766441, turnover#94766695] Condition : isnotnull(cap#94766441) (9) BroadcastExchange Input [2]: [cap#94766441, turnover#94766695] Arguments: HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7566193] (10) InMemoryTableScan Output [3]: [cap#94694039, description#94694044, sort#94694042] Arguments: [cap#94694039, description#94694044, sort#94694042], [isnotnull(cap#94694039)] (11) InMemoryRelation Arguments: [cap#94694039, sort#94694042, description#94694044, universe#94694045], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94694007 = NA) OR (cap#94694007 = null)) THEN null ELSE cast(cap#94694007 as int) END AS cap#94694039, CASE WHEN (sort#94694010 = null) THEN null ELSE sort#94694010 END AS sort#94694042, CASE WHEN (description#94694012 = null) THEN null ELSE description#94694012 END AS description#94694044, CASE WHEN ((universe#94694014 = NA) OR (universe#94694014 = null)) THEN null ELSE cast(universe#94694014 as int) END AS universe#94694045] +- FileScan csv [cap#94694007,sort#94694010,description#94694012,universe#94694014] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string> ,None) (12) Scan csv Output [4]: [cap#94694007, sort#94694010, description#94694012, universe#94694014] Batched: false Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv] ReadSchema: struct<cap:string,sort:string,description:string,universe:string> (13) Project [codegen id : 1] Output [4]: [CASE WHEN ((cap#94694007 = NA) OR (cap#94694007 = null)) THEN null ELSE cast(cap#94694007 as int) END AS cap#94694039, CASE WHEN (sort#94694010 = null) THEN null ELSE sort#94694010 END AS sort#94694042, CASE WHEN (description#94694012 = null) THEN null ELSE description#94694012 END AS description#94694044, CASE WHEN ((universe#94694014 = NA) OR (universe#94694014 = null)) THEN null ELSE cast(universe#94694014 as int) END AS universe#94694045] Input [4]: [cap#94694007, sort#94694010, description#94694012, universe#94694014] (14) Filter Input [3]: [cap#94694039, description#94694044, sort#94694042] Condition : isnotnull(cap#94694039) (15) Project Output [3]: [cap#94694039, description#94694044 AS cap_description#94768979, sort#94694042 AS cap_sort#94768980] Input [3]: [cap#94694039, description#94694044, sort#94694042] (16) BroadcastHashJoin [codegen id : 2] Left keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cap#94766441))] Right keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cast(cap#94694039 as float)))] Join condition: None (17) Project [codegen id : 2] Output [3]: [turnover#94766695, cap_description#94768979 AS cap#94769042, cap_sort#94768980] Input [5]: [cap#94766441, turnover#94766695, cap#94694039, cap_description#94768979, cap_sort#94768980] (18) Exchange Input [3]: [turnover#94766695, cap#94769042, cap_sort#94768980] Arguments: rangepartitioning(cap_sort#94768980 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7566201] (19) Sort [codegen id : 3] Input [3]: [turnover#94766695, cap#94769042, cap_sort#94768980] Arguments: [cap_sort#94768980 ASC NULLS FIRST], true, 0 (20) Project [codegen id : 3] Output [3]: [cap#94769042, turnover#94766695, (1.0 / cast(turnover#94766695 as double)) AS days_hold#94769097] Input [3]: [turnover#94766695, cap#94769042, cap_sort#94768980] (21) CollectLimit Input [3]: [cap#94769042, turnover#94766695, days_hold#94769097] Arguments: 1000000