== Physical Plan ==
CollectLimit (21)
+- InMemoryTableScan (1)
      +- InMemoryRelation (2)
            +- * Project (20)
               +- * Sort (19)
                  +- Exchange (18)
                     +- * Project (17)
                        +- * BroadcastHashJoin Inner BuildLeft (16)
                           :- BroadcastExchange (9)
                           :  +- * Filter (8)
                           :     +- * ColumnarToRow (7)
                           :        +- InMemoryTableScan (3)
                           :              +- InMemoryRelation (4)
                           :                    +- * Project (6)
                           :                       +- Scan csv (5)
                           +- * Project (15)
                              +- * Filter (14)
                                 +- InMemoryTableScan (10)
                                       +- InMemoryRelation (11)
                                             +- * Project (13)
                                                +- Scan csv (12)

(1) InMemoryTableScan
Output [3]: [cap#94313606, turnover#94311079, days_hold#94313661]
Arguments: [cap#94313606, turnover#94311079, days_hold#94313661]

(2) InMemoryRelation
Arguments: [cap#94313606, turnover#94311079, days_hold#94313661], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(3) Project [cap#94313606, turnover#94311079, (1.0 / cast(turnover#94311079 as double)) AS days_hold#94313661] +- *(3) Sort [cap_sort#94313544 ASC NULLS FIRST], true, 0 +- Exchange rangepartitioning(cap_sort#94313544 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7530260] +- *(2) Project [turnover#94311079, cap_description#94313543 AS cap#94313606, cap_sort#94313544] +- *(2) BroadcastHashJoin [knownfloatingpointnormalized(normalizenanandzero(cap#94311010))], [knownfloatingpointnormalized(normalizenanandzero(cast(cap#94160394 as float)))], Inner, BuildLeft, false :- BroadcastExchange HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7530252] : +- *(1) Filter isnotnull(cap#94311010) : +- *(1) ColumnarToRow : +- InMemoryTableScan [cap#94311010, turnover#94311079], [isnotnull(cap#94311010)] : +- InMemoryRelation [cap#94311010, retIC#94311011, resretIC#94311012, numcos#94311013, numdates#94311014, annual_bmret#94311019, annual_ret#94311020, std_ret#94311021, Sharpe_ret#94311037, PctPos_ret#94311038, TR_ret#94311039, IR_ret#94311052, 
annual_resret#94311053, std_resret#94311054, Sharpe_resret#94311055, PctPos_resret#94311056, TR_resret#94311057, IR_resret#94311058, annual_retnet#94311059, std_retnet#94311060, Sharpe_retnet#94311072, PctPos_retnet#94311073, TR_retnet#94311074, IR_retnet#94311078, ... 2 more fields], StorageLevel(disk, memory, deserialized, 1 replicas) : +- *(1) Project [CASE WHEN ((cap#94310698 = NA) OR (cap#94310698 = null)) THEN null ELSE cast(cap#94310698 as float) END AS cap#94311010, CASE WHEN ((retIC#94310699 = NA) OR (retIC#94310699 = null)) THEN null ELSE cast(retIC#94310699 as float) END AS retIC#94311011, CASE WHEN ((resretIC#94310700 = NA) OR (resretIC#94310700 = null)) THEN null ELSE cast(resretIC#94310700 as float) END AS resretIC#94311012, CASE WHEN ((numcos#94310701 = NA) OR (numcos#94310701 = null)) THEN null ELSE cast(numcos#94310701 as float) END AS numcos#94311013, CASE WHEN ((numdates#94310702 = NA) OR (numdates#94310702 = null)) THEN null ELSE cast(numdates#94310702 as int) END AS numdates#94311014, CASE WHEN ((annual_bmret#94310703 = NA) OR (annual_bmret#94310703 = null)) THEN null ELSE cast(annual_bmret#94310703 as float) END AS annual_bmret#94311019, CASE WHEN ((annual_ret#94310704 = NA) OR (annual_ret#94310704 = null)) THEN null ELSE cast(annual_ret#94310704 as float) END AS annual_ret#94311020, CASE WHEN ((std_ret#94310705 = NA) OR (std_ret#94310705 = null)) THEN null ELSE cast(std_ret#94310705 as float) END AS std_ret#94311021, CASE WHEN ((Sharpe_ret#94310706 = NA) OR (Sharpe_ret#94310706 = null)) THEN null ELSE cast(Sharpe_ret#94310706 as float) END AS Sharpe_ret#94311037, CASE WHEN ((PctPos_ret#94310707 = NA) OR (PctPos_ret#94310707 = null)) THEN null ELSE cast(PctPos_ret#94310707 as float) END AS PctPos_ret#94311038, CASE WHEN ((TR_ret#94310708 = NA) OR (TR_ret#94310708 = null)) THEN null ELSE cast(TR_ret#94310708 as float) END AS TR_ret#94311039, CASE WHEN ((IR_ret#94310709 = NA) OR (IR_ret#94310709 = null)) THEN null ELSE cast(IR_ret#94310709 as 
float) END AS IR_ret#94311052, CASE WHEN ((annual_resret#94310710 = NA) OR (annual_resret#94310710 = null)) THEN null ELSE cast(annual_resret#94310710 as float) END AS annual_resret#94311053, CASE WHEN ((std_resret#94310711 = NA) OR (std_resret#94310711 = null)) THEN null ELSE cast(std_resret#94310711 as float) END AS std_resret#94311054, CASE WHEN ((Sharpe_resret#94310712 = NA) OR (Sharpe_resret#94310712 = null)) THEN null ELSE cast(Sharpe_resret#94310712 as float) END AS Sharpe_resret#94311055, CASE WHEN ((PctPos_resret#94310713 = NA) OR (PctPos_resret#94310713 = null)) THEN null ELSE cast(PctPos_resret#94310713 as float) END AS PctPos_resret#94311056, CASE WHEN ((TR_resret#94310714 = NA) OR (TR_resret#94310714 = null)) THEN null ELSE cast(TR_resret#94310714 as float) END AS TR_resret#94311057, CASE WHEN ((IR_resret#94310715 = NA) OR (IR_resret#94310715 = null)) THEN null ELSE cast(IR_resret#94310715 as float) END AS IR_resret#94311058, CASE WHEN ((annual_retnet#94310716 = NA) OR (annual_retnet#94310716 = null)) THEN null ELSE cast(annual_retnet#94310716 as float) END AS annual_retnet#94311059, CASE WHEN ((std_retnet#94310717 = NA) OR (std_retnet#94310717 = null)) THEN null ELSE cast(std_retnet#94310717 as float) END AS std_retnet#94311060, CASE WHEN ((Sharpe_retnet#94310718 = NA) OR (Sharpe_retnet#94310718 = null)) THEN null ELSE cast(Sharpe_retnet#94310718 as float) END AS Sharpe_retnet#94311072, CASE WHEN ((PctPos_retnet#94310719 = NA) OR (PctPos_retnet#94310719 = null)) THEN null ELSE cast(PctPos_retnet#94310719 as float) END AS PctPos_retnet#94311073, CASE WHEN ((TR_retnet#94310720 = NA) OR (TR_retnet#94310720 = null)) THEN null ELSE cast(TR_retnet#94310720 as float) END AS TR_retnet#94311074, CASE WHEN ((IR_retnet#94310721 = NA) OR (IR_retnet#94310721 = null)) THEN null ELSE cast(IR_retnet#94310721 as float) END AS IR_retnet#94311078, ... 
2 more fields] : +- FileScan csv [cap#94310698,retIC#94310699,resretIC#94310700,numcos#94310701,numdates#94310702,annual_bmret#94310703,annual_ret#94310704,std_ret#94310705,Sharpe_ret#94310706,PctPos_ret#94310707,TR_ret#94310708,IR_ret#94310709,annual_resret#94310710,std_resret#94310711,Sharpe_resret#94310712,PctPos_resret#94310713,TR_resret#94310714,IR_resret#94310715,annual_retnet#94310716,std_retnet#94310717,Sharpe_retnet#94310718,PctPos_retnet#94310719,TR_retnet#94310720,IR_retnet#94310721,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,... +- *(2) Project [cap#94160394, description#94160396 AS cap_description#94313543, sort#94160395 AS cap_sort#94313544] +- *(2) Filter isnotnull(cap#94160394) +- InMemoryTableScan [cap#94160394, description#94160396, sort#94160395], [isnotnull(cap#94160394)] +- InMemoryRelation [cap#94160394, sort#94160395, description#94160396, universe#94160397], StorageLevel(disk, memory, deserialized, 1 replicas) +- *(1) Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397] +- FileScan csv [cap#94160377,sort#94160378,description#94160379,universe#94160380] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: 
struct<cap:string,sort:string,description:string,universe:string> ,None), [cap_sort#94313544 ASC NULLS FIRST] (3) InMemoryTableScan Output [2]: [cap#94311010, turnover#94311079] Arguments: [cap#94311010, turnover#94311079], [isnotnull(cap#94311010)] (4) InMemoryRelation Arguments: [cap#94311010, retIC#94311011, resretIC#94311012, numcos#94311013, numdates#94311014, annual_bmret#94311019, annual_ret#94311020, std_ret#94311021, Sharpe_ret#94311037, PctPos_ret#94311038, TR_ret#94311039, IR_ret#94311052, annual_resret#94311053, std_resret#94311054, Sharpe_resret#94311055, PctPos_resret#94311056, TR_resret#94311057, IR_resret#94311058, annual_retnet#94311059, std_retnet#94311060, Sharpe_retnet#94311072, PctPos_retnet#94311073, TR_retnet#94311074, IR_retnet#94311078, ... 2 more fields], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94310698 = NA) OR (cap#94310698 = null)) THEN null ELSE cast(cap#94310698 as float) END AS cap#94311010, CASE WHEN ((retIC#94310699 = NA) OR (retIC#94310699 = null)) THEN null ELSE cast(retIC#94310699 as float) END AS retIC#94311011, CASE WHEN ((resretIC#94310700 = NA) OR (resretIC#94310700 = null)) THEN null ELSE cast(resretIC#94310700 as float) END AS resretIC#94311012, CASE WHEN ((numcos#94310701 = NA) OR (numcos#94310701 = null)) THEN null ELSE cast(numcos#94310701 as float) END AS numcos#94311013, CASE WHEN ((numdates#94310702 = NA) OR (numdates#94310702 = null)) THEN null ELSE cast(numdates#94310702 as int) END AS numdates#94311014, CASE WHEN ((annual_bmret#94310703 = NA) OR (annual_bmret#94310703 = null)) THEN null ELSE cast(annual_bmret#94310703 as float) END AS annual_bmret#94311019, CASE WHEN ((annual_ret#94310704 = NA) OR (annual_ret#94310704 = null)) THEN null ELSE cast(annual_ret#94310704 as float) END AS annual_ret#94311020, CASE WHEN ((std_ret#94310705 = NA) OR (std_ret#94310705 = null)) THEN null 
ELSE cast(std_ret#94310705 as float) END AS std_ret#94311021, CASE WHEN ((Sharpe_ret#94310706 = NA) OR (Sharpe_ret#94310706 = null)) THEN null ELSE cast(Sharpe_ret#94310706 as float) END AS Sharpe_ret#94311037, CASE WHEN ((PctPos_ret#94310707 = NA) OR (PctPos_ret#94310707 = null)) THEN null ELSE cast(PctPos_ret#94310707 as float) END AS PctPos_ret#94311038, CASE WHEN ((TR_ret#94310708 = NA) OR (TR_ret#94310708 = null)) THEN null ELSE cast(TR_ret#94310708 as float) END AS TR_ret#94311039, CASE WHEN ((IR_ret#94310709 = NA) OR (IR_ret#94310709 = null)) THEN null ELSE cast(IR_ret#94310709 as float) END AS IR_ret#94311052, CASE WHEN ((annual_resret#94310710 = NA) OR (annual_resret#94310710 = null)) THEN null ELSE cast(annual_resret#94310710 as float) END AS annual_resret#94311053, CASE WHEN ((std_resret#94310711 = NA) OR (std_resret#94310711 = null)) THEN null ELSE cast(std_resret#94310711 as float) END AS std_resret#94311054, CASE WHEN ((Sharpe_resret#94310712 = NA) OR (Sharpe_resret#94310712 = null)) THEN null ELSE cast(Sharpe_resret#94310712 as float) END AS Sharpe_resret#94311055, CASE WHEN ((PctPos_resret#94310713 = NA) OR (PctPos_resret#94310713 = null)) THEN null ELSE cast(PctPos_resret#94310713 as float) END AS PctPos_resret#94311056, CASE WHEN ((TR_resret#94310714 = NA) OR (TR_resret#94310714 = null)) THEN null ELSE cast(TR_resret#94310714 as float) END AS TR_resret#94311057, CASE WHEN ((IR_resret#94310715 = NA) OR (IR_resret#94310715 = null)) THEN null ELSE cast(IR_resret#94310715 as float) END AS IR_resret#94311058, CASE WHEN ((annual_retnet#94310716 = NA) OR (annual_retnet#94310716 = null)) THEN null ELSE cast(annual_retnet#94310716 as float) END AS annual_retnet#94311059, CASE WHEN ((std_retnet#94310717 = NA) OR (std_retnet#94310717 = null)) THEN null ELSE cast(std_retnet#94310717 as float) END AS std_retnet#94311060, CASE WHEN ((Sharpe_retnet#94310718 = NA) OR (Sharpe_retnet#94310718 = null)) THEN null ELSE cast(Sharpe_retnet#94310718 as float) END AS 
Sharpe_retnet#94311072, CASE WHEN ((PctPos_retnet#94310719 = NA) OR (PctPos_retnet#94310719 = null)) THEN null ELSE cast(PctPos_retnet#94310719 as float) END AS PctPos_retnet#94311073, CASE WHEN ((TR_retnet#94310720 = NA) OR (TR_retnet#94310720 = null)) THEN null ELSE cast(TR_retnet#94310720 as float) END AS TR_retnet#94311074, CASE WHEN ((IR_retnet#94310721 = NA) OR (IR_retnet#94310721 = null)) THEN null ELSE cast(IR_retnet#94310721 as float) END AS IR_retnet#94311078, ... 2 more fields] +- FileScan csv [cap#94310698,retIC#94310699,resretIC#94310700,numcos#94310701,numdates#94310702,annual_bmret#94310703,annual_ret#94310704,std_ret#94310705,Sharpe_ret#94310706,PctPos_ret#94310707,TR_ret#94310708,IR_ret#94310709,annual_resret#94310710,std_resret#94310711,Sharpe_resret#94310712,PctPos_resret#94310713,TR_resret#94310714,IR_resret#94310715,annual_retnet#94310716,std_retnet#94310717,Sharpe_retnet#94310718,PctPos_retnet#94310719,TR_retnet#94310720,IR_retnet#94310721,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,... 
,None) (5) Scan csv Output [26]: [cap#94310698, retIC#94310699, resretIC#94310700, numcos#94310701, numdates#94310702, annual_bmret#94310703, annual_ret#94310704, std_ret#94310705, Sharpe_ret#94310706, PctPos_ret#94310707, TR_ret#94310708, IR_ret#94310709, annual_resret#94310710, std_resret#94310711, Sharpe_resret#94310712, PctPos_resret#94310713, TR_resret#94310714, IR_resret#94310715, annual_retnet#94310716, std_retnet#94310717, Sharpe_retnet#94310718, PctPos_retnet#94310719, TR_retnet#94310720, IR_retnet#94310721, turnover#94310722, coverage#94310723] Batched: false Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/estimize_signal_history/estimizesignal/stats_cap.csv] ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,annual_ret:string,std_ret:string,Sharpe_ret:string,PctPos_ret:string,TR_ret:string,IR_ret:string,annual_resret:string,std_resret:string,Sharpe_resret:string,PctPos_resret:string,TR_resret:string,IR_resret:string,annual_retnet:string,std_retnet:string,Sharpe_retnet:string,PctPos_retnet:string,TR_retnet:string,IR_retnet:string,turnover:string,coverage:string> (6) Project [codegen id : 1] Output [26]: [CASE WHEN ((cap#94310698 = NA) OR (cap#94310698 = null)) THEN null ELSE cast(cap#94310698 as float) END AS cap#94311010, CASE WHEN ((retIC#94310699 = NA) OR (retIC#94310699 = null)) THEN null ELSE cast(retIC#94310699 as float) END AS retIC#94311011, CASE WHEN ((resretIC#94310700 = NA) OR (resretIC#94310700 = null)) THEN null ELSE cast(resretIC#94310700 as float) END AS resretIC#94311012, CASE WHEN ((numcos#94310701 = NA) OR (numcos#94310701 = null)) THEN null ELSE cast(numcos#94310701 as float) END AS numcos#94311013, CASE WHEN ((numdates#94310702 = NA) OR (numdates#94310702 = null)) THEN null ELSE cast(numdates#94310702 as int) END AS numdates#94311014, CASE WHEN ((annual_bmret#94310703 = NA) OR (annual_bmret#94310703 = null)) THEN null ELSE cast(annual_bmret#94310703 
as float) END AS annual_bmret#94311019, CASE WHEN ((annual_ret#94310704 = NA) OR (annual_ret#94310704 = null)) THEN null ELSE cast(annual_ret#94310704 as float) END AS annual_ret#94311020, CASE WHEN ((std_ret#94310705 = NA) OR (std_ret#94310705 = null)) THEN null ELSE cast(std_ret#94310705 as float) END AS std_ret#94311021, CASE WHEN ((Sharpe_ret#94310706 = NA) OR (Sharpe_ret#94310706 = null)) THEN null ELSE cast(Sharpe_ret#94310706 as float) END AS Sharpe_ret#94311037, CASE WHEN ((PctPos_ret#94310707 = NA) OR (PctPos_ret#94310707 = null)) THEN null ELSE cast(PctPos_ret#94310707 as float) END AS PctPos_ret#94311038, CASE WHEN ((TR_ret#94310708 = NA) OR (TR_ret#94310708 = null)) THEN null ELSE cast(TR_ret#94310708 as float) END AS TR_ret#94311039, CASE WHEN ((IR_ret#94310709 = NA) OR (IR_ret#94310709 = null)) THEN null ELSE cast(IR_ret#94310709 as float) END AS IR_ret#94311052, CASE WHEN ((annual_resret#94310710 = NA) OR (annual_resret#94310710 = null)) THEN null ELSE cast(annual_resret#94310710 as float) END AS annual_resret#94311053, CASE WHEN ((std_resret#94310711 = NA) OR (std_resret#94310711 = null)) THEN null ELSE cast(std_resret#94310711 as float) END AS std_resret#94311054, CASE WHEN ((Sharpe_resret#94310712 = NA) OR (Sharpe_resret#94310712 = null)) THEN null ELSE cast(Sharpe_resret#94310712 as float) END AS Sharpe_resret#94311055, CASE WHEN ((PctPos_resret#94310713 = NA) OR (PctPos_resret#94310713 = null)) THEN null ELSE cast(PctPos_resret#94310713 as float) END AS PctPos_resret#94311056, CASE WHEN ((TR_resret#94310714 = NA) OR (TR_resret#94310714 = null)) THEN null ELSE cast(TR_resret#94310714 as float) END AS TR_resret#94311057, CASE WHEN ((IR_resret#94310715 = NA) OR (IR_resret#94310715 = null)) THEN null ELSE cast(IR_resret#94310715 as float) END AS IR_resret#94311058, CASE WHEN ((annual_retnet#94310716 = NA) OR (annual_retnet#94310716 = null)) THEN null ELSE cast(annual_retnet#94310716 as float) END AS annual_retnet#94311059, CASE WHEN 
((std_retnet#94310717 = NA) OR (std_retnet#94310717 = null)) THEN null ELSE cast(std_retnet#94310717 as float) END AS std_retnet#94311060, CASE WHEN ((Sharpe_retnet#94310718 = NA) OR (Sharpe_retnet#94310718 = null)) THEN null ELSE cast(Sharpe_retnet#94310718 as float) END AS Sharpe_retnet#94311072, CASE WHEN ((PctPos_retnet#94310719 = NA) OR (PctPos_retnet#94310719 = null)) THEN null ELSE cast(PctPos_retnet#94310719 as float) END AS PctPos_retnet#94311073, CASE WHEN ((TR_retnet#94310720 = NA) OR (TR_retnet#94310720 = null)) THEN null ELSE cast(TR_retnet#94310720 as float) END AS TR_retnet#94311074, CASE WHEN ((IR_retnet#94310721 = NA) OR (IR_retnet#94310721 = null)) THEN null ELSE cast(IR_retnet#94310721 as float) END AS IR_retnet#94311078, CASE WHEN ((turnover#94310722 = NA) OR (turnover#94310722 = null)) THEN null ELSE cast(turnover#94310722 as float) END AS turnover#94311079, CASE WHEN ((coverage#94310723 = NA) OR (coverage#94310723 = null)) THEN null ELSE cast(coverage#94310723 as float) END AS coverage#94311092] Input [26]: [cap#94310698, retIC#94310699, resretIC#94310700, numcos#94310701, numdates#94310702, annual_bmret#94310703, annual_ret#94310704, std_ret#94310705, Sharpe_ret#94310706, PctPos_ret#94310707, TR_ret#94310708, IR_ret#94310709, annual_resret#94310710, std_resret#94310711, Sharpe_resret#94310712, PctPos_resret#94310713, TR_resret#94310714, IR_resret#94310715, annual_retnet#94310716, std_retnet#94310717, Sharpe_retnet#94310718, PctPos_retnet#94310719, TR_retnet#94310720, IR_retnet#94310721, turnover#94310722, coverage#94310723] (7) ColumnarToRow [codegen id : 1] Input [2]: [cap#94311010, turnover#94311079] (8) Filter [codegen id : 1] Input [2]: [cap#94311010, turnover#94311079] Condition : isnotnull(cap#94311010) (9) BroadcastExchange Input [2]: [cap#94311010, turnover#94311079] Arguments: HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7530252] (10) InMemoryTableScan Output 
[3]: [cap#94160394, description#94160396, sort#94160395] Arguments: [cap#94160394, description#94160396, sort#94160395], [isnotnull(cap#94160394)] (11) InMemoryRelation Arguments: [cap#94160394, sort#94160395, description#94160396, universe#94160397], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397] +- FileScan csv [cap#94160377,sort#94160378,description#94160379,universe#94160380] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string> ,None) (12) Scan csv Output [4]: [cap#94160377, sort#94160378, description#94160379, universe#94160380] Batched: false Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv] ReadSchema: struct<cap:string,sort:string,description:string,universe:string> (13) Project [codegen id : 1] Output [4]: [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397] Input 
[4]: [cap#94160377, sort#94160378, description#94160379, universe#94160380]

(14) Filter
Input [3]: [cap#94160394, description#94160396, sort#94160395]
Condition : isnotnull(cap#94160394)

(15) Project
Output [3]: [cap#94160394, description#94160396 AS cap_description#94313543, sort#94160395 AS cap_sort#94313544]
Input [3]: [cap#94160394, description#94160396, sort#94160395]

(16) BroadcastHashJoin [codegen id : 2]
Left keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cap#94311010))]
Right keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cast(cap#94160394 as float)))]
Join condition: None

(17) Project [codegen id : 2]
Output [3]: [turnover#94311079, cap_description#94313543 AS cap#94313606, cap_sort#94313544]
Input [5]: [cap#94311010, turnover#94311079, cap#94160394, cap_description#94313543, cap_sort#94313544]

(18) Exchange
Input [3]: [turnover#94311079, cap#94313606, cap_sort#94313544]
Arguments: rangepartitioning(cap_sort#94313544 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7530260]

(19) Sort [codegen id : 3]
Input [3]: [turnover#94311079, cap#94313606, cap_sort#94313544]
Arguments: [cap_sort#94313544 ASC NULLS FIRST], true, 0

(20) Project [codegen id : 3]
Output [3]: [cap#94313606, turnover#94311079, (1.0 / cast(turnover#94311079 as double)) AS days_hold#94313661]
Input [3]: [turnover#94311079, cap#94313606, cap_sort#94313544]

(21) CollectLimit
Input [3]: [cap#94313606, turnover#94311079, days_hold#94313661]
Arguments: 10000