== Physical Plan == CollectLimit (21) +- InMemoryTableScan (1) +- InMemoryRelation (2) +- * Project (20) +- * Sort (19) +- Exchange (18) +- * Project (17) +- * BroadcastHashJoin Inner BuildLeft (16) :- BroadcastExchange (9) : +- * Filter (8) : +- * ColumnarToRow (7) : +- InMemoryTableScan (3) : +- InMemoryRelation (4) : +- * Project (6) : +- Scan csv (5) +- * Project (15) +- * Filter (14) +- InMemoryTableScan (10) +- InMemoryRelation (11) +- * Project (13) +- Scan csv (12) (1) InMemoryTableScan Output [3]: [cap#94328190, turnover#94325173, days_hold#94328224] Arguments: [cap#94328190, turnover#94325173, days_hold#94328224] (2) InMemoryRelation Arguments: [cap#94328190, turnover#94325173, days_hold#94328224], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(3) Project [cap#94328190, turnover#94325173, (1.0 / cast(turnover#94325173 as double)) AS days_hold#94328224] +- *(3) Sort [cap_sort#94328101 ASC NULLS FIRST], true, 0 +- Exchange rangepartitioning(cap_sort#94328101 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7531410] +- *(2) Project [turnover#94325173, cap_description#94328100 AS cap#94328190, cap_sort#94328101] +- *(2) BroadcastHashJoin [knownfloatingpointnormalized(normalizenanandzero(cap#94324901))], [knownfloatingpointnormalized(normalizenanandzero(cast(cap#94160394 as float)))], Inner, BuildLeft, false :- BroadcastExchange HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7531402] : +- *(1) Filter isnotnull(cap#94324901) : +- *(1) ColumnarToRow : +- InMemoryTableScan [cap#94324901, turnover#94325173], [isnotnull(cap#94324901)] : +- InMemoryRelation [cap#94324901, retIC#94324914, resretIC#94324916, numcos#94324920, numdates#94324922, annual_bmret#94324925, annual_ret#94324984, std_ret#94324987, Sharpe_ret#94324989, PctPos_ret#94325003, TR_ret#94325095, IR_ret#94325159, annual_resret#94325161, std_resret#94325162, Sharpe_resret#94325163, PctPos_resret#94325164, TR_resret#94325165, IR_resret#94325166, annual_retnet#94325167, std_retnet#94325168, Sharpe_retnet#94325169, PctPos_retnet#94325170, TR_retnet#94325171, IR_retnet#94325172, ... 2 more fields], StorageLevel(disk, memory, deserialized, 1 replicas) : +- *(1) Project [CASE WHEN ((cap#94324657 = NA) OR (cap#94324657 = null)) THEN null ELSE cast(cap#94324657 as float) END AS cap#94324901, CASE WHEN ((retIC#94324658 = NA) OR (retIC#94324658 = null)) THEN null ELSE cast(retIC#94324658 as float) END AS retIC#94324914, CASE WHEN ((resretIC#94324659 = NA) OR (resretIC#94324659 = null)) THEN null ELSE cast(resretIC#94324659 as float) END AS resretIC#94324916, CASE WHEN ((numcos#94324660 = NA) OR (numcos#94324660 = null)) THEN null ELSE cast(numcos#94324660 as float) END AS numcos#94324920, CASE WHEN ((numdates#94324661 = NA) OR (numdates#94324661 = null)) THEN null ELSE cast(numdates#94324661 as float) END AS numdates#94324922, CASE WHEN ((annual_bmret#94324662 = NA) OR (annual_bmret#94324662 = null)) THEN null ELSE cast(annual_bmret#94324662 as float) END AS annual_bmret#94324925, CASE WHEN ((annual_ret#94324663 = NA) OR (annual_ret#94324663 = null)) THEN null ELSE cast(annual_ret#94324663 as float) END AS annual_ret#94324984, CASE WHEN ((std_ret#94324664 = NA) OR (std_ret#94324664 = null)) THEN null ELSE cast(std_ret#94324664 as float) END AS std_ret#94324987, CASE WHEN ((Sharpe_ret#94324665 = NA) OR (Sharpe_ret#94324665 = null)) THEN null ELSE cast(Sharpe_ret#94324665 as float) END AS Sharpe_ret#94324989, CASE WHEN ((PctPos_ret#94324666 = NA) OR (PctPos_ret#94324666 = null)) THEN null ELSE cast(PctPos_ret#94324666 as float) END AS PctPos_ret#94325003, CASE WHEN ((TR_ret#94324667 = NA) OR (TR_ret#94324667 = null)) THEN null ELSE cast(TR_ret#94324667 as float) END AS TR_ret#94325095, CASE WHEN ((IR_ret#94324668 = NA) OR (IR_ret#94324668 = null)) THEN null ELSE cast(IR_ret#94324668 as float) END AS IR_ret#94325159, CASE WHEN ((annual_resret#94324669 = NA) OR (annual_resret#94324669 = null)) THEN null ELSE cast(annual_resret#94324669 as float) END AS annual_resret#94325161, CASE WHEN ((std_resret#94324670 = NA) OR (std_resret#94324670 = null)) THEN null ELSE cast(std_resret#94324670 as float) END AS std_resret#94325162, CASE WHEN ((Sharpe_resret#94324671 = NA) OR (Sharpe_resret#94324671 = null)) THEN null ELSE cast(Sharpe_resret#94324671 as float) END AS Sharpe_resret#94325163, CASE WHEN ((PctPos_resret#94324672 = NA) OR (PctPos_resret#94324672 = null)) THEN null ELSE cast(PctPos_resret#94324672 as float) END AS PctPos_resret#94325164, CASE WHEN ((TR_resret#94324673 = NA) OR (TR_resret#94324673 = null)) THEN null ELSE cast(TR_resret#94324673 as float) END AS TR_resret#94325165, CASE WHEN ((IR_resret#94324674 = NA) OR (IR_resret#94324674 = null)) THEN null ELSE cast(IR_resret#94324674 as float) END AS IR_resret#94325166, CASE WHEN ((annual_retnet#94324675 = NA) OR (annual_retnet#94324675 = null)) THEN null ELSE cast(annual_retnet#94324675 as float) END AS annual_retnet#94325167, CASE WHEN ((std_retnet#94324676 = NA) OR (std_retnet#94324676 = null)) THEN null ELSE cast(std_retnet#94324676 as float) END AS std_retnet#94325168, CASE WHEN ((Sharpe_retnet#94324677 = NA) OR (Sharpe_retnet#94324677 = null)) THEN null ELSE cast(Sharpe_retnet#94324677 as float) END AS Sharpe_retnet#94325169, CASE WHEN ((PctPos_retnet#94324678 = NA) OR (PctPos_retnet#94324678 = null)) THEN null ELSE cast(PctPos_retnet#94324678 as float) END AS PctPos_retnet#94325170, CASE WHEN ((TR_retnet#94324679 = NA) OR (TR_retnet#94324679 = null)) THEN null ELSE cast(TR_retnet#94324679 as float) END AS TR_retnet#94325171, CASE WHEN ((IR_retnet#94324680 = NA) OR (IR_retnet#94324680 = null)) THEN null ELSE cast(IR_retnet#94324680 as float) END AS IR_retnet#94325172, ... 2 more fields] : +- FileScan csv [cap#94324657,retIC#94324658,resretIC#94324659,numcos#94324660,numdates#94324661,annual_bmret#94324662,annual_ret#94324663,std_ret#94324664,Sharpe_ret#94324665,PctPos_ret#94324666,TR_ret#94324667,IR_ret#94324668,annual_resret#94324669,std_resret#94324670,Sharpe_resret#94324671,PctPos_resret#94324672,TR_resret#94324673,IR_resret#94324674,annual_retnet#94324675,std_retnet#94324676,Sharpe_retnet#94324677,PctPos_retnet#94324678,TR_retnet#94324679,IR_retnet#94324680,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,... +- *(2) Project [cap#94160394, description#94160396 AS cap_description#94328100, sort#94160395 AS cap_sort#94328101] +- *(2) Filter isnotnull(cap#94160394) +- InMemoryTableScan [cap#94160394, description#94160396, sort#94160395], [isnotnull(cap#94160394)] +- InMemoryRelation [cap#94160394, sort#94160395, description#94160396, universe#94160397], StorageLevel(disk, memory, deserialized, 1 replicas) +- *(1) Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397] +- FileScan csv [cap#94160377,sort#94160378,description#94160379,universe#94160380] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string> ,None), [cap_sort#94328101 ASC NULLS FIRST] (3) InMemoryTableScan Output [2]: [cap#94324901, turnover#94325173] Arguments: [cap#94324901, turnover#94325173], [isnotnull(cap#94324901)] (4) InMemoryRelation Arguments: [cap#94324901, retIC#94324914, resretIC#94324916, numcos#94324920, numdates#94324922, annual_bmret#94324925, annual_ret#94324984, std_ret#94324987, Sharpe_ret#94324989, PctPos_ret#94325003, TR_ret#94325095, IR_ret#94325159, annual_resret#94325161, std_resret#94325162, Sharpe_resret#94325163, PctPos_resret#94325164, TR_resret#94325165, IR_resret#94325166, annual_retnet#94325167, std_retnet#94325168, Sharpe_retnet#94325169, PctPos_retnet#94325170, TR_retnet#94325171, IR_retnet#94325172, ... 2 more fields], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94324657 = NA) OR (cap#94324657 = null)) THEN null ELSE cast(cap#94324657 as float) END AS cap#94324901, CASE WHEN ((retIC#94324658 = NA) OR (retIC#94324658 = null)) THEN null ELSE cast(retIC#94324658 as float) END AS retIC#94324914, CASE WHEN ((resretIC#94324659 = NA) OR (resretIC#94324659 = null)) THEN null ELSE cast(resretIC#94324659 as float) END AS resretIC#94324916, CASE WHEN ((numcos#94324660 = NA) OR (numcos#94324660 = null)) THEN null ELSE cast(numcos#94324660 as float) END AS numcos#94324920, CASE WHEN ((numdates#94324661 = NA) OR (numdates#94324661 = null)) THEN null ELSE cast(numdates#94324661 as float) END AS numdates#94324922, CASE WHEN ((annual_bmret#94324662 = NA) OR (annual_bmret#94324662 = null)) THEN null ELSE cast(annual_bmret#94324662 as float) END AS annual_bmret#94324925, CASE WHEN ((annual_ret#94324663 = NA) OR (annual_ret#94324663 = null)) THEN null ELSE cast(annual_ret#94324663 as float) END AS annual_ret#94324984, CASE WHEN ((std_ret#94324664 = NA) OR (std_ret#94324664 = null)) THEN null ELSE cast(std_ret#94324664 as float) END AS std_ret#94324987, CASE WHEN ((Sharpe_ret#94324665 = NA) OR (Sharpe_ret#94324665 = null)) THEN null ELSE cast(Sharpe_ret#94324665 as float) END AS Sharpe_ret#94324989, CASE WHEN ((PctPos_ret#94324666 = NA) OR (PctPos_ret#94324666 = null)) THEN null ELSE cast(PctPos_ret#94324666 as float) END AS PctPos_ret#94325003, CASE WHEN ((TR_ret#94324667 = NA) OR (TR_ret#94324667 = null)) THEN null ELSE cast(TR_ret#94324667 as float) END AS TR_ret#94325095, CASE WHEN ((IR_ret#94324668 = NA) OR (IR_ret#94324668 = null)) THEN null ELSE cast(IR_ret#94324668 as float) END AS IR_ret#94325159, CASE WHEN ((annual_resret#94324669 = NA) OR (annual_resret#94324669 = null)) THEN null ELSE cast(annual_resret#94324669 as float) END AS annual_resret#94325161, CASE WHEN ((std_resret#94324670 = NA) OR (std_resret#94324670 = null)) THEN null ELSE cast(std_resret#94324670 as float) END AS std_resret#94325162, CASE WHEN ((Sharpe_resret#94324671 = NA) OR (Sharpe_resret#94324671 = null)) THEN null ELSE cast(Sharpe_resret#94324671 as float) END AS Sharpe_resret#94325163, CASE WHEN ((PctPos_resret#94324672 = NA) OR (PctPos_resret#94324672 = null)) THEN null ELSE cast(PctPos_resret#94324672 as float) END AS PctPos_resret#94325164, CASE WHEN ((TR_resret#94324673 = NA) OR (TR_resret#94324673 = null)) THEN null ELSE cast(TR_resret#94324673 as float) END AS TR_resret#94325165, CASE WHEN ((IR_resret#94324674 = NA) OR (IR_resret#94324674 = null)) THEN null ELSE cast(IR_resret#94324674 as float) END AS IR_resret#94325166, CASE WHEN ((annual_retnet#94324675 = NA) OR (annual_retnet#94324675 = null)) THEN null ELSE cast(annual_retnet#94324675 as float) END AS annual_retnet#94325167, CASE WHEN ((std_retnet#94324676 = NA) OR (std_retnet#94324676 = null)) THEN null ELSE cast(std_retnet#94324676 as float) END AS std_retnet#94325168, CASE WHEN ((Sharpe_retnet#94324677 = NA) OR (Sharpe_retnet#94324677 = null)) THEN null ELSE cast(Sharpe_retnet#94324677 as float) END AS Sharpe_retnet#94325169, CASE WHEN ((PctPos_retnet#94324678 = NA) OR (PctPos_retnet#94324678 = null)) THEN null ELSE cast(PctPos_retnet#94324678 as float) END AS PctPos_retnet#94325170, CASE WHEN ((TR_retnet#94324679 = NA) OR (TR_retnet#94324679 = null)) THEN null ELSE cast(TR_retnet#94324679 as float) END AS TR_retnet#94325171, CASE WHEN ((IR_retnet#94324680 = NA) OR (IR_retnet#94324680 = null)) THEN null ELSE cast(IR_retnet#94324680 as float) END AS IR_retnet#94325172, ... 2 more fields] +- FileScan csv [cap#94324657,retIC#94324658,resretIC#94324659,numcos#94324660,numdates#94324661,annual_bmret#94324662,annual_ret#94324663,std_ret#94324664,Sharpe_ret#94324665,PctPos_ret#94324666,TR_ret#94324667,IR_ret#94324668,annual_resret#94324669,std_resret#94324670,Sharpe_resret#94324671,PctPos_resret#94324672,TR_resret#94324673,IR_resret#94324674,annual_retnet#94324675,std_retnet#94324676,Sharpe_retnet#94324677,PctPos_retnet#94324678,TR_retnet#94324679,IR_retnet#94324680,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,... ,None) (5) Scan csv Output [26]: [cap#94324657, retIC#94324658, resretIC#94324659, numcos#94324660, numdates#94324661, annual_bmret#94324662, annual_ret#94324663, std_ret#94324664, Sharpe_ret#94324665, PctPos_ret#94324666, TR_ret#94324667, IR_ret#94324668, annual_resret#94324669, std_resret#94324670, Sharpe_resret#94324671, PctPos_resret#94324672, TR_resret#94324673, IR_resret#94324674, annual_retnet#94324675, std_retnet#94324676, Sharpe_retnet#94324677, PctPos_retnet#94324678, TR_retnet#94324679, IR_retnet#94324680, turnover#94324681, coverage#94324682] Batched: false Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/estimize_signal_history/estimizesignal_postearnings/stats_cap.csv] ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,annual_ret:string,std_ret:string,Sharpe_ret:string,PctPos_ret:string,TR_ret:string,IR_ret:string,annual_resret:string,std_resret:string,Sharpe_resret:string,PctPos_resret:string,TR_resret:string,IR_resret:string,annual_retnet:string,std_retnet:string,Sharpe_retnet:string,PctPos_retnet:string,TR_retnet:string,IR_retnet:string,turnover:string,coverage:string> (6) Project [codegen id : 1] Output [26]: [CASE WHEN ((cap#94324657 = NA) OR (cap#94324657 = null)) THEN null ELSE cast(cap#94324657 as float) END AS cap#94324901, CASE WHEN ((retIC#94324658 = NA) OR (retIC#94324658 = null)) THEN null ELSE cast(retIC#94324658 as float) END AS retIC#94324914, CASE WHEN ((resretIC#94324659 = NA) OR (resretIC#94324659 = null)) THEN null ELSE cast(resretIC#94324659 as float) END AS resretIC#94324916, CASE WHEN ((numcos#94324660 = NA) OR (numcos#94324660 = null)) THEN null ELSE cast(numcos#94324660 as float) END AS numcos#94324920, CASE WHEN ((numdates#94324661 = NA) OR (numdates#94324661 = null)) THEN null ELSE cast(numdates#94324661 as float) END AS numdates#94324922, CASE WHEN ((annual_bmret#94324662 = NA) OR (annual_bmret#94324662 = null)) THEN null ELSE cast(annual_bmret#94324662 as float) END AS annual_bmret#94324925, CASE WHEN ((annual_ret#94324663 = NA) OR (annual_ret#94324663 = null)) THEN null ELSE cast(annual_ret#94324663 as float) END AS annual_ret#94324984, CASE WHEN ((std_ret#94324664 = NA) OR (std_ret#94324664 = null)) THEN null ELSE cast(std_ret#94324664 as float) END AS std_ret#94324987, CASE WHEN ((Sharpe_ret#94324665 = NA) OR (Sharpe_ret#94324665 = null)) THEN null ELSE cast(Sharpe_ret#94324665 as float) END AS Sharpe_ret#94324989, CASE WHEN ((PctPos_ret#94324666 = NA) OR (PctPos_ret#94324666 = null)) THEN null ELSE cast(PctPos_ret#94324666 as float) END AS PctPos_ret#94325003, CASE WHEN ((TR_ret#94324667 = NA) OR (TR_ret#94324667 = null)) THEN null ELSE cast(TR_ret#94324667 as float) END AS TR_ret#94325095, CASE WHEN ((IR_ret#94324668 = NA) OR (IR_ret#94324668 = null)) THEN null ELSE cast(IR_ret#94324668 as float) END AS IR_ret#94325159, CASE WHEN ((annual_resret#94324669 = NA) OR (annual_resret#94324669 = null)) THEN null ELSE cast(annual_resret#94324669 as float) END AS annual_resret#94325161, CASE WHEN ((std_resret#94324670 = NA) OR (std_resret#94324670 = null)) THEN null ELSE cast(std_resret#94324670 as float) END AS std_resret#94325162, CASE WHEN ((Sharpe_resret#94324671 = NA) OR (Sharpe_resret#94324671 = null)) THEN null ELSE cast(Sharpe_resret#94324671 as float) END AS Sharpe_resret#94325163, CASE WHEN ((PctPos_resret#94324672 = NA) OR (PctPos_resret#94324672 = null)) THEN null ELSE cast(PctPos_resret#94324672 as float) END AS PctPos_resret#94325164, CASE WHEN ((TR_resret#94324673 = NA) OR (TR_resret#94324673 = null)) THEN null ELSE cast(TR_resret#94324673 as float) END AS TR_resret#94325165, CASE WHEN ((IR_resret#94324674 = NA) OR (IR_resret#94324674 = null)) THEN null ELSE cast(IR_resret#94324674 as float) END AS IR_resret#94325166, CASE WHEN ((annual_retnet#94324675 = NA) OR (annual_retnet#94324675 = null)) THEN null ELSE cast(annual_retnet#94324675 as float) END AS annual_retnet#94325167, CASE WHEN ((std_retnet#94324676 = NA) OR (std_retnet#94324676 = null)) THEN null ELSE cast(std_retnet#94324676 as float) END AS std_retnet#94325168, CASE WHEN ((Sharpe_retnet#94324677 = NA) OR (Sharpe_retnet#94324677 = null)) THEN null ELSE cast(Sharpe_retnet#94324677 as float) END AS Sharpe_retnet#94325169, CASE WHEN ((PctPos_retnet#94324678 = NA) OR (PctPos_retnet#94324678 = null)) THEN null ELSE cast(PctPos_retnet#94324678 as float) END AS PctPos_retnet#94325170, CASE WHEN ((TR_retnet#94324679 = NA) OR (TR_retnet#94324679 = null)) THEN null ELSE cast(TR_retnet#94324679 as float) END AS TR_retnet#94325171, CASE WHEN ((IR_retnet#94324680 = NA) OR (IR_retnet#94324680 = null)) THEN null ELSE cast(IR_retnet#94324680 as float) END AS IR_retnet#94325172, CASE WHEN ((turnover#94324681 = NA) OR (turnover#94324681 = null)) THEN null ELSE cast(turnover#94324681 as float) END AS turnover#94325173, CASE WHEN ((coverage#94324682 = NA) OR (coverage#94324682 = null)) THEN null ELSE cast(coverage#94324682 as float) END AS coverage#94325174] Input [26]: [cap#94324657, retIC#94324658, resretIC#94324659, numcos#94324660, numdates#94324661, annual_bmret#94324662, annual_ret#94324663, std_ret#94324664, Sharpe_ret#94324665, PctPos_ret#94324666, TR_ret#94324667, IR_ret#94324668, annual_resret#94324669, std_resret#94324670, Sharpe_resret#94324671, PctPos_resret#94324672, TR_resret#94324673, IR_resret#94324674, annual_retnet#94324675, std_retnet#94324676, Sharpe_retnet#94324677, PctPos_retnet#94324678, TR_retnet#94324679, IR_retnet#94324680, turnover#94324681, coverage#94324682] (7) ColumnarToRow [codegen id : 1] Input [2]: [cap#94324901, turnover#94325173] (8) Filter [codegen id : 1] Input [2]: [cap#94324901, turnover#94325173] Condition : isnotnull(cap#94324901) (9) BroadcastExchange Input [2]: [cap#94324901, turnover#94325173] Arguments: HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7531402] (10) InMemoryTableScan Output [3]: [cap#94160394, description#94160396, sort#94160395] Arguments: [cap#94160394, description#94160396, sort#94160395], [isnotnull(cap#94160394)] (11) InMemoryRelation Arguments: [cap#94160394, sort#94160395, description#94160396, universe#94160397], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397] +- FileScan csv [cap#94160377,sort#94160378,description#94160379,universe#94160380] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string> ,None) (12) Scan csv Output [4]: [cap#94160377, sort#94160378, description#94160379, universe#94160380] Batched: false Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv] ReadSchema: struct<cap:string,sort:string,description:string,universe:string> (13) Project [codegen id : 1] Output [4]: [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397] Input [4]: [cap#94160377, sort#94160378, description#94160379, universe#94160380] (14) Filter Input [3]: [cap#94160394, description#94160396, sort#94160395] Condition : isnotnull(cap#94160394) (15) Project Output [3]: [cap#94160394, description#94160396 AS cap_description#94328100, sort#94160395 AS cap_sort#94328101] Input [3]: [cap#94160394, description#94160396, sort#94160395] (16) BroadcastHashJoin [codegen id : 2] Left keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cap#94324901))] Right keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cast(cap#94160394 as float)))] Join condition: None (17) Project [codegen id : 2] Output [3]: [turnover#94325173, cap_description#94328100 AS cap#94328190, cap_sort#94328101] Input [5]: [cap#94324901, turnover#94325173, cap#94160394, cap_description#94328100, cap_sort#94328101] (18) Exchange Input [3]: [turnover#94325173, cap#94328190, cap_sort#94328101] Arguments: rangepartitioning(cap_sort#94328101 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7531410] (19) Sort [codegen id : 3] Input [3]: [turnover#94325173, cap#94328190, cap_sort#94328101] Arguments: [cap_sort#94328101 ASC NULLS FIRST], true, 0 (20) Project [codegen id : 3] Output [3]: [cap#94328190, turnover#94325173, (1.0 / cast(turnover#94325173 as double)) AS days_hold#94328224] Input [3]: [turnover#94325173, cap#94328190, cap_sort#94328101] (21) CollectLimit Input [3]: [cap#94328190, turnover#94325173, days_hold#94328224] Arguments: 1000000