== Physical Plan == CollectLimit (21) +- InMemoryTableScan (1) +- InMemoryRelation (2) +- * Project (20) +- * Sort (19) +- Exchange (18) +- * Project (17) +- * BroadcastHashJoin Inner BuildLeft (16) :- BroadcastExchange (9) : +- * Filter (8) : +- * ColumnarToRow (7) : +- InMemoryTableScan (3) : +- InMemoryRelation (4) : +- * Project (6) : +- Scan csv (5) +- * Project (15) +- * Filter (14) +- InMemoryTableScan (10) +- InMemoryRelation (11) +- * Project (13) +- Scan csv (12) (1) InMemoryTableScan Output [3]: [cap#94342537, turnover#94339491, days_hold#94342568] Arguments: [cap#94342537, turnover#94339491, days_hold#94342568] (2) InMemoryRelation Arguments: [cap#94342537, turnover#94339491, days_hold#94342568], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(3) Project [cap#94342537, turnover#94339491, (1.0 / cast(turnover#94339491 as double)) AS days_hold#94342568] +- *(3) Sort [cap_sort#94342448 ASC NULLS FIRST], true, 0 +- Exchange rangepartitioning(cap_sort#94342448 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7532552] +- *(2) Project [turnover#94339491, cap_description#94342447 AS cap#94342537, cap_sort#94342448] +- *(2) BroadcastHashJoin [knownfloatingpointnormalized(normalizenanandzero(cap#94339344))], [knownfloatingpointnormalized(normalizenanandzero(cast(cap#94160394 as float)))], Inner, BuildLeft, false :- BroadcastExchange HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7532544] : +- *(1) Filter isnotnull(cap#94339344) : +- *(1) ColumnarToRow : +- InMemoryTableScan [cap#94339344, turnover#94339491], [isnotnull(cap#94339344)] : +- InMemoryRelation [cap#94339344, retIC#94339345, resretIC#94339346, numcos#94339347, numdates#94339348, annual_bmret#94339349, annual_ret#94339350, std_ret#94339351, Sharpe_ret#94339352, PctPos_ret#94339353, TR_ret#94339354, IR_ret#94339355, annual_resret#94339357, std_resret#94339358, Sharpe_resret#94339426, PctPos_resret#94339427, TR_resret#94339428, IR_resret#94339440, annual_retnet#94339442, std_retnet#94339443, Sharpe_retnet#94339457, PctPos_retnet#94339471, TR_retnet#94339473, IR_retnet#94339489, ... 2 more fields], StorageLevel(disk, memory, deserialized, 1 replicas) : +- *(1) Project [CASE WHEN ((cap#94338977 = NA) OR (cap#94338977 = null)) THEN null ELSE cast(cap#94338977 as float) END AS cap#94339344, CASE WHEN ((retIC#94338978 = NA) OR (retIC#94338978 = null)) THEN null ELSE cast(retIC#94338978 as float) END AS retIC#94339345, CASE WHEN ((resretIC#94338979 = NA) OR (resretIC#94338979 = null)) THEN null ELSE cast(resretIC#94338979 as float) END AS resretIC#94339346, CASE WHEN ((numcos#94338980 = NA) OR (numcos#94338980 = null)) THEN null ELSE cast(numcos#94338980 as float) END AS numcos#94339347, CASE WHEN ((numdates#94338981 = NA) OR (numdates#94338981 = null)) THEN null ELSE cast(numdates#94338981 as int) END AS numdates#94339348, CASE WHEN ((annual_bmret#94338982 = NA) OR (annual_bmret#94338982 = null)) THEN null ELSE cast(annual_bmret#94338982 as float) END AS annual_bmret#94339349, CASE WHEN ((annual_ret#94338983 = NA) OR (annual_ret#94338983 = null)) THEN null ELSE cast(annual_ret#94338983 as float) END AS annual_ret#94339350, CASE WHEN ((std_ret#94338984 = NA) OR (std_ret#94338984 = null)) THEN null ELSE cast(std_ret#94338984 as float) END AS std_ret#94339351, CASE WHEN ((Sharpe_ret#94338985 = NA) OR (Sharpe_ret#94338985 = null)) THEN null ELSE cast(Sharpe_ret#94338985 as float) END AS Sharpe_ret#94339352, CASE WHEN ((PctPos_ret#94338986 = NA) OR (PctPos_ret#94338986 = null)) THEN null ELSE cast(PctPos_ret#94338986 as float) END AS PctPos_ret#94339353, CASE WHEN ((TR_ret#94338987 = NA) OR (TR_ret#94338987 = null)) THEN null ELSE cast(TR_ret#94338987 as float) END AS TR_ret#94339354, CASE WHEN ((IR_ret#94338988 = NA) OR (IR_ret#94338988 = null)) THEN null ELSE cast(IR_ret#94338988 as float) END AS IR_ret#94339355, CASE WHEN ((annual_resret#94338989 = NA) OR (annual_resret#94338989 = null)) THEN null ELSE cast(annual_resret#94338989 as float) END AS annual_resret#94339357, CASE WHEN ((std_resret#94338990 = NA) OR (std_resret#94338990 = null)) THEN null ELSE cast(std_resret#94338990 as float) END AS std_resret#94339358, CASE WHEN ((Sharpe_resret#94338991 = NA) OR (Sharpe_resret#94338991 = null)) THEN null ELSE cast(Sharpe_resret#94338991 as float) END AS Sharpe_resret#94339426, CASE WHEN ((PctPos_resret#94338992 = NA) OR (PctPos_resret#94338992 = null)) THEN null ELSE cast(PctPos_resret#94338992 as float) END AS PctPos_resret#94339427, CASE WHEN ((TR_resret#94338993 = NA) OR (TR_resret#94338993 = null)) THEN null ELSE cast(TR_resret#94338993 as float) END AS TR_resret#94339428, CASE WHEN ((IR_resret#94338994 = NA) OR (IR_resret#94338994 = null)) THEN null ELSE cast(IR_resret#94338994 as float) END AS IR_resret#94339440, CASE WHEN ((annual_retnet#94338995 = NA) OR (annual_retnet#94338995 = null)) THEN null ELSE cast(annual_retnet#94338995 as float) END AS annual_retnet#94339442, CASE WHEN ((std_retnet#94338996 = NA) OR (std_retnet#94338996 = null)) THEN null ELSE cast(std_retnet#94338996 as float) END AS std_retnet#94339443, CASE WHEN ((Sharpe_retnet#94338997 = NA) OR (Sharpe_retnet#94338997 = null)) THEN null ELSE cast(Sharpe_retnet#94338997 as float) END AS Sharpe_retnet#94339457, CASE WHEN ((PctPos_retnet#94338998 = NA) OR (PctPos_retnet#94338998 = null)) THEN null ELSE cast(PctPos_retnet#94338998 as float) END AS PctPos_retnet#94339471, CASE WHEN ((TR_retnet#94338999 = NA) OR (TR_retnet#94338999 = null)) THEN null ELSE cast(TR_retnet#94338999 as float) END AS TR_retnet#94339473, CASE WHEN ((IR_retnet#94339000 = NA) OR (IR_retnet#94339000 = null)) THEN null ELSE cast(IR_retnet#94339000 as float) END AS IR_retnet#94339489, ... 2 more fields] : +- FileScan csv [cap#94338977,retIC#94338978,resretIC#94338979,numcos#94338980,numdates#94338981,annual_bmret#94338982,annual_ret#94338983,std_ret#94338984,Sharpe_ret#94338985,PctPos_ret#94338986,TR_ret#94338987,IR_ret#94338988,annual_resret#94338989,std_resret#94338990,Sharpe_resret#94338991,PctPos_resret#94338992,TR_resret#94338993,IR_resret#94338994,annual_retnet#94338995,std_retnet#94338996,Sharpe_retnet#94338997,PctPos_retnet#94338998,TR_retnet#94338999,IR_retnet#94339000,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,... +- *(2) Project [cap#94160394, description#94160396 AS cap_description#94342447, sort#94160395 AS cap_sort#94342448] +- *(2) Filter isnotnull(cap#94160394) +- InMemoryTableScan [cap#94160394, description#94160396, sort#94160395], [isnotnull(cap#94160394)] +- InMemoryRelation [cap#94160394, sort#94160395, description#94160396, universe#94160397], StorageLevel(disk, memory, deserialized, 1 replicas) +- *(1) Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397] +- FileScan csv [cap#94160377,sort#94160378,description#94160379,universe#94160380] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string> ,None), [cap_sort#94342448 ASC NULLS FIRST] (3) InMemoryTableScan Output [2]: [cap#94339344, turnover#94339491] Arguments: [cap#94339344, turnover#94339491], [isnotnull(cap#94339344)] (4) InMemoryRelation Arguments: [cap#94339344, retIC#94339345, resretIC#94339346, numcos#94339347, numdates#94339348, annual_bmret#94339349, annual_ret#94339350, std_ret#94339351, Sharpe_ret#94339352, PctPos_ret#94339353, TR_ret#94339354, IR_ret#94339355, annual_resret#94339357, std_resret#94339358, Sharpe_resret#94339426, PctPos_resret#94339427, TR_resret#94339428, IR_resret#94339440, annual_retnet#94339442, std_retnet#94339443, Sharpe_retnet#94339457, PctPos_retnet#94339471, TR_retnet#94339473, IR_retnet#94339489, ... 2 more fields], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94338977 = NA) OR (cap#94338977 = null)) THEN null ELSE cast(cap#94338977 as float) END AS cap#94339344, CASE WHEN ((retIC#94338978 = NA) OR (retIC#94338978 = null)) THEN null ELSE cast(retIC#94338978 as float) END AS retIC#94339345, CASE WHEN ((resretIC#94338979 = NA) OR (resretIC#94338979 = null)) THEN null ELSE cast(resretIC#94338979 as float) END AS resretIC#94339346, CASE WHEN ((numcos#94338980 = NA) OR (numcos#94338980 = null)) THEN null ELSE cast(numcos#94338980 as float) END AS numcos#94339347, CASE WHEN ((numdates#94338981 = NA) OR (numdates#94338981 = null)) THEN null ELSE cast(numdates#94338981 as int) END AS numdates#94339348, CASE WHEN ((annual_bmret#94338982 = NA) OR (annual_bmret#94338982 = null)) THEN null ELSE cast(annual_bmret#94338982 as float) END AS annual_bmret#94339349, CASE WHEN ((annual_ret#94338983 = NA) OR (annual_ret#94338983 = null)) THEN null ELSE cast(annual_ret#94338983 as float) END AS annual_ret#94339350, CASE WHEN ((std_ret#94338984 = NA) OR (std_ret#94338984 = null)) THEN null ELSE cast(std_ret#94338984 as float) END AS std_ret#94339351, CASE WHEN ((Sharpe_ret#94338985 = NA) OR (Sharpe_ret#94338985 = null)) THEN null ELSE cast(Sharpe_ret#94338985 as float) END AS Sharpe_ret#94339352, CASE WHEN ((PctPos_ret#94338986 = NA) OR (PctPos_ret#94338986 = null)) THEN null ELSE cast(PctPos_ret#94338986 as float) END AS PctPos_ret#94339353, CASE WHEN ((TR_ret#94338987 = NA) OR (TR_ret#94338987 = null)) THEN null ELSE cast(TR_ret#94338987 as float) END AS TR_ret#94339354, CASE WHEN ((IR_ret#94338988 = NA) OR (IR_ret#94338988 = null)) THEN null ELSE cast(IR_ret#94338988 as float) END AS IR_ret#94339355, CASE WHEN ((annual_resret#94338989 = NA) OR (annual_resret#94338989 = null)) THEN null ELSE cast(annual_resret#94338989 as float) END AS annual_resret#94339357, CASE WHEN ((std_resret#94338990 = NA) OR (std_resret#94338990 = null)) THEN null ELSE cast(std_resret#94338990 as float) END AS std_resret#94339358, CASE WHEN ((Sharpe_resret#94338991 = NA) OR (Sharpe_resret#94338991 = null)) THEN null ELSE cast(Sharpe_resret#94338991 as float) END AS Sharpe_resret#94339426, CASE WHEN ((PctPos_resret#94338992 = NA) OR (PctPos_resret#94338992 = null)) THEN null ELSE cast(PctPos_resret#94338992 as float) END AS PctPos_resret#94339427, CASE WHEN ((TR_resret#94338993 = NA) OR (TR_resret#94338993 = null)) THEN null ELSE cast(TR_resret#94338993 as float) END AS TR_resret#94339428, CASE WHEN ((IR_resret#94338994 = NA) OR (IR_resret#94338994 = null)) THEN null ELSE cast(IR_resret#94338994 as float) END AS IR_resret#94339440, CASE WHEN ((annual_retnet#94338995 = NA) OR (annual_retnet#94338995 = null)) THEN null ELSE cast(annual_retnet#94338995 as float) END AS annual_retnet#94339442, CASE WHEN ((std_retnet#94338996 = NA) OR (std_retnet#94338996 = null)) THEN null ELSE cast(std_retnet#94338996 as float) END AS std_retnet#94339443, CASE WHEN ((Sharpe_retnet#94338997 = NA) OR (Sharpe_retnet#94338997 = null)) THEN null ELSE cast(Sharpe_retnet#94338997 as float) END AS Sharpe_retnet#94339457, CASE WHEN ((PctPos_retnet#94338998 = NA) OR (PctPos_retnet#94338998 = null)) THEN null ELSE cast(PctPos_retnet#94338998 as float) END AS PctPos_retnet#94339471, CASE WHEN ((TR_retnet#94338999 = NA) OR (TR_retnet#94338999 = null)) THEN null ELSE cast(TR_retnet#94338999 as float) END AS TR_retnet#94339473, CASE WHEN ((IR_retnet#94339000 = NA) OR (IR_retnet#94339000 = null)) THEN null ELSE cast(IR_retnet#94339000 as float) END AS IR_retnet#94339489, ... 2 more fields] +- FileScan csv [cap#94338977,retIC#94338978,resretIC#94338979,numcos#94338980,numdates#94338981,annual_bmret#94338982,annual_ret#94338983,std_ret#94338984,Sharpe_ret#94338985,PctPos_ret#94338986,TR_ret#94338987,IR_ret#94338988,annual_resret#94338989,std_resret#94338990,Sharpe_resret#94338991,PctPos_resret#94338992,TR_resret#94338993,IR_resret#94338994,annual_retnet#94338995,std_retnet#94338996,Sharpe_retnet#94338997,PctPos_retnet#94338998,TR_retnet#94338999,IR_retnet#94339000,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,... ,None) (5) Scan csv Output [26]: [cap#94338977, retIC#94338978, resretIC#94338979, numcos#94338980, numdates#94338981, annual_bmret#94338982, annual_ret#94338983, std_ret#94338984, Sharpe_ret#94338985, PctPos_ret#94338986, TR_ret#94338987, IR_ret#94338988, annual_resret#94338989, std_resret#94338990, Sharpe_resret#94338991, PctPos_resret#94338992, TR_resret#94338993, IR_resret#94338994, annual_retnet#94338995, std_retnet#94338996, Sharpe_retnet#94338997, PctPos_retnet#94338998, TR_retnet#94338999, IR_retnet#94339000, turnover#94339001, coverage#94339002] Batched: false Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/estimize_signal_history/estimizesignal_preearnings/stats_cap.csv] ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,annual_ret:string,std_ret:string,Sharpe_ret:string,PctPos_ret:string,TR_ret:string,IR_ret:string,annual_resret:string,std_resret:string,Sharpe_resret:string,PctPos_resret:string,TR_resret:string,IR_resret:string,annual_retnet:string,std_retnet:string,Sharpe_retnet:string,PctPos_retnet:string,TR_retnet:string,IR_retnet:string,turnover:string,coverage:string> (6) Project [codegen id : 1] Output [26]: [CASE WHEN ((cap#94338977 = NA) OR (cap#94338977 = null)) THEN null ELSE cast(cap#94338977 as float) END AS cap#94339344, CASE WHEN ((retIC#94338978 = NA) OR (retIC#94338978 = null)) THEN null ELSE cast(retIC#94338978 as float) END AS retIC#94339345, CASE WHEN ((resretIC#94338979 = NA) OR (resretIC#94338979 = null)) THEN null ELSE cast(resretIC#94338979 as float) END AS resretIC#94339346, CASE WHEN ((numcos#94338980 = NA) OR (numcos#94338980 = null)) THEN null ELSE cast(numcos#94338980 as float) END AS numcos#94339347, CASE WHEN ((numdates#94338981 = NA) OR (numdates#94338981 = null)) THEN null ELSE cast(numdates#94338981 as int) END AS numdates#94339348, CASE WHEN ((annual_bmret#94338982 = NA) OR (annual_bmret#94338982 = null)) THEN null ELSE cast(annual_bmret#94338982 as float) END AS annual_bmret#94339349, CASE WHEN ((annual_ret#94338983 = NA) OR (annual_ret#94338983 = null)) THEN null ELSE cast(annual_ret#94338983 as float) END AS annual_ret#94339350, CASE WHEN ((std_ret#94338984 = NA) OR (std_ret#94338984 = null)) THEN null ELSE cast(std_ret#94338984 as float) END AS std_ret#94339351, CASE WHEN ((Sharpe_ret#94338985 = NA) OR (Sharpe_ret#94338985 = null)) THEN null ELSE cast(Sharpe_ret#94338985 as float) END AS Sharpe_ret#94339352, CASE WHEN ((PctPos_ret#94338986 = NA) OR (PctPos_ret#94338986 = null)) THEN null ELSE cast(PctPos_ret#94338986 as float) END AS PctPos_ret#94339353, CASE WHEN ((TR_ret#94338987 = NA) OR (TR_ret#94338987 = null)) THEN null ELSE cast(TR_ret#94338987 as float) END AS TR_ret#94339354, CASE WHEN ((IR_ret#94338988 = NA) OR (IR_ret#94338988 = null)) THEN null ELSE cast(IR_ret#94338988 as float) END AS IR_ret#94339355, CASE WHEN ((annual_resret#94338989 = NA) OR (annual_resret#94338989 = null)) THEN null ELSE cast(annual_resret#94338989 as float) END AS annual_resret#94339357, CASE WHEN ((std_resret#94338990 = NA) OR (std_resret#94338990 = null)) THEN null ELSE cast(std_resret#94338990 as float) END AS std_resret#94339358, CASE WHEN ((Sharpe_resret#94338991 = NA) OR (Sharpe_resret#94338991 = null)) THEN null ELSE cast(Sharpe_resret#94338991 as float) END AS Sharpe_resret#94339426, CASE WHEN ((PctPos_resret#94338992 = NA) OR (PctPos_resret#94338992 = null)) THEN null ELSE cast(PctPos_resret#94338992 as float) END AS PctPos_resret#94339427, CASE WHEN ((TR_resret#94338993 = NA) OR (TR_resret#94338993 = null)) THEN null ELSE cast(TR_resret#94338993 as float) END AS TR_resret#94339428, CASE WHEN ((IR_resret#94338994 = NA) OR (IR_resret#94338994 = null)) THEN null ELSE cast(IR_resret#94338994 as float) END AS IR_resret#94339440, CASE WHEN ((annual_retnet#94338995 = NA) OR (annual_retnet#94338995 = null)) THEN null ELSE cast(annual_retnet#94338995 as float) END AS annual_retnet#94339442, CASE WHEN ((std_retnet#94338996 = NA) OR (std_retnet#94338996 = null)) THEN null ELSE cast(std_retnet#94338996 as float) END AS std_retnet#94339443, CASE WHEN ((Sharpe_retnet#94338997 = NA) OR (Sharpe_retnet#94338997 = null)) THEN null ELSE cast(Sharpe_retnet#94338997 as float) END AS Sharpe_retnet#94339457, CASE WHEN ((PctPos_retnet#94338998 = NA) OR (PctPos_retnet#94338998 = null)) THEN null ELSE cast(PctPos_retnet#94338998 as float) END AS PctPos_retnet#94339471, CASE WHEN ((TR_retnet#94338999 = NA) OR (TR_retnet#94338999 = null)) THEN null ELSE cast(TR_retnet#94338999 as float) END AS TR_retnet#94339473, CASE WHEN ((IR_retnet#94339000 = NA) OR (IR_retnet#94339000 = null)) THEN null ELSE cast(IR_retnet#94339000 as float) END AS IR_retnet#94339489, CASE WHEN ((turnover#94339001 = NA) OR (turnover#94339001 = null)) THEN null ELSE cast(turnover#94339001 as float) END AS turnover#94339491, CASE WHEN ((coverage#94339002 = NA) OR (coverage#94339002 = null)) THEN null ELSE cast(coverage#94339002 as float) END AS coverage#94339505] Input [26]: [cap#94338977, retIC#94338978, resretIC#94338979, numcos#94338980, numdates#94338981, annual_bmret#94338982, annual_ret#94338983, std_ret#94338984, Sharpe_ret#94338985, PctPos_ret#94338986, TR_ret#94338987, IR_ret#94338988, annual_resret#94338989, std_resret#94338990, Sharpe_resret#94338991, PctPos_resret#94338992, TR_resret#94338993, IR_resret#94338994, annual_retnet#94338995, std_retnet#94338996, Sharpe_retnet#94338997, PctPos_retnet#94338998, TR_retnet#94338999, IR_retnet#94339000, turnover#94339001, coverage#94339002] (7) ColumnarToRow [codegen id : 1] Input [2]: [cap#94339344, turnover#94339491] (8) Filter [codegen id : 1] Input [2]: [cap#94339344, turnover#94339491] Condition : isnotnull(cap#94339344) (9) BroadcastExchange Input [2]: [cap#94339344, turnover#94339491] Arguments: HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7532544] (10) InMemoryTableScan Output [3]: [cap#94160394, description#94160396, sort#94160395] Arguments: [cap#94160394, description#94160396, sort#94160395], [isnotnull(cap#94160394)] (11) InMemoryRelation Arguments: [cap#94160394, sort#94160395, description#94160396, universe#94160397], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397] +- FileScan csv [cap#94160377,sort#94160378,description#94160379,universe#94160380] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string> ,None) (12) Scan csv Output [4]: [cap#94160377, sort#94160378, description#94160379, universe#94160380] Batched: false Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv] ReadSchema: struct<cap:string,sort:string,description:string,universe:string> (13) Project [codegen id : 1] Output [4]: [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397] Input [4]: [cap#94160377, sort#94160378, description#94160379, universe#94160380] (14) Filter Input [3]: [cap#94160394, description#94160396, sort#94160395] Condition : isnotnull(cap#94160394) (15) Project Output [3]: [cap#94160394, description#94160396 AS cap_description#94342447, sort#94160395 AS cap_sort#94342448] Input [3]: [cap#94160394, description#94160396, sort#94160395] (16) BroadcastHashJoin [codegen id : 2] Left keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cap#94339344))] Right keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cast(cap#94160394 as float)))] Join condition: None (17) Project [codegen id : 2] Output [3]: [turnover#94339491, cap_description#94342447 AS cap#94342537, cap_sort#94342448] Input [5]: [cap#94339344, turnover#94339491, cap#94160394, cap_description#94342447, cap_sort#94342448] (18) Exchange Input [3]: [turnover#94339491, cap#94342537, cap_sort#94342448] Arguments: rangepartitioning(cap_sort#94342448 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7532552] (19) Sort [codegen id : 3] Input [3]: [turnover#94339491, cap#94342537, cap_sort#94342448] Arguments: [cap_sort#94342448 ASC NULLS FIRST], true, 0 (20) Project [codegen id : 3] Output [3]: [cap#94342537, turnover#94339491, (1.0 / cast(turnover#94339491 as double)) AS days_hold#94342568] Input [3]: [turnover#94339491, cap#94342537, cap_sort#94342448] (21) CollectLimit Input [3]: [cap#94342537, turnover#94339491, days_hold#94342568] Arguments: 10000