== Physical Plan ==
CollectLimit (19)
+- InMemoryTableScan (1)
      +- InMemoryRelation (2)
            +- * Sort (18)
               +- Exchange (17)
                  +- * Project (16)
                     +- * BroadcastHashJoin Inner BuildLeft (15)
                        :- BroadcastExchange (9)
                        :  +- * Filter (8)
                        :     +- * ColumnarToRow (7)
                        :        +- InMemoryTableScan (3)
                        :              +- InMemoryRelation (4)
                        :                    +- * Project (6)
                        :                       +- Scan csv (5)
                        +- * Filter (14)
                           +- InMemoryTableScan (10)
                                 +- InMemoryRelation (11)
                                       +- * Project (13)
                                          +- Scan csv (12)

(1) InMemoryTableScan
Output [7]: [cap#94296649, numcos#94296655, numdates#94296670, sort#94160395, description#94160396, universe#94160397, coverage#94296921]
Arguments: [cap#94296649, numcos#94296655, numdates#94296670, sort#94160395, description#94160396, universe#94160397, coverage#94296921]

(2) InMemoryRelation
Arguments: [cap#94296649, numcos#94296655, numdates#94296670, sort#94160395, description#94160396, universe#94160397, coverage#94296921], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(3) Sort [sort#94160395 ASC NULLS FIRST, description#94160396 ASC NULLS FIRST], true, 0
+- Exchange rangepartitioning(sort#94160395 ASC NULLS FIRST, description#94160396 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7528880]
   +- *(2) Project [cap#94296649, numcos#94296655, numdates#94296670, sort#94160395, description#94160396, universe#94160397, (cast(numcos#94296655 as double) / cast(universe#94160397 as double)) AS coverage#94296921]
      +- *(2) BroadcastHashJoin [knownfloatingpointnormalized(normalizenanandzero(cap#94296649))], [knownfloatingpointnormalized(normalizenanandzero(cast(cap#94160394 as float)))], Inner, BuildLeft, false
         :- BroadcastExchange HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7528873]
         :  +- *(1) Filter isnotnull(cap#94296649)
         :     +- *(1) ColumnarToRow
         :        +- InMemoryTableScan [cap#94296649, numcos#94296655, numdates#94296670],
[isnotnull(cap#94296649)]
         :              +- InMemoryRelation [cap#94296649, retIC#94296651, resretIC#94296653, numcos#94296655, numdates#94296670, annual_bmret#94296684, annual_ret#94296698, std_ret#94296712, Sharpe_ret#94296726, PctPos_ret#94296728, TR_ret#94296741, IR_ret#94296742, annual_resret#94296805, std_resret#94296806, Sharpe_resret#94296807, PctPos_resret#94296820, TR_resret#94296821, IR_resret#94296822, annual_retnet#94296823, std_retnet#94296824, Sharpe_retnet#94296825, PctPos_retnet#94296826, TR_retnet#94296827, IR_retnet#94296828, ... 2 more fields], StorageLevel(disk, memory, deserialized, 1 replicas)
         :                    +- *(1) Project [CASE WHEN ((cap#94296341 = NA) OR (cap#94296341 = null)) THEN null ELSE cast(cap#94296341 as float) END AS cap#94296649, CASE WHEN ((retIC#94296342 = NA) OR (retIC#94296342 = null)) THEN null ELSE cast(retIC#94296342 as float) END AS retIC#94296651, CASE WHEN ((resretIC#94296343 = NA) OR (resretIC#94296343 = null)) THEN null ELSE cast(resretIC#94296343 as float) END AS resretIC#94296653, CASE WHEN ((numcos#94296344 = NA) OR (numcos#94296344 = null)) THEN null ELSE cast(numcos#94296344 as float) END AS numcos#94296655, CASE WHEN ((numdates#94296345 = NA) OR (numdates#94296345 = null)) THEN null ELSE cast(numdates#94296345 as int) END AS numdates#94296670, CASE WHEN ((annual_bmret#94296346 = NA) OR (annual_bmret#94296346 = null)) THEN null ELSE cast(annual_bmret#94296346 as float) END AS annual_bmret#94296684, CASE WHEN ((annual_ret#94296347 = NA) OR (annual_ret#94296347 = null)) THEN null ELSE cast(annual_ret#94296347 as float) END AS annual_ret#94296698, CASE WHEN ((std_ret#94296348 = NA) OR (std_ret#94296348 = null)) THEN null ELSE cast(std_ret#94296348 as float) END AS std_ret#94296712, CASE WHEN ((Sharpe_ret#94296349 = NA) OR (Sharpe_ret#94296349 = null)) THEN null ELSE cast(Sharpe_ret#94296349 as float) END AS Sharpe_ret#94296726, CASE WHEN ((PctPos_ret#94296350 = NA) OR (PctPos_ret#94296350 = null)) THEN null ELSE cast(PctPos_ret#94296350 as
float) END AS PctPos_ret#94296728, CASE WHEN ((TR_ret#94296351 = NA) OR (TR_ret#94296351 = null)) THEN null ELSE cast(TR_ret#94296351 as float) END AS TR_ret#94296741, CASE WHEN ((IR_ret#94296352 = NA) OR (IR_ret#94296352 = null)) THEN null ELSE cast(IR_ret#94296352 as float) END AS IR_ret#94296742, CASE WHEN ((annual_resret#94296353 = NA) OR (annual_resret#94296353 = null)) THEN null ELSE cast(annual_resret#94296353 as float) END AS annual_resret#94296805, CASE WHEN ((std_resret#94296354 = NA) OR (std_resret#94296354 = null)) THEN null ELSE cast(std_resret#94296354 as float) END AS std_resret#94296806, CASE WHEN ((Sharpe_resret#94296355 = NA) OR (Sharpe_resret#94296355 = null)) THEN null ELSE cast(Sharpe_resret#94296355 as float) END AS Sharpe_resret#94296807, CASE WHEN ((PctPos_resret#94296356 = NA) OR (PctPos_resret#94296356 = null)) THEN null ELSE cast(PctPos_resret#94296356 as float) END AS PctPos_resret#94296820, CASE WHEN ((TR_resret#94296357 = NA) OR (TR_resret#94296357 = null)) THEN null ELSE cast(TR_resret#94296357 as float) END AS TR_resret#94296821, CASE WHEN ((IR_resret#94296358 = NA) OR (IR_resret#94296358 = null)) THEN null ELSE cast(IR_resret#94296358 as float) END AS IR_resret#94296822, CASE WHEN ((annual_retnet#94296359 = NA) OR (annual_retnet#94296359 = null)) THEN null ELSE cast(annual_retnet#94296359 as float) END AS annual_retnet#94296823, CASE WHEN ((std_retnet#94296360 = NA) OR (std_retnet#94296360 = null)) THEN null ELSE cast(std_retnet#94296360 as float) END AS std_retnet#94296824, CASE WHEN ((Sharpe_retnet#94296361 = NA) OR (Sharpe_retnet#94296361 = null)) THEN null ELSE cast(Sharpe_retnet#94296361 as float) END AS Sharpe_retnet#94296825, CASE WHEN ((PctPos_retnet#94296362 = NA) OR (PctPos_retnet#94296362 = null)) THEN null ELSE cast(PctPos_retnet#94296362 as float) END AS PctPos_retnet#94296826, CASE WHEN ((TR_retnet#94296363 = NA) OR (TR_retnet#94296363 = null)) THEN null ELSE cast(TR_retnet#94296363 as float) END AS TR_retnet#94296827, 
CASE WHEN ((IR_retnet#94296364 = NA) OR (IR_retnet#94296364 = null)) THEN null ELSE cast(IR_retnet#94296364 as float) END AS IR_retnet#94296828, ... 2 more fields]
         :                       +- FileScan csv [cap#94296341,retIC#94296342,resretIC#94296343,numcos#94296344,numdates#94296345,annual_bmret#94296346,annual_ret#94296347,std_ret#94296348,Sharpe_ret#94296349,PctPos_ret#94296350,TR_ret#94296351,IR_ret#94296352,annual_resret#94296353,std_resret#94296354,Sharpe_resret#94296355,PctPos_resret#94296356,TR_resret#94296357,IR_resret#94296358,annual_retnet#94296359,std_retnet#94296360,Sharpe_retnet#94296361,PctPos_retnet#94296362,TR_retnet#94296363,IR_retnet#94296364,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/esg_innovation/innovat..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,...
         +- *(2) Filter isnotnull(cap#94160394)
            +- InMemoryTableScan [cap#94160394, sort#94160395, description#94160396, universe#94160397], [isnotnull(cap#94160394)]
                  +- InMemoryRelation [cap#94160394, sort#94160395, description#94160396, universe#94160397], StorageLevel(disk, memory, deserialized, 1 replicas)
                        +- *(1) Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397]
                           +- FileScan csv [cap#94160377,sort#94160378,description#94160379,universe#94160380] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [],
PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
,None), [sort#94160395 ASC NULLS FIRST, description#94160396 ASC NULLS FIRST]

(3) InMemoryTableScan
Output [3]: [cap#94296649, numcos#94296655, numdates#94296670]
Arguments: [cap#94296649, numcos#94296655, numdates#94296670], [isnotnull(cap#94296649)]

(4) InMemoryRelation
Arguments: [cap#94296649, retIC#94296651, resretIC#94296653, numcos#94296655, numdates#94296670, annual_bmret#94296684, annual_ret#94296698, std_ret#94296712, Sharpe_ret#94296726, PctPos_ret#94296728, TR_ret#94296741, IR_ret#94296742, annual_resret#94296805, std_resret#94296806, Sharpe_resret#94296807, PctPos_resret#94296820, TR_resret#94296821, IR_resret#94296822, annual_retnet#94296823, std_retnet#94296824, Sharpe_retnet#94296825, PctPos_retnet#94296826, TR_retnet#94296827, IR_retnet#94296828, ... 2 more fields], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94296341 = NA) OR (cap#94296341 = null)) THEN null ELSE cast(cap#94296341 as float) END AS cap#94296649, CASE WHEN ((retIC#94296342 = NA) OR (retIC#94296342 = null)) THEN null ELSE cast(retIC#94296342 as float) END AS retIC#94296651, CASE WHEN ((resretIC#94296343 = NA) OR (resretIC#94296343 = null)) THEN null ELSE cast(resretIC#94296343 as float) END AS resretIC#94296653, CASE WHEN ((numcos#94296344 = NA) OR (numcos#94296344 = null)) THEN null ELSE cast(numcos#94296344 as float) END AS numcos#94296655, CASE WHEN ((numdates#94296345 = NA) OR (numdates#94296345 = null)) THEN null ELSE cast(numdates#94296345 as int) END AS numdates#94296670, CASE WHEN ((annual_bmret#94296346 = NA) OR (annual_bmret#94296346 = null)) THEN null ELSE cast(annual_bmret#94296346 as float) END AS annual_bmret#94296684, CASE WHEN ((annual_ret#94296347 = NA) OR (annual_ret#94296347 = null)) THEN null ELSE cast(annual_ret#94296347 as float) END
AS annual_ret#94296698, CASE WHEN ((std_ret#94296348 = NA) OR (std_ret#94296348 = null)) THEN null ELSE cast(std_ret#94296348 as float) END AS std_ret#94296712, CASE WHEN ((Sharpe_ret#94296349 = NA) OR (Sharpe_ret#94296349 = null)) THEN null ELSE cast(Sharpe_ret#94296349 as float) END AS Sharpe_ret#94296726, CASE WHEN ((PctPos_ret#94296350 = NA) OR (PctPos_ret#94296350 = null)) THEN null ELSE cast(PctPos_ret#94296350 as float) END AS PctPos_ret#94296728, CASE WHEN ((TR_ret#94296351 = NA) OR (TR_ret#94296351 = null)) THEN null ELSE cast(TR_ret#94296351 as float) END AS TR_ret#94296741, CASE WHEN ((IR_ret#94296352 = NA) OR (IR_ret#94296352 = null)) THEN null ELSE cast(IR_ret#94296352 as float) END AS IR_ret#94296742, CASE WHEN ((annual_resret#94296353 = NA) OR (annual_resret#94296353 = null)) THEN null ELSE cast(annual_resret#94296353 as float) END AS annual_resret#94296805, CASE WHEN ((std_resret#94296354 = NA) OR (std_resret#94296354 = null)) THEN null ELSE cast(std_resret#94296354 as float) END AS std_resret#94296806, CASE WHEN ((Sharpe_resret#94296355 = NA) OR (Sharpe_resret#94296355 = null)) THEN null ELSE cast(Sharpe_resret#94296355 as float) END AS Sharpe_resret#94296807, CASE WHEN ((PctPos_resret#94296356 = NA) OR (PctPos_resret#94296356 = null)) THEN null ELSE cast(PctPos_resret#94296356 as float) END AS PctPos_resret#94296820, CASE WHEN ((TR_resret#94296357 = NA) OR (TR_resret#94296357 = null)) THEN null ELSE cast(TR_resret#94296357 as float) END AS TR_resret#94296821, CASE WHEN ((IR_resret#94296358 = NA) OR (IR_resret#94296358 = null)) THEN null ELSE cast(IR_resret#94296358 as float) END AS IR_resret#94296822, CASE WHEN ((annual_retnet#94296359 = NA) OR (annual_retnet#94296359 = null)) THEN null ELSE cast(annual_retnet#94296359 as float) END AS annual_retnet#94296823, CASE WHEN ((std_retnet#94296360 = NA) OR (std_retnet#94296360 = null)) THEN null ELSE cast(std_retnet#94296360 as float) END AS std_retnet#94296824, CASE WHEN ((Sharpe_retnet#94296361 = NA) 
OR (Sharpe_retnet#94296361 = null)) THEN null ELSE cast(Sharpe_retnet#94296361 as float) END AS Sharpe_retnet#94296825, CASE WHEN ((PctPos_retnet#94296362 = NA) OR (PctPos_retnet#94296362 = null)) THEN null ELSE cast(PctPos_retnet#94296362 as float) END AS PctPos_retnet#94296826, CASE WHEN ((TR_retnet#94296363 = NA) OR (TR_retnet#94296363 = null)) THEN null ELSE cast(TR_retnet#94296363 as float) END AS TR_retnet#94296827, CASE WHEN ((IR_retnet#94296364 = NA) OR (IR_retnet#94296364 = null)) THEN null ELSE cast(IR_retnet#94296364 as float) END AS IR_retnet#94296828, ... 2 more fields]
+- FileScan csv [cap#94296341,retIC#94296342,resretIC#94296343,numcos#94296344,numdates#94296345,annual_bmret#94296346,annual_ret#94296347,std_ret#94296348,Sharpe_ret#94296349,PctPos_ret#94296350,TR_ret#94296351,IR_ret#94296352,annual_resret#94296353,std_resret#94296354,Sharpe_resret#94296355,PctPos_resret#94296356,TR_resret#94296357,IR_resret#94296358,annual_retnet#94296359,std_retnet#94296360,Sharpe_retnet#94296361,PctPos_retnet#94296362,TR_retnet#94296363,IR_retnet#94296364,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/esg_innovation/innovat..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,...
,None)

(5) Scan csv
Output [26]: [cap#94296341, retIC#94296342, resretIC#94296343, numcos#94296344, numdates#94296345, annual_bmret#94296346, annual_ret#94296347, std_ret#94296348, Sharpe_ret#94296349, PctPos_ret#94296350, TR_ret#94296351, IR_ret#94296352, annual_resret#94296353, std_resret#94296354, Sharpe_resret#94296355, PctPos_resret#94296356, TR_resret#94296357, IR_resret#94296358, annual_retnet#94296359, std_retnet#94296360, Sharpe_retnet#94296361, PctPos_retnet#94296362, TR_retnet#94296363, IR_retnet#94296364, turnover#94296365, coverage#94296366]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/esg_innovation/innovation/stats_cap.csv]
ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,annual_ret:string,std_ret:string,Sharpe_ret:string,PctPos_ret:string,TR_ret:string,IR_ret:string,annual_resret:string,std_resret:string,Sharpe_resret:string,PctPos_resret:string,TR_resret:string,IR_resret:string,annual_retnet:string,std_retnet:string,Sharpe_retnet:string,PctPos_retnet:string,TR_retnet:string,IR_retnet:string,turnover:string,coverage:string>

(6) Project [codegen id : 1]
Output [26]: [CASE WHEN ((cap#94296341 = NA) OR (cap#94296341 = null)) THEN null ELSE cast(cap#94296341 as float) END AS cap#94296649, CASE WHEN ((retIC#94296342 = NA) OR (retIC#94296342 = null)) THEN null ELSE cast(retIC#94296342 as float) END AS retIC#94296651, CASE WHEN ((resretIC#94296343 = NA) OR (resretIC#94296343 = null)) THEN null ELSE cast(resretIC#94296343 as float) END AS resretIC#94296653, CASE WHEN ((numcos#94296344 = NA) OR (numcos#94296344 = null)) THEN null ELSE cast(numcos#94296344 as float) END AS numcos#94296655, CASE WHEN ((numdates#94296345 = NA) OR (numdates#94296345 = null)) THEN null ELSE cast(numdates#94296345 as int) END AS numdates#94296670, CASE WHEN ((annual_bmret#94296346 = NA) OR (annual_bmret#94296346 = null)) THEN null ELSE cast(annual_bmret#94296346 as float) END
AS annual_bmret#94296684, CASE WHEN ((annual_ret#94296347 = NA) OR (annual_ret#94296347 = null)) THEN null ELSE cast(annual_ret#94296347 as float) END AS annual_ret#94296698, CASE WHEN ((std_ret#94296348 = NA) OR (std_ret#94296348 = null)) THEN null ELSE cast(std_ret#94296348 as float) END AS std_ret#94296712, CASE WHEN ((Sharpe_ret#94296349 = NA) OR (Sharpe_ret#94296349 = null)) THEN null ELSE cast(Sharpe_ret#94296349 as float) END AS Sharpe_ret#94296726, CASE WHEN ((PctPos_ret#94296350 = NA) OR (PctPos_ret#94296350 = null)) THEN null ELSE cast(PctPos_ret#94296350 as float) END AS PctPos_ret#94296728, CASE WHEN ((TR_ret#94296351 = NA) OR (TR_ret#94296351 = null)) THEN null ELSE cast(TR_ret#94296351 as float) END AS TR_ret#94296741, CASE WHEN ((IR_ret#94296352 = NA) OR (IR_ret#94296352 = null)) THEN null ELSE cast(IR_ret#94296352 as float) END AS IR_ret#94296742, CASE WHEN ((annual_resret#94296353 = NA) OR (annual_resret#94296353 = null)) THEN null ELSE cast(annual_resret#94296353 as float) END AS annual_resret#94296805, CASE WHEN ((std_resret#94296354 = NA) OR (std_resret#94296354 = null)) THEN null ELSE cast(std_resret#94296354 as float) END AS std_resret#94296806, CASE WHEN ((Sharpe_resret#94296355 = NA) OR (Sharpe_resret#94296355 = null)) THEN null ELSE cast(Sharpe_resret#94296355 as float) END AS Sharpe_resret#94296807, CASE WHEN ((PctPos_resret#94296356 = NA) OR (PctPos_resret#94296356 = null)) THEN null ELSE cast(PctPos_resret#94296356 as float) END AS PctPos_resret#94296820, CASE WHEN ((TR_resret#94296357 = NA) OR (TR_resret#94296357 = null)) THEN null ELSE cast(TR_resret#94296357 as float) END AS TR_resret#94296821, CASE WHEN ((IR_resret#94296358 = NA) OR (IR_resret#94296358 = null)) THEN null ELSE cast(IR_resret#94296358 as float) END AS IR_resret#94296822, CASE WHEN ((annual_retnet#94296359 = NA) OR (annual_retnet#94296359 = null)) THEN null ELSE cast(annual_retnet#94296359 as float) END AS annual_retnet#94296823, CASE WHEN ((std_retnet#94296360 = NA) OR 
(std_retnet#94296360 = null)) THEN null ELSE cast(std_retnet#94296360 as float) END AS std_retnet#94296824, CASE WHEN ((Sharpe_retnet#94296361 = NA) OR (Sharpe_retnet#94296361 = null)) THEN null ELSE cast(Sharpe_retnet#94296361 as float) END AS Sharpe_retnet#94296825, CASE WHEN ((PctPos_retnet#94296362 = NA) OR (PctPos_retnet#94296362 = null)) THEN null ELSE cast(PctPos_retnet#94296362 as float) END AS PctPos_retnet#94296826, CASE WHEN ((TR_retnet#94296363 = NA) OR (TR_retnet#94296363 = null)) THEN null ELSE cast(TR_retnet#94296363 as float) END AS TR_retnet#94296827, CASE WHEN ((IR_retnet#94296364 = NA) OR (IR_retnet#94296364 = null)) THEN null ELSE cast(IR_retnet#94296364 as float) END AS IR_retnet#94296828, CASE WHEN ((turnover#94296365 = NA) OR (turnover#94296365 = null)) THEN null ELSE cast(turnover#94296365 as float) END AS turnover#94296831, CASE WHEN ((coverage#94296366 = NA) OR (coverage#94296366 = null)) THEN null ELSE cast(coverage#94296366 as float) END AS coverage#94296832]
Input [26]: [cap#94296341, retIC#94296342, resretIC#94296343, numcos#94296344, numdates#94296345, annual_bmret#94296346, annual_ret#94296347, std_ret#94296348, Sharpe_ret#94296349, PctPos_ret#94296350, TR_ret#94296351, IR_ret#94296352, annual_resret#94296353, std_resret#94296354, Sharpe_resret#94296355, PctPos_resret#94296356, TR_resret#94296357, IR_resret#94296358, annual_retnet#94296359, std_retnet#94296360, Sharpe_retnet#94296361, PctPos_retnet#94296362, TR_retnet#94296363, IR_retnet#94296364, turnover#94296365, coverage#94296366]

(7) ColumnarToRow [codegen id : 1]
Input [3]: [cap#94296649, numcos#94296655, numdates#94296670]

(8) Filter [codegen id : 1]
Input [3]: [cap#94296649, numcos#94296655, numdates#94296670]
Condition : isnotnull(cap#94296649)

(9) BroadcastExchange
Input [3]: [cap#94296649, numcos#94296655, numdates#94296670]
Arguments: HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7528873]

(10) InMemoryTableScan
Output [4]: [cap#94160394, sort#94160395, description#94160396, universe#94160397]
Arguments: [cap#94160394, sort#94160395, description#94160396, universe#94160397], [isnotnull(cap#94160394)]

(11) InMemoryRelation
Arguments: [cap#94160394, sort#94160395, description#94160396, universe#94160397], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE cast(universe#94160380 as int) END AS universe#94160397]
+- FileScan csv [cap#94160377,sort#94160378,description#94160379,universe#94160380] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
,None)

(12) Scan csv
Output [4]: [cap#94160377, sort#94160378, description#94160379, universe#94160380]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv]
ReadSchema: struct<cap:string,sort:string,description:string,universe:string>

(13) Project [codegen id : 1]
Output [4]: [CASE WHEN ((cap#94160377 = NA) OR (cap#94160377 = null)) THEN null ELSE cast(cap#94160377 as int) END AS cap#94160394, CASE WHEN (sort#94160378 = null) THEN null ELSE sort#94160378 END AS sort#94160395, CASE WHEN (description#94160379 = null) THEN null ELSE description#94160379 END AS description#94160396, CASE WHEN ((universe#94160380 = NA) OR (universe#94160380 = null)) THEN null ELSE
cast(universe#94160380 as int) END AS universe#94160397]
Input [4]: [cap#94160377, sort#94160378, description#94160379, universe#94160380]

(14) Filter
Input [4]: [cap#94160394, sort#94160395, description#94160396, universe#94160397]
Condition : isnotnull(cap#94160394)

(15) BroadcastHashJoin [codegen id : 2]
Left keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cap#94296649))]
Right keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cast(cap#94160394 as float)))]
Join condition: None

(16) Project [codegen id : 2]
Output [7]: [cap#94296649, numcos#94296655, numdates#94296670, sort#94160395, description#94160396, universe#94160397, (cast(numcos#94296655 as double) / cast(universe#94160397 as double)) AS coverage#94296921]
Input [7]: [cap#94296649, numcos#94296655, numdates#94296670, cap#94160394, sort#94160395, description#94160396, universe#94160397]

(17) Exchange
Input [7]: [cap#94296649, numcos#94296655, numdates#94296670, sort#94160395, description#94160396, universe#94160397, coverage#94296921]
Arguments: rangepartitioning(sort#94160395 ASC NULLS FIRST, description#94160396 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7528880]

(18) Sort [codegen id : 3]
Input [7]: [cap#94296649, numcos#94296655, numdates#94296670, sort#94160395, description#94160396, universe#94160397, coverage#94296921]
Arguments: [sort#94160395 ASC NULLS FIRST, description#94160396 ASC NULLS FIRST], true, 0

(19) CollectLimit
Input [7]: [cap#94296649, numcos#94296655, numdates#94296670, sort#94160395, description#94160396, universe#94160397, coverage#94296921]
Arguments: 1000000