== Physical Plan ==
CollectLimit (19)
+- InMemoryTableScan (1)
      +- InMemoryRelation (2)
            +- * Sort (18)
               +- Exchange (17)
                  +- * Project (16)
                     +- * BroadcastHashJoin Inner BuildLeft (15)
                        :- BroadcastExchange (9)
                        :  +- * Filter (8)
                        :     +- * ColumnarToRow (7)
                        :        +- InMemoryTableScan (3)
                        :              +- InMemoryRelation (4)
                        :                    +- * Project (6)
                        :                       +- Scan csv (5)
                        +- * Filter (14)
                           +- InMemoryTableScan (10)
                                 +- InMemoryRelation (11)
                                       +- * Project (13)
                                          +- Scan csv (12)

(1) InMemoryTableScan
Output [7]: [cap#93981115, numcos#93981144, numdates#93981157, sort#93880531, description#93880533, universe#93880535, coverage#93981548]
Arguments: [cap#93981115, numcos#93981144, numdates#93981157, sort#93880531, description#93880533, universe#93880535, coverage#93981548]

(2) InMemoryRelation
Arguments: [cap#93981115, numcos#93981144, numdates#93981157, sort#93880531, description#93880533, universe#93880535, coverage#93981548], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(3) Sort [sort#93880531 ASC NULLS FIRST, description#93880533 ASC NULLS FIRST], true, 0
+- Exchange rangepartitioning(sort#93880531 ASC NULLS FIRST, description#93880533 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7503817]
   +- *(2) Project [cap#93981115, numcos#93981144, numdates#93981157, sort#93880531, description#93880533, universe#93880535, (cast(numcos#93981144 as double) / cast(universe#93880535 as double)) AS coverage#93981548]
      +- *(2) BroadcastHashJoin [knownfloatingpointnormalized(normalizenanandzero(cap#93981115))], [knownfloatingpointnormalized(normalizenanandzero(cast(cap#93880528 as float)))], Inner, BuildLeft, false
         :- BroadcastExchange HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, false]))),false), [id=#7503810]
         :  +- *(1) Filter isnotnull(cap#93981115)
         :     +- *(1) ColumnarToRow
         :        +- InMemoryTableScan [cap#93981115, numcos#93981144, numdates#93981157],
[isnotnull(cap#93981115)] : +- InMemoryRelation [cap#93981115, retIC#93981116, resretIC#93981143, numcos#93981144, numdates#93981157, annual_bmret#93981158, annual_ret#93981182, std_ret#93981204, Sharpe_ret#93981209, PctPos_ret#93981212, TR_ret#93981216, IR_ret#93981219, annual_resret#93981224, std_resret#93981228, Sharpe_resret#93981232, PctPos_resret#93981237, TR_resret#93981241, IR_resret#93981255, annual_retnet#93981257, std_retnet#93981259, Sharpe_retnet#93981261, PctPos_retnet#93981264, TR_retnet#93981278, IR_retnet#93981371, ... 2 more fields], StorageLevel(disk, memory, deserialized, 1 replicas) : +- *(1) Project [CASE WHEN ((cap#93980934 = NA) OR (cap#93980934 = null)) THEN null ELSE cast(cap#93980934 as float) END AS cap#93981115, CASE WHEN ((retIC#93980935 = NA) OR (retIC#93980935 = null)) THEN null ELSE cast(retIC#93980935 as float) END AS retIC#93981116, CASE WHEN ((resretIC#93980938 = NA) OR (resretIC#93980938 = null)) THEN null ELSE cast(resretIC#93980938 as float) END AS resretIC#93981143, CASE WHEN ((numcos#93980940 = NA) OR (numcos#93980940 = null)) THEN null ELSE cast(numcos#93980940 as float) END AS numcos#93981144, CASE WHEN ((numdates#93980943 = NA) OR (numdates#93980943 = null)) THEN null ELSE cast(numdates#93980943 as float) END AS numdates#93981157, CASE WHEN ((annual_bmret#93980945 = NA) OR (annual_bmret#93980945 = null)) THEN null ELSE cast(annual_bmret#93980945 as float) END AS annual_bmret#93981158, CASE WHEN ((annual_ret#93980949 = NA) OR (annual_ret#93980949 = null)) THEN null ELSE cast(annual_ret#93980949 as float) END AS annual_ret#93981182, CASE WHEN ((std_ret#93980951 = NA) OR (std_ret#93980951 = null)) THEN null ELSE cast(std_ret#93980951 as float) END AS std_ret#93981204, CASE WHEN ((Sharpe_ret#93980953 = NA) OR (Sharpe_ret#93980953 = null)) THEN null ELSE cast(Sharpe_ret#93980953 as float) END AS Sharpe_ret#93981209, CASE WHEN ((PctPos_ret#93980955 = NA) OR (PctPos_ret#93980955 = null)) THEN null ELSE cast(PctPos_ret#93980955 
as float) END AS PctPos_ret#93981212, CASE WHEN ((TR_ret#93980957 = NA) OR (TR_ret#93980957 = null)) THEN null ELSE cast(TR_ret#93980957 as float) END AS TR_ret#93981216, CASE WHEN ((IR_ret#93980959 = NA) OR (IR_ret#93980959 = null)) THEN null ELSE cast(IR_ret#93980959 as float) END AS IR_ret#93981219, CASE WHEN ((annual_resret#93980961 = NA) OR (annual_resret#93980961 = null)) THEN null ELSE cast(annual_resret#93980961 as float) END AS annual_resret#93981224, CASE WHEN ((std_resret#93980963 = NA) OR (std_resret#93980963 = null)) THEN null ELSE cast(std_resret#93980963 as float) END AS std_resret#93981228, CASE WHEN ((Sharpe_resret#93980965 = NA) OR (Sharpe_resret#93980965 = null)) THEN null ELSE cast(Sharpe_resret#93980965 as float) END AS Sharpe_resret#93981232, CASE WHEN ((PctPos_resret#93980967 = NA) OR (PctPos_resret#93980967 = null)) THEN null ELSE cast(PctPos_resret#93980967 as float) END AS PctPos_resret#93981237, CASE WHEN ((TR_resret#93980970 = NA) OR (TR_resret#93980970 = null)) THEN null ELSE cast(TR_resret#93980970 as float) END AS TR_resret#93981241, CASE WHEN ((IR_resret#93980972 = NA) OR (IR_resret#93980972 = null)) THEN null ELSE cast(IR_resret#93980972 as float) END AS IR_resret#93981255, CASE WHEN ((annual_retnet#93980974 = NA) OR (annual_retnet#93980974 = null)) THEN null ELSE cast(annual_retnet#93980974 as float) END AS annual_retnet#93981257, CASE WHEN ((std_retnet#93980976 = NA) OR (std_retnet#93980976 = null)) THEN null ELSE cast(std_retnet#93980976 as float) END AS std_retnet#93981259, CASE WHEN ((Sharpe_retnet#93980978 = NA) OR (Sharpe_retnet#93980978 = null)) THEN null ELSE cast(Sharpe_retnet#93980978 as float) END AS Sharpe_retnet#93981261, CASE WHEN ((PctPos_retnet#93980980 = NA) OR (PctPos_retnet#93980980 = null)) THEN null ELSE cast(PctPos_retnet#93980980 as float) END AS PctPos_retnet#93981264, CASE WHEN ((TR_retnet#93980981 = NA) OR (TR_retnet#93980981 = null)) THEN null ELSE cast(TR_retnet#93980981 as float) END AS 
TR_retnet#93981278, CASE WHEN ((IR_retnet#93980982 = NA) OR (IR_retnet#93980982 = null)) THEN null ELSE cast(IR_retnet#93980982 as float) END AS IR_retnet#93981371, ... 2 more fields] : +- FileScan csv [cap#93980934,retIC#93980935,resretIC#93980938,numcos#93980940,numdates#93980943,annual_bmret#93980945,annual_ret#93980949,std_ret#93980951,Sharpe_ret#93980953,PctPos_ret#93980955,TR_ret#93980957,IR_ret#93980959,annual_resret#93980961,std_resret#93980963,Sharpe_resret#93980965,PctPos_resret#93980967,TR_resret#93980970,IR_resret#93980972,annual_retnet#93980974,std_retnet#93980976,Sharpe_retnet#93980978,PctPos_retnet#93980980,TR_retnet#93980981,IR_retnet#93980982,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,... +- *(2) Filter isnotnull(cap#93880528) +- InMemoryTableScan [cap#93880528, sort#93880531, description#93880533, universe#93880535], [isnotnull(cap#93880528)] +- InMemoryRelation [cap#93880528, sort#93880531, description#93880533, universe#93880535], StorageLevel(disk, memory, deserialized, 1 replicas) +- *(1) Project [CASE WHEN ((cap#93880496 = NA) OR (cap#93880496 = null)) THEN null ELSE cast(cap#93880496 as int) END AS cap#93880528, CASE WHEN (sort#93880498 = null) THEN null ELSE sort#93880498 END AS sort#93880531, CASE WHEN (description#93880500 = null) THEN null ELSE description#93880500 END AS description#93880533, CASE WHEN ((universe#93880502 = NA) OR (universe#93880502 = null)) THEN null ELSE cast(universe#93880502 as int) END AS universe#93880535] +- FileScan csv [cap#93880496,sort#93880498,description#93880500,universe#93880502] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], 
PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string> ,None), [sort#93880531 ASC NULLS FIRST, description#93880533 ASC NULLS FIRST] (3) InMemoryTableScan Output [3]: [cap#93981115, numcos#93981144, numdates#93981157] Arguments: [cap#93981115, numcos#93981144, numdates#93981157], [isnotnull(cap#93981115)] (4) InMemoryRelation Arguments: [cap#93981115, retIC#93981116, resretIC#93981143, numcos#93981144, numdates#93981157, annual_bmret#93981158, annual_ret#93981182, std_ret#93981204, Sharpe_ret#93981209, PctPos_ret#93981212, TR_ret#93981216, IR_ret#93981219, annual_resret#93981224, std_resret#93981228, Sharpe_resret#93981232, PctPos_resret#93981237, TR_resret#93981241, IR_resret#93981255, annual_retnet#93981257, std_retnet#93981259, Sharpe_retnet#93981261, PctPos_retnet#93981264, TR_retnet#93981278, IR_retnet#93981371, ... 2 more fields], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#93980934 = NA) OR (cap#93980934 = null)) THEN null ELSE cast(cap#93980934 as float) END AS cap#93981115, CASE WHEN ((retIC#93980935 = NA) OR (retIC#93980935 = null)) THEN null ELSE cast(retIC#93980935 as float) END AS retIC#93981116, CASE WHEN ((resretIC#93980938 = NA) OR (resretIC#93980938 = null)) THEN null ELSE cast(resretIC#93980938 as float) END AS resretIC#93981143, CASE WHEN ((numcos#93980940 = NA) OR (numcos#93980940 = null)) THEN null ELSE cast(numcos#93980940 as float) END AS numcos#93981144, CASE WHEN ((numdates#93980943 = NA) OR (numdates#93980943 = null)) THEN null ELSE cast(numdates#93980943 as float) END AS numdates#93981157, CASE WHEN ((annual_bmret#93980945 = NA) OR (annual_bmret#93980945 = null)) THEN null ELSE cast(annual_bmret#93980945 as float) END AS annual_bmret#93981158, CASE WHEN ((annual_ret#93980949 = NA) OR (annual_ret#93980949 = null)) THEN null ELSE 
cast(annual_ret#93980949 as float) END AS annual_ret#93981182, CASE WHEN ((std_ret#93980951 = NA) OR (std_ret#93980951 = null)) THEN null ELSE cast(std_ret#93980951 as float) END AS std_ret#93981204, CASE WHEN ((Sharpe_ret#93980953 = NA) OR (Sharpe_ret#93980953 = null)) THEN null ELSE cast(Sharpe_ret#93980953 as float) END AS Sharpe_ret#93981209, CASE WHEN ((PctPos_ret#93980955 = NA) OR (PctPos_ret#93980955 = null)) THEN null ELSE cast(PctPos_ret#93980955 as float) END AS PctPos_ret#93981212, CASE WHEN ((TR_ret#93980957 = NA) OR (TR_ret#93980957 = null)) THEN null ELSE cast(TR_ret#93980957 as float) END AS TR_ret#93981216, CASE WHEN ((IR_ret#93980959 = NA) OR (IR_ret#93980959 = null)) THEN null ELSE cast(IR_ret#93980959 as float) END AS IR_ret#93981219, CASE WHEN ((annual_resret#93980961 = NA) OR (annual_resret#93980961 = null)) THEN null ELSE cast(annual_resret#93980961 as float) END AS annual_resret#93981224, CASE WHEN ((std_resret#93980963 = NA) OR (std_resret#93980963 = null)) THEN null ELSE cast(std_resret#93980963 as float) END AS std_resret#93981228, CASE WHEN ((Sharpe_resret#93980965 = NA) OR (Sharpe_resret#93980965 = null)) THEN null ELSE cast(Sharpe_resret#93980965 as float) END AS Sharpe_resret#93981232, CASE WHEN ((PctPos_resret#93980967 = NA) OR (PctPos_resret#93980967 = null)) THEN null ELSE cast(PctPos_resret#93980967 as float) END AS PctPos_resret#93981237, CASE WHEN ((TR_resret#93980970 = NA) OR (TR_resret#93980970 = null)) THEN null ELSE cast(TR_resret#93980970 as float) END AS TR_resret#93981241, CASE WHEN ((IR_resret#93980972 = NA) OR (IR_resret#93980972 = null)) THEN null ELSE cast(IR_resret#93980972 as float) END AS IR_resret#93981255, CASE WHEN ((annual_retnet#93980974 = NA) OR (annual_retnet#93980974 = null)) THEN null ELSE cast(annual_retnet#93980974 as float) END AS annual_retnet#93981257, CASE WHEN ((std_retnet#93980976 = NA) OR (std_retnet#93980976 = null)) THEN null ELSE cast(std_retnet#93980976 as float) END AS std_retnet#93981259, 
CASE WHEN ((Sharpe_retnet#93980978 = NA) OR (Sharpe_retnet#93980978 = null)) THEN null ELSE cast(Sharpe_retnet#93980978 as float) END AS Sharpe_retnet#93981261, CASE WHEN ((PctPos_retnet#93980980 = NA) OR (PctPos_retnet#93980980 = null)) THEN null ELSE cast(PctPos_retnet#93980980 as float) END AS PctPos_retnet#93981264, CASE WHEN ((TR_retnet#93980981 = NA) OR (TR_retnet#93980981 = null)) THEN null ELSE cast(TR_retnet#93980981 as float) END AS TR_retnet#93981278, CASE WHEN ((IR_retnet#93980982 = NA) OR (IR_retnet#93980982 = null)) THEN null ELSE cast(IR_retnet#93980982 as float) END AS IR_retnet#93981371, ... 2 more fields] +- FileScan csv [cap#93980934,retIC#93980935,resretIC#93980938,numcos#93980940,numdates#93980943,annual_bmret#93980945,annual_ret#93980949,std_ret#93980951,Sharpe_ret#93980953,PctPos_ret#93980955,TR_ret#93980957,IR_ret#93980959,annual_resret#93980961,std_resret#93980963,Sharpe_resret#93980965,PctPos_resret#93980967,TR_resret#93980970,IR_resret#93980972,annual_retnet#93980974,std_retnet#93980976,Sharpe_retnet#93980978,PctPos_retnet#93980980,TR_retnet#93980981,IR_retnet#93980982,... 2 more fields] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,... 
,None) (5) Scan csv Output [26]: [cap#93980934, retIC#93980935, resretIC#93980938, numcos#93980940, numdates#93980943, annual_bmret#93980945, annual_ret#93980949, std_ret#93980951, Sharpe_ret#93980953, PctPos_ret#93980955, TR_ret#93980957, IR_ret#93980959, annual_resret#93980961, std_resret#93980963, Sharpe_resret#93980965, PctPos_resret#93980967, TR_resret#93980970, IR_resret#93980972, annual_retnet#93980974, std_retnet#93980976, Sharpe_retnet#93980978, PctPos_retnet#93980980, TR_retnet#93980981, IR_retnet#93980982, turnover#93980983, coverage#93980984] Batched: false Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/estimize_signal_history/estimizesignal_postearnings/stats_cap.csv] ReadSchema: struct<cap:string,retIC:string,resretIC:string,numcos:string,numdates:string,annual_bmret:string,annual_ret:string,std_ret:string,Sharpe_ret:string,PctPos_ret:string,TR_ret:string,IR_ret:string,annual_resret:string,std_resret:string,Sharpe_resret:string,PctPos_resret:string,TR_resret:string,IR_resret:string,annual_retnet:string,std_retnet:string,Sharpe_retnet:string,PctPos_retnet:string,TR_retnet:string,IR_retnet:string,turnover:string,coverage:string> (6) Project [codegen id : 1] Output [26]: [CASE WHEN ((cap#93980934 = NA) OR (cap#93980934 = null)) THEN null ELSE cast(cap#93980934 as float) END AS cap#93981115, CASE WHEN ((retIC#93980935 = NA) OR (retIC#93980935 = null)) THEN null ELSE cast(retIC#93980935 as float) END AS retIC#93981116, CASE WHEN ((resretIC#93980938 = NA) OR (resretIC#93980938 = null)) THEN null ELSE cast(resretIC#93980938 as float) END AS resretIC#93981143, CASE WHEN ((numcos#93980940 = NA) OR (numcos#93980940 = null)) THEN null ELSE cast(numcos#93980940 as float) END AS numcos#93981144, CASE WHEN ((numdates#93980943 = NA) OR (numdates#93980943 = null)) THEN null ELSE cast(numdates#93980943 as float) END AS numdates#93981157, CASE WHEN ((annual_bmret#93980945 = NA) OR (annual_bmret#93980945 = null)) THEN null ELSE 
cast(annual_bmret#93980945 as float) END AS annual_bmret#93981158, CASE WHEN ((annual_ret#93980949 = NA) OR (annual_ret#93980949 = null)) THEN null ELSE cast(annual_ret#93980949 as float) END AS annual_ret#93981182, CASE WHEN ((std_ret#93980951 = NA) OR (std_ret#93980951 = null)) THEN null ELSE cast(std_ret#93980951 as float) END AS std_ret#93981204, CASE WHEN ((Sharpe_ret#93980953 = NA) OR (Sharpe_ret#93980953 = null)) THEN null ELSE cast(Sharpe_ret#93980953 as float) END AS Sharpe_ret#93981209, CASE WHEN ((PctPos_ret#93980955 = NA) OR (PctPos_ret#93980955 = null)) THEN null ELSE cast(PctPos_ret#93980955 as float) END AS PctPos_ret#93981212, CASE WHEN ((TR_ret#93980957 = NA) OR (TR_ret#93980957 = null)) THEN null ELSE cast(TR_ret#93980957 as float) END AS TR_ret#93981216, CASE WHEN ((IR_ret#93980959 = NA) OR (IR_ret#93980959 = null)) THEN null ELSE cast(IR_ret#93980959 as float) END AS IR_ret#93981219, CASE WHEN ((annual_resret#93980961 = NA) OR (annual_resret#93980961 = null)) THEN null ELSE cast(annual_resret#93980961 as float) END AS annual_resret#93981224, CASE WHEN ((std_resret#93980963 = NA) OR (std_resret#93980963 = null)) THEN null ELSE cast(std_resret#93980963 as float) END AS std_resret#93981228, CASE WHEN ((Sharpe_resret#93980965 = NA) OR (Sharpe_resret#93980965 = null)) THEN null ELSE cast(Sharpe_resret#93980965 as float) END AS Sharpe_resret#93981232, CASE WHEN ((PctPos_resret#93980967 = NA) OR (PctPos_resret#93980967 = null)) THEN null ELSE cast(PctPos_resret#93980967 as float) END AS PctPos_resret#93981237, CASE WHEN ((TR_resret#93980970 = NA) OR (TR_resret#93980970 = null)) THEN null ELSE cast(TR_resret#93980970 as float) END AS TR_resret#93981241, CASE WHEN ((IR_resret#93980972 = NA) OR (IR_resret#93980972 = null)) THEN null ELSE cast(IR_resret#93980972 as float) END AS IR_resret#93981255, CASE WHEN ((annual_retnet#93980974 = NA) OR (annual_retnet#93980974 = null)) THEN null ELSE cast(annual_retnet#93980974 as float) END AS annual_retnet#93981257, 
CASE WHEN ((std_retnet#93980976 = NA) OR (std_retnet#93980976 = null)) THEN null ELSE cast(std_retnet#93980976 as float) END AS std_retnet#93981259, CASE WHEN ((Sharpe_retnet#93980978 = NA) OR (Sharpe_retnet#93980978 = null)) THEN null ELSE cast(Sharpe_retnet#93980978 as float) END AS Sharpe_retnet#93981261, CASE WHEN ((PctPos_retnet#93980980 = NA) OR (PctPos_retnet#93980980 = null)) THEN null ELSE cast(PctPos_retnet#93980980 as float) END AS PctPos_retnet#93981264, CASE WHEN ((TR_retnet#93980981 = NA) OR (TR_retnet#93980981 = null)) THEN null ELSE cast(TR_retnet#93980981 as float) END AS TR_retnet#93981278, CASE WHEN ((IR_retnet#93980982 = NA) OR (IR_retnet#93980982 = null)) THEN null ELSE cast(IR_retnet#93980982 as float) END AS IR_retnet#93981371, CASE WHEN ((turnover#93980983 = NA) OR (turnover#93980983 = null)) THEN null ELSE cast(turnover#93980983 as float) END AS turnover#93981384, CASE WHEN ((coverage#93980984 = NA) OR (coverage#93980984 = null)) THEN null ELSE cast(coverage#93980984 as float) END AS coverage#93981386] Input [26]: [cap#93980934, retIC#93980935, resretIC#93980938, numcos#93980940, numdates#93980943, annual_bmret#93980945, annual_ret#93980949, std_ret#93980951, Sharpe_ret#93980953, PctPos_ret#93980955, TR_ret#93980957, IR_ret#93980959, annual_resret#93980961, std_resret#93980963, Sharpe_resret#93980965, PctPos_resret#93980967, TR_resret#93980970, IR_resret#93980972, annual_retnet#93980974, std_retnet#93980976, Sharpe_retnet#93980978, PctPos_retnet#93980980, TR_retnet#93980981, IR_retnet#93980982, turnover#93980983, coverage#93980984] (7) ColumnarToRow [codegen id : 1] Input [3]: [cap#93981115, numcos#93981144, numdates#93981157] (8) Filter [codegen id : 1] Input [3]: [cap#93981115, numcos#93981144, numdates#93981157] Condition : isnotnull(cap#93981115) (9) BroadcastExchange Input [3]: [cap#93981115, numcos#93981144, numdates#93981157] Arguments: HashedRelationBroadcastMode(List(knownfloatingpointnormalized(normalizenanandzero(input[0, float, 
false]))),false), [id=#7503810]

(10) InMemoryTableScan
Output [4]: [cap#93880528, sort#93880531, description#93880533, universe#93880535]
Arguments: [cap#93880528, sort#93880531, description#93880533, universe#93880535], [isnotnull(cap#93880528)]

(11) InMemoryRelation
Arguments: [cap#93880528, sort#93880531, description#93880533, universe#93880535], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN ((cap#93880496 = NA) OR (cap#93880496 = null)) THEN null ELSE cast(cap#93880496 as int) END AS cap#93880528, CASE WHEN (sort#93880498 = null) THEN null ELSE sort#93880498 END AS sort#93880531, CASE WHEN (description#93880500 = null) THEN null ELSE description#93880500 END AS description#93880533, CASE WHEN ((universe#93880502 = NA) OR (universe#93880502 = null)) THEN null ELSE cast(universe#93880502 as int) END AS universe#93880535]
+- FileScan csv [cap#93880496,sort#93880498,description#93880500,universe#93880502] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv], PartitionFilters: [], PushedFilters: [], ReadSchema: struct<cap:string,sort:string,description:string,universe:string>
,None)

(12) Scan csv
Output [4]: [cap#93880496, sort#93880498, description#93880500, universe#93880502]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/curate/curate_cap.csv]
ReadSchema: struct<cap:string,sort:string,description:string,universe:string>

(13) Project [codegen id : 1]
Output [4]: [CASE WHEN ((cap#93880496 = NA) OR (cap#93880496 = null)) THEN null ELSE cast(cap#93880496 as int) END AS cap#93880528, CASE WHEN (sort#93880498 = null) THEN null ELSE sort#93880498 END AS sort#93880531, CASE WHEN (description#93880500 = null) THEN null ELSE description#93880500 END AS description#93880533, CASE WHEN ((universe#93880502 = NA) OR (universe#93880502 = null)) THEN null ELSE cast(universe#93880502 as int) END AS universe#93880535]
Input [4]: [cap#93880496, sort#93880498, description#93880500, universe#93880502]

(14) Filter
Input [4]: [cap#93880528, sort#93880531, description#93880533, universe#93880535]
Condition : isnotnull(cap#93880528)

(15) BroadcastHashJoin [codegen id : 2]
Left keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cap#93981115))]
Right keys [1]: [knownfloatingpointnormalized(normalizenanandzero(cast(cap#93880528 as float)))]
Join condition: None

(16) Project [codegen id : 2]
Output [7]: [cap#93981115, numcos#93981144, numdates#93981157, sort#93880531, description#93880533, universe#93880535, (cast(numcos#93981144 as double) / cast(universe#93880535 as double)) AS coverage#93981548]
Input [7]: [cap#93981115, numcos#93981144, numdates#93981157, cap#93880528, sort#93880531, description#93880533, universe#93880535]

(17) Exchange
Input [7]: [cap#93981115, numcos#93981144, numdates#93981157, sort#93880531, description#93880533, universe#93880535, coverage#93981548]
Arguments: rangepartitioning(sort#93880531 ASC NULLS FIRST, description#93880533 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7503817]

(18) Sort [codegen id : 3]
Input [7]: [cap#93981115, numcos#93981144, numdates#93981157, sort#93880531, description#93880533, universe#93880535, coverage#93981548]
Arguments: [sort#93880531 ASC NULLS FIRST, description#93880533 ASC NULLS FIRST], true, 0

(19) CollectLimit
Input [7]: [cap#93981115, numcos#93981144, numdates#93981157, sort#93880531, description#93880533, universe#93880535, coverage#93981548]
Arguments: 1000000
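
Read as a data flow, operators (3) through (19) appear to do the following: parse the string columns of both CSV inputs, drop rows whose `cap` is null, inner-join the two inputs on `cap` compared as a float, derive `coverage` as `numcos / universe`, sort by `sort` and `description` with nulls first, and cap the result at 1000000 rows. The plain-Python sketch below mirrors that flow on in-memory rows. It is not Spark code. The names `parse`, `coverage_report`, and the dictionary row layout are hypothetical. It assumes the unquoted `NA` and `null` in the plan are string literals, approximates Spark's casts with Python's `float()` and `int()`, and does not model the NaN and signed-zero key normalization shown in operator (15).

```python
from typing import Optional

# Assumption: the plan prints string literals unquoted, so both markers are text.
MISSING = {"NA", "null"}


def parse(raw: Optional[str], kind=float):
    """Return None for a missing marker or unparseable text, else kind(raw)."""
    if raw is None or raw in MISSING:
        return None
    try:
        return kind(raw)
    except ValueError:
        return None


def coverage_report(stats_rows, cap_rows, limit=1_000_000):
    # Build side, operators (3)-(9): keep cap, numcos, numdates; drop null caps.
    build = {}
    for r in stats_rows:
        cap = parse(r["cap"])
        if cap is not None:
            build.setdefault(cap, []).append(
                (cap, parse(r["numcos"]), parse(r["numdates"]))
            )

    out = []
    # Probe side, operators (10)-(14): cap and universe are parsed as ints.
    for r in cap_rows:
        cap = parse(r["cap"], int)
        if cap is None:
            continue
        universe = parse(r["universe"], int)
        sort_key = None if r["sort"] in (None, "null") else r["sort"]
        description = None if r["description"] in (None, "null") else r["description"]
        # Operator (15): the probe key is compared after a cast to float.
        for s_cap, numcos, numdates in build.get(float(cap), []):
            # Operator (16): coverage is left as None when it is undefined.
            coverage = None if numcos is None or not universe else numcos / universe
            out.append({
                "cap": s_cap, "numcos": numcos, "numdates": numdates,
                "sort": sort_key, "description": description,
                "universe": universe, "coverage": coverage,
            })

    # Operators (17)-(19): ascending order with nulls first, then the row limit.
    out.sort(key=lambda row: tuple(
        (row[k] is not None, row[k] or "") for k in ("sort", "description")
    ))
    return out[:limit]


# Made-up rows for illustration only; they are not taken from the real CSV files.
stats = [
    {"cap": "1", "numcos": "50", "numdates": "10"},
    {"cap": "NA", "numcos": "5", "numdates": "1"},
    {"cap": "2", "numcos": "30", "numdates": "10"},
]
caps = [
    {"cap": "2", "sort": "b", "description": "Mid", "universe": "60"},
    {"cap": "1", "sort": "a", "description": "Large", "universe": "100"},
    {"cap": "3", "sort": "c", "description": "Other", "universe": "NA"},
]
report = coverage_report(stats, caps)
```

With these sample rows, the `NA` stats row and the unmatched `cap` 3 row fall out of the inner join, which leaves two rows ordered by `sort`.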