== Physical Plan ==
CollectLimit (16)
+- InMemoryTableScan (1)
      +- InMemoryRelation (2)
            +- * Sort (15)
               +- Exchange (14)
                  +- Union (13)
                     :- InMemoryTableScan (3)
                     :     +- InMemoryRelation (4)
                     :           +- * Project (6)
                     :              +- Scan csv (5)
                     +- * Project (12)
                        +- * Filter (11)
                           +- InMemoryTableScan (7)
                                 +- InMemoryRelation (8)
                                       +- * Project (10)
                                          +- Scan csv (9)
(1) InMemoryTableScan
Output [2]: [date#94125225, numcos#94125295]
Arguments: [date#94125225, numcos#94125295]
(2) InMemoryRelation
Arguments: [date#94125225, numcos#94125295], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(2) Sort [date#94125225 ASC NULLS FIRST], true, 0
+- Exchange rangepartitioning(date#94125225 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7515277]
+- Union
:- InMemoryTableScan [date#94125225, numcos#94125295]
: +- InMemoryRelation [date#94125225, overall#94125226, ret#94125253, resret#94125254, retnet#94125281, turnover#94125294, numcos#94125295, benchmark#94125308, excess_ret#94125309, excess_resret#94125310, excess_retnet#94125312], StorageLevel(disk, memory, deserialized, 1 replicas)
: +- *(1) Project [CASE WHEN (date#94125102 = null) THEN null ELSE cast(date#94125102 as date) END AS date#94125225, CASE WHEN ((overall#94125103 = NA) OR (overall#94125103 = null)) THEN null ELSE cast(overall#94125103 as int) END AS overall#94125226, CASE WHEN ((ret#94125104 = NA) OR (ret#94125104 = null)) THEN null ELSE cast(ret#94125104 as float) END AS ret#94125253, CASE WHEN ((resret#94125105 = NA) OR (resret#94125105 = null)) THEN null ELSE cast(resret#94125105 as float) END AS resret#94125254, CASE WHEN ((retnet#94125106 = NA) OR (retnet#94125106 = null)) THEN null ELSE cast(retnet#94125106 as float) END AS retnet#94125281, CASE WHEN ((turnover#94125107 = NA) OR (turnover#94125107 = null)) THEN null ELSE cast(turnover#94125107 as float) END AS turnover#94125294, CASE WHEN ((numcos#94125108 = NA) OR (numcos#94125108 = null)) THEN null ELSE cast(numcos#94125108 as float) END AS numcos#94125295, CASE WHEN ((benchmark#94125109 = NA) OR (benchmark#94125109 = null)) THEN null ELSE cast(benchmark#94125109 as float) END AS benchmark#94125308, CASE WHEN ((excess_ret#94125110 = NA) OR (excess_ret#94125110 = null)) THEN null ELSE cast(excess_ret#94125110 as float) END AS excess_ret#94125309, CASE WHEN ((excess_resret#94125111 = NA) OR (excess_resret#94125111 = null)) THEN null ELSE cast(excess_resret#94125111 as float) END AS excess_resret#94125310, CASE WHEN ((excess_retnet#94125112 = NA) OR (excess_retnet#94125112 = null)) THEN null ELSE cast(excess_retnet#94125112 as float) END AS excess_retnet#94125312]
: +- FileScan csv [date#94125102,overall#94125103,ret#94125104,resret#94125105,retnet#94125106,turnover#94125107,numcos#94125108,benchmark#94125109,excess_ret#94125110,excess_resret#94125111,excess_retnet#94125112] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/transcripts/transcript..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,overall:string,ret:string,resret:string,retnet:string,turnover:string,numcos:s...
+- *(1) Project [date#94125311, numcos#94125370]
+- *(1) Filter (isnotnull(cap#94125313) AND (cast(cap#94125313 as string) = 0))
+- InMemoryTableScan [cap#94125313, date#94125311, numcos#94125370], [isnotnull(cap#94125313), (cast(cap#94125313 as string) = 0)]
+- InMemoryRelation [date#94125311, cap#94125313, ret#94125314, resret#94125341, retnet#94125342, turnover#94125343, numcos#94125370, coverage#94125371, benchmark#94125372, excess_ret#94125373, excess_resret#94125411, excess_retnet#94125412], StorageLevel(disk, memory, deserialized, 1 replicas)
+- *(1) Project [CASE WHEN (date#94125124 = null) THEN null ELSE cast(date#94125124 as date) END AS date#94125311, CASE WHEN ((cap#94125125 = NA) OR (cap#94125125 = null)) THEN null ELSE cast(cap#94125125 as float) END AS cap#94125313, CASE WHEN ((ret#94125126 = NA) OR (ret#94125126 = null)) THEN null ELSE cast(ret#94125126 as float) END AS ret#94125314, CASE WHEN ((resret#94125127 = NA) OR (resret#94125127 = null)) THEN null ELSE cast(resret#94125127 as float) END AS resret#94125341, CASE WHEN ((retnet#94125128 = NA) OR (retnet#94125128 = null)) THEN null ELSE cast(retnet#94125128 as float) END AS retnet#94125342, CASE WHEN ((turnover#94125129 = NA) OR (turnover#94125129 = null)) THEN null ELSE cast(turnover#94125129 as float) END AS turnover#94125343, CASE WHEN ((numcos#94125130 = NA) OR (numcos#94125130 = null)) THEN null ELSE cast(numcos#94125130 as float) END AS numcos#94125370, CASE WHEN ((coverage#94125131 = NA) OR (coverage#94125131 = null)) THEN null ELSE cast(coverage#94125131 as float) END AS coverage#94125371, CASE WHEN ((benchmark#94125132 = NA) OR (benchmark#94125132 = null)) THEN null ELSE cast(benchmark#94125132 as float) END AS benchmark#94125372, CASE WHEN ((excess_ret#94125133 = NA) OR (excess_ret#94125133 = null)) THEN null ELSE cast(excess_ret#94125133 as float) END AS excess_ret#94125373, CASE WHEN ((excess_resret#94125134 = NA) OR (excess_resret#94125134 = null)) THEN null ELSE cast(excess_resret#94125134 as float) END AS excess_resret#94125411, CASE WHEN ((excess_retnet#94125135 = NA) OR (excess_retnet#94125135 = null)) THEN null ELSE cast(excess_retnet#94125135 as float) END AS excess_retnet#94125412]
+- FileScan csv [date#94125124,cap#94125125,ret#94125126,resret#94125127,retnet#94125128,turnover#94125129,numcos#94125130,coverage#94125131,benchmark#94125132,excess_ret#94125133,excess_resret#94125134,excess_retnet#94125135] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/transcripts/transcript..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,cap:string,ret:string,resret:string,retnet:string,turnover:string,numcos:strin...
,None), [date#94125225 ASC NULLS FIRST]
(3) InMemoryTableScan
Output [2]: [date#94125225, numcos#94125295]
Arguments: [date#94125225, numcos#94125295]
(4) InMemoryRelation
Arguments: [date#94125225, overall#94125226, ret#94125253, resret#94125254, retnet#94125281, turnover#94125294, numcos#94125295, benchmark#94125308, excess_ret#94125309, excess_resret#94125310, excess_retnet#94125312], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN (date#94125102 = null) THEN null ELSE cast(date#94125102 as date) END AS date#94125225, CASE WHEN ((overall#94125103 = NA) OR (overall#94125103 = null)) THEN null ELSE cast(overall#94125103 as int) END AS overall#94125226, CASE WHEN ((ret#94125104 = NA) OR (ret#94125104 = null)) THEN null ELSE cast(ret#94125104 as float) END AS ret#94125253, CASE WHEN ((resret#94125105 = NA) OR (resret#94125105 = null)) THEN null ELSE cast(resret#94125105 as float) END AS resret#94125254, CASE WHEN ((retnet#94125106 = NA) OR (retnet#94125106 = null)) THEN null ELSE cast(retnet#94125106 as float) END AS retnet#94125281, CASE WHEN ((turnover#94125107 = NA) OR (turnover#94125107 = null)) THEN null ELSE cast(turnover#94125107 as float) END AS turnover#94125294, CASE WHEN ((numcos#94125108 = NA) OR (numcos#94125108 = null)) THEN null ELSE cast(numcos#94125108 as float) END AS numcos#94125295, CASE WHEN ((benchmark#94125109 = NA) OR (benchmark#94125109 = null)) THEN null ELSE cast(benchmark#94125109 as float) END AS benchmark#94125308, CASE WHEN ((excess_ret#94125110 = NA) OR (excess_ret#94125110 = null)) THEN null ELSE cast(excess_ret#94125110 as float) END AS excess_ret#94125309, CASE WHEN ((excess_resret#94125111 = NA) OR (excess_resret#94125111 = null)) THEN null ELSE cast(excess_resret#94125111 as float) END AS excess_resret#94125310, CASE WHEN ((excess_retnet#94125112 = NA) OR (excess_retnet#94125112 = null)) THEN null ELSE cast(excess_retnet#94125112 as float) END AS excess_retnet#94125312]
+- FileScan csv [date#94125102,overall#94125103,ret#94125104,resret#94125105,retnet#94125106,turnover#94125107,numcos#94125108,benchmark#94125109,excess_ret#94125110,excess_resret#94125111,excess_retnet#94125112] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/transcripts/transcript..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,overall:string,ret:string,resret:string,retnet:string,turnover:string,numcos:s...
,None)
(5) Scan csv
Output [11]: [date#94125102, overall#94125103, ret#94125104, resret#94125105, retnet#94125106, turnover#94125107, numcos#94125108, benchmark#94125109, excess_ret#94125110, excess_resret#94125111, excess_retnet#94125112]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/transcripts/transcript_model_residualized/longshort_overall.csv]
ReadSchema: struct<date:string,overall:string,ret:string,resret:string,retnet:string,turnover:string,numcos:string,benchmark:string,excess_ret:string,excess_resret:string,excess_retnet:string>
(6) Project [codegen id : 1]
Output [11]: [CASE WHEN (date#94125102 = null) THEN null ELSE cast(date#94125102 as date) END AS date#94125225, CASE WHEN ((overall#94125103 = NA) OR (overall#94125103 = null)) THEN null ELSE cast(overall#94125103 as int) END AS overall#94125226, CASE WHEN ((ret#94125104 = NA) OR (ret#94125104 = null)) THEN null ELSE cast(ret#94125104 as float) END AS ret#94125253, CASE WHEN ((resret#94125105 = NA) OR (resret#94125105 = null)) THEN null ELSE cast(resret#94125105 as float) END AS resret#94125254, CASE WHEN ((retnet#94125106 = NA) OR (retnet#94125106 = null)) THEN null ELSE cast(retnet#94125106 as float) END AS retnet#94125281, CASE WHEN ((turnover#94125107 = NA) OR (turnover#94125107 = null)) THEN null ELSE cast(turnover#94125107 as float) END AS turnover#94125294, CASE WHEN ((numcos#94125108 = NA) OR (numcos#94125108 = null)) THEN null ELSE cast(numcos#94125108 as float) END AS numcos#94125295, CASE WHEN ((benchmark#94125109 = NA) OR (benchmark#94125109 = null)) THEN null ELSE cast(benchmark#94125109 as float) END AS benchmark#94125308, CASE WHEN ((excess_ret#94125110 = NA) OR (excess_ret#94125110 = null)) THEN null ELSE cast(excess_ret#94125110 as float) END AS excess_ret#94125309, CASE WHEN ((excess_resret#94125111 = NA) OR (excess_resret#94125111 = null)) THEN null ELSE cast(excess_resret#94125111 as float) END AS excess_resret#94125310, CASE WHEN ((excess_retnet#94125112 = NA) OR (excess_retnet#94125112 = null)) THEN null ELSE cast(excess_retnet#94125112 as float) END AS excess_retnet#94125312]
Input [11]: [date#94125102, overall#94125103, ret#94125104, resret#94125105, retnet#94125106, turnover#94125107, numcos#94125108, benchmark#94125109, excess_ret#94125110, excess_resret#94125111, excess_retnet#94125112]
(7) InMemoryTableScan
Output [3]: [cap#94125313, date#94125311, numcos#94125370]
Arguments: [cap#94125313, date#94125311, numcos#94125370], [isnotnull(cap#94125313), (cast(cap#94125313 as string) = 0)]
(8) InMemoryRelation
Arguments: [date#94125311, cap#94125313, ret#94125314, resret#94125341, retnet#94125342, turnover#94125343, numcos#94125370, coverage#94125371, benchmark#94125372, excess_ret#94125373, excess_resret#94125411, excess_retnet#94125412], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN (date#94125124 = null) THEN null ELSE cast(date#94125124 as date) END AS date#94125311, CASE WHEN ((cap#94125125 = NA) OR (cap#94125125 = null)) THEN null ELSE cast(cap#94125125 as float) END AS cap#94125313, CASE WHEN ((ret#94125126 = NA) OR (ret#94125126 = null)) THEN null ELSE cast(ret#94125126 as float) END AS ret#94125314, CASE WHEN ((resret#94125127 = NA) OR (resret#94125127 = null)) THEN null ELSE cast(resret#94125127 as float) END AS resret#94125341, CASE WHEN ((retnet#94125128 = NA) OR (retnet#94125128 = null)) THEN null ELSE cast(retnet#94125128 as float) END AS retnet#94125342, CASE WHEN ((turnover#94125129 = NA) OR (turnover#94125129 = null)) THEN null ELSE cast(turnover#94125129 as float) END AS turnover#94125343, CASE WHEN ((numcos#94125130 = NA) OR (numcos#94125130 = null)) THEN null ELSE cast(numcos#94125130 as float) END AS numcos#94125370, CASE WHEN ((coverage#94125131 = NA) OR (coverage#94125131 = null)) THEN null ELSE cast(coverage#94125131 as float) END AS coverage#94125371, CASE WHEN ((benchmark#94125132 = NA) OR (benchmark#94125132 = null)) THEN null ELSE cast(benchmark#94125132 as float) END AS benchmark#94125372, CASE WHEN ((excess_ret#94125133 = NA) OR (excess_ret#94125133 = null)) THEN null ELSE cast(excess_ret#94125133 as float) END AS excess_ret#94125373, CASE WHEN ((excess_resret#94125134 = NA) OR (excess_resret#94125134 = null)) THEN null ELSE cast(excess_resret#94125134 as float) END AS excess_resret#94125411, CASE WHEN ((excess_retnet#94125135 = NA) OR (excess_retnet#94125135 = null)) THEN null ELSE cast(excess_retnet#94125135 as float) END AS excess_retnet#94125412]
+- FileScan csv [date#94125124,cap#94125125,ret#94125126,resret#94125127,retnet#94125128,turnover#94125129,numcos#94125130,coverage#94125131,benchmark#94125132,excess_ret#94125133,excess_resret#94125134,excess_retnet#94125135] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/transcripts/transcript..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,cap:string,ret:string,resret:string,retnet:string,turnover:string,numcos:strin...
,None)
(9) Scan csv
Output [12]: [date#94125124, cap#94125125, ret#94125126, resret#94125127, retnet#94125128, turnover#94125129, numcos#94125130, coverage#94125131, benchmark#94125132, excess_ret#94125133, excess_resret#94125134, excess_retnet#94125135]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/transcripts/transcript_model_residualized/longshort_cap.csv]
ReadSchema: struct<date:string,cap:string,ret:string,resret:string,retnet:string,turnover:string,numcos:string,coverage:string,benchmark:string,excess_ret:string,excess_resret:string,excess_retnet:string>
(10) Project [codegen id : 1]
Output [12]: [CASE WHEN (date#94125124 = null) THEN null ELSE cast(date#94125124 as date) END AS date#94125311, CASE WHEN ((cap#94125125 = NA) OR (cap#94125125 = null)) THEN null ELSE cast(cap#94125125 as float) END AS cap#94125313, CASE WHEN ((ret#94125126 = NA) OR (ret#94125126 = null)) THEN null ELSE cast(ret#94125126 as float) END AS ret#94125314, CASE WHEN ((resret#94125127 = NA) OR (resret#94125127 = null)) THEN null ELSE cast(resret#94125127 as float) END AS resret#94125341, CASE WHEN ((retnet#94125128 = NA) OR (retnet#94125128 = null)) THEN null ELSE cast(retnet#94125128 as float) END AS retnet#94125342, CASE WHEN ((turnover#94125129 = NA) OR (turnover#94125129 = null)) THEN null ELSE cast(turnover#94125129 as float) END AS turnover#94125343, CASE WHEN ((numcos#94125130 = NA) OR (numcos#94125130 = null)) THEN null ELSE cast(numcos#94125130 as float) END AS numcos#94125370, CASE WHEN ((coverage#94125131 = NA) OR (coverage#94125131 = null)) THEN null ELSE cast(coverage#94125131 as float) END AS coverage#94125371, CASE WHEN ((benchmark#94125132 = NA) OR (benchmark#94125132 = null)) THEN null ELSE cast(benchmark#94125132 as float) END AS benchmark#94125372, CASE WHEN ((excess_ret#94125133 = NA) OR (excess_ret#94125133 = null)) THEN null ELSE cast(excess_ret#94125133 as float) END AS excess_ret#94125373, CASE WHEN ((excess_resret#94125134 = NA) OR (excess_resret#94125134 = null)) THEN null ELSE cast(excess_resret#94125134 as float) END AS excess_resret#94125411, CASE WHEN ((excess_retnet#94125135 = NA) OR (excess_retnet#94125135 = null)) THEN null ELSE cast(excess_retnet#94125135 as float) END AS excess_retnet#94125412]
Input [12]: [date#94125124, cap#94125125, ret#94125126, resret#94125127, retnet#94125128, turnover#94125129, numcos#94125130, coverage#94125131, benchmark#94125132, excess_ret#94125133, excess_resret#94125134, excess_retnet#94125135]
(11) Filter [codegen id : 1]
Input [3]: [cap#94125313, date#94125311, numcos#94125370]
Condition : (isnotnull(cap#94125313) AND (cast(cap#94125313 as string) = 0))
(12) Project [codegen id : 1]
Output [2]: [date#94125311, numcos#94125370]
Input [3]: [cap#94125313, date#94125311, numcos#94125370]
(13) Union
(14) Exchange
Input [2]: [date#94125225, numcos#94125295]
Arguments: rangepartitioning(date#94125225 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7515277]
(15) Sort [codegen id : 2]
Input [2]: [date#94125225, numcos#94125295]
Arguments: [date#94125225 ASC NULLS FIRST], true, 0
(16) CollectLimit
Input [2]: [date#94125225, numcos#94125295]
Arguments: 1000000