Project [CASE WHEN (date#93995781 = null) THEN null ELSE cast(date#93995781 as date) END AS date#93995980, CASE WHEN ((overall#93995782 = NA) OR (overall#93995782 = null)) THEN null ELSE cast(overall#93995782 as int) END AS overall#93995981, CASE WHEN ((ret#93995783 = NA) OR (ret#93995783 = null)) THEN null ELSE cast(ret#93995783 as float) END AS ret#93995983, CASE WHEN ((resret#93995784 = NA) OR (resret#93995784 = null)) THEN null ELSE cast(resret#93995784 as float) END AS resret#93995985, CASE WHEN ((retnet#93995785 = NA) OR (retnet#93995785 = null)) THEN null ELSE cast(retnet#93995785 as float) END AS retnet#93995987, CASE WHEN ((turnover#93995786 = NA) OR (turnover#93995786 = null)) THEN null ELSE cast(turnover#93995786 as float) END AS turnover#93995989, CASE WHEN ((numcos#93995787 = NA) OR (numcos#93995787 = null)) THEN null ELSE cast(numcos#93995787 as float) END AS numcos#93996017, CASE WHEN ((benchmark#93995788 = NA) OR (benchmark#93995788 = null)) THEN null ELSE cast(benchmark#93995788 as float) END AS benchmark#93996071, CASE WHEN ((excess_ret#93995789 = NA) OR (excess_ret#93995789 = null)) THEN null ELSE cast(excess_ret#93995789 as float) END AS excess_ret#93996099, CASE WHEN ((excess_resret#93995790 = NA) OR (excess_resret#93995790 = null)) THEN null ELSE cast(excess_resret#93995790 as float) END AS excess_resret#93996103, CASE WHEN ((excess_retnet#93995791 = NA) OR (excess_retnet#93995791 = null)) THEN null ELSE cast(excess_retnet#93995791 as float) END AS excess_retnet#93996108]
Filter (isnotnull(cap#93995984) AND (cast(cap#93995984 as string) = 0))
WholeStageCodegen (1)
InMemoryTableScan [cap#93995984, date#93995982, numcos#93996098], [isnotnull(cap#93995984), (cast(cap#93995984 as string) = 0)]
Project [CASE WHEN (date#93995806 = null) THEN null ELSE cast(date#93995806 as date) END AS date#93995982, CASE WHEN ((cap#93995807 = NA) OR (cap#93995807 = null)) THEN null ELSE cast(cap#93995807 as float) END AS cap#93995984, CASE WHEN ((ret#93995808 = NA) OR (ret#93995808 = null)) THEN null ELSE cast(ret#93995808 as float) END AS ret#93995986, CASE WHEN ((resret#93995809 = NA) OR (resret#93995809 = null)) THEN null ELSE cast(resret#93995809 as float) END AS resret#93995988, CASE WHEN ((retnet#93995810 = NA) OR (retnet#93995810 = null)) THEN null ELSE cast(retnet#93995810 as float) END AS retnet#93996016, CASE WHEN ((turnover#93995811 = NA) OR (turnover#93995811 = null)) THEN null ELSE cast(turnover#93995811 as float) END AS turnover#93996044, CASE WHEN ((numcos#93995812 = NA) OR (numcos#93995812 = null)) THEN null ELSE cast(numcos#93995812 as float) END AS numcos#93996098, CASE WHEN ((coverage#93995813 = NA) OR (coverage#93995813 = null)) THEN null ELSE cast(coverage#93995813 as float) END AS coverage#93996100, CASE WHEN ((benchmark#93995814 = NA) OR (benchmark#93995814 = null)) THEN null ELSE cast(benchmark#93995814 as float) END AS benchmark#93996105, CASE WHEN ((excess_ret#93995815 = NA) OR (excess_ret#93995815 = null)) THEN null ELSE cast(excess_ret#93995815 as float) END AS excess_ret#93996110, CASE WHEN ((excess_resret#93995816 = NA) OR (excess_resret#93995816 = null)) THEN null ELSE cast(excess_resret#93995816 as float) END AS excess_resret#93996113, CASE WHEN ((excess_retnet#93995817 = NA) OR (excess_retnet#93995817 = null)) THEN null ELSE cast(excess_retnet#93995817 as float) END AS excess_retnet#93996137]
== Physical Plan ==
CollectLimit (16)
+- InMemoryTableScan (1)
+- InMemoryRelation (2)
+- * Sort (15)
+- Exchange (14)
+- Union (13)
:- InMemoryTableScan (3)
: +- InMemoryRelation (4)
: +- * Project (6)
: +- Scan csv (5)
+- * Project (12)
+- * Filter (11)
+- InMemoryTableScan (7)
+- InMemoryRelation (8)
+- * Project (10)
+- Scan csv (9)
(1) InMemoryTableScan
Output [2]: [date#93995980, numcos#93996017]
Arguments: [date#93995980, numcos#93996017]
(2) InMemoryRelation
Arguments: [date#93995980, numcos#93996017], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(2) Sort [date#93995980 ASC NULLS FIRST], true, 0
+- Exchange rangepartitioning(date#93995980 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7505019]
+- Union
:- InMemoryTableScan [date#93995980, numcos#93996017]
: +- InMemoryRelation [date#93995980, overall#93995981, ret#93995983, resret#93995985, retnet#93995987, turnover#93995989, numcos#93996017, benchmark#93996071, excess_ret#93996099, excess_resret#93996103, excess_retnet#93996108], StorageLevel(disk, memory, deserialized, 1 replicas)
: +- *(1) Project [CASE WHEN (date#93995781 = null) THEN null ELSE cast(date#93995781 as date) END AS date#93995980, CASE WHEN ((overall#93995782 = NA) OR (overall#93995782 = null)) THEN null ELSE cast(overall#93995782 as int) END AS overall#93995981, CASE WHEN ((ret#93995783 = NA) OR (ret#93995783 = null)) THEN null ELSE cast(ret#93995783 as float) END AS ret#93995983, CASE WHEN ((resret#93995784 = NA) OR (resret#93995784 = null)) THEN null ELSE cast(resret#93995784 as float) END AS resret#93995985, CASE WHEN ((retnet#93995785 = NA) OR (retnet#93995785 = null)) THEN null ELSE cast(retnet#93995785 as float) END AS retnet#93995987, CASE WHEN ((turnover#93995786 = NA) OR (turnover#93995786 = null)) THEN null ELSE cast(turnover#93995786 as float) END AS turnover#93995989, CASE WHEN ((numcos#93995787 = NA) OR (numcos#93995787 = null)) THEN null ELSE cast(numcos#93995787 as float) END AS numcos#93996017, CASE WHEN ((benchmark#93995788 = NA) OR (benchmark#93995788 = null)) THEN null ELSE cast(benchmark#93995788 as float) END AS benchmark#93996071, CASE WHEN ((excess_ret#93995789 = NA) OR (excess_ret#93995789 = null)) THEN null ELSE cast(excess_ret#93995789 as float) END AS excess_ret#93996099, CASE WHEN ((excess_resret#93995790 = NA) OR (excess_resret#93995790 = null)) THEN null ELSE cast(excess_resret#93995790 as float) END AS excess_resret#93996103, CASE WHEN ((excess_retnet#93995791 = NA) OR (excess_retnet#93995791 = null)) THEN null ELSE cast(excess_retnet#93995791 as float) END AS excess_retnet#93996108]
: +- FileScan csv [date#93995781,overall#93995782,ret#93995783,resret#93995784,retnet#93995785,turnover#93995786,numcos#93995787,benchmark#93995788,excess_ret#93995789,excess_resret#93995790,excess_retnet#93995791] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,overall:string,ret:string,resret:string,retnet:string,turnover:string,numcos:s...
+- *(1) Project [date#93995982, numcos#93996098]
+- *(1) Filter (isnotnull(cap#93995984) AND (cast(cap#93995984 as string) = 0))
+- InMemoryTableScan [cap#93995984, date#93995982, numcos#93996098], [isnotnull(cap#93995984), (cast(cap#93995984 as string) = 0)]
+- InMemoryRelation [date#93995982, cap#93995984, ret#93995986, resret#93995988, retnet#93996016, turnover#93996044, numcos#93996098, coverage#93996100, benchmark#93996105, excess_ret#93996110, excess_resret#93996113, excess_retnet#93996137], StorageLevel(disk, memory, deserialized, 1 replicas)
+- *(1) Project [CASE WHEN (date#93995806 = null) THEN null ELSE cast(date#93995806 as date) END AS date#93995982, CASE WHEN ((cap#93995807 = NA) OR (cap#93995807 = null)) THEN null ELSE cast(cap#93995807 as float) END AS cap#93995984, CASE WHEN ((ret#93995808 = NA) OR (ret#93995808 = null)) THEN null ELSE cast(ret#93995808 as float) END AS ret#93995986, CASE WHEN ((resret#93995809 = NA) OR (resret#93995809 = null)) THEN null ELSE cast(resret#93995809 as float) END AS resret#93995988, CASE WHEN ((retnet#93995810 = NA) OR (retnet#93995810 = null)) THEN null ELSE cast(retnet#93995810 as float) END AS retnet#93996016, CASE WHEN ((turnover#93995811 = NA) OR (turnover#93995811 = null)) THEN null ELSE cast(turnover#93995811 as float) END AS turnover#93996044, CASE WHEN ((numcos#93995812 = NA) OR (numcos#93995812 = null)) THEN null ELSE cast(numcos#93995812 as float) END AS numcos#93996098, CASE WHEN ((coverage#93995813 = NA) OR (coverage#93995813 = null)) THEN null ELSE cast(coverage#93995813 as float) END AS coverage#93996100, CASE WHEN ((benchmark#93995814 = NA) OR (benchmark#93995814 = null)) THEN null ELSE cast(benchmark#93995814 as float) END AS benchmark#93996105, CASE WHEN ((excess_ret#93995815 = NA) OR (excess_ret#93995815 = null)) THEN null ELSE cast(excess_ret#93995815 as float) END AS excess_ret#93996110, CASE WHEN ((excess_resret#93995816 = NA) OR (excess_resret#93995816 = null)) THEN null ELSE cast(excess_resret#93995816 as float) END AS excess_resret#93996113, CASE WHEN ((excess_retnet#93995817 = NA) OR (excess_retnet#93995817 = null)) THEN null ELSE cast(excess_retnet#93995817 as float) END AS excess_retnet#93996137]
+- FileScan csv [date#93995806,cap#93995807,ret#93995808,resret#93995809,retnet#93995810,turnover#93995811,numcos#93995812,coverage#93995813,benchmark#93995814,excess_ret#93995815,excess_resret#93995816,excess_retnet#93995817] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,cap:string,ret:string,resret:string,retnet:string,turnover:string,numcos:strin...
,None), [date#93995980 ASC NULLS FIRST]
(3) InMemoryTableScan
Output [2]: [date#93995980, numcos#93996017]
Arguments: [date#93995980, numcos#93996017]
(4) InMemoryRelation
Arguments: [date#93995980, overall#93995981, ret#93995983, resret#93995985, retnet#93995987, turnover#93995989, numcos#93996017, benchmark#93996071, excess_ret#93996099, excess_resret#93996103, excess_retnet#93996108], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN (date#93995781 = null) THEN null ELSE cast(date#93995781 as date) END AS date#93995980, CASE WHEN ((overall#93995782 = NA) OR (overall#93995782 = null)) THEN null ELSE cast(overall#93995782 as int) END AS overall#93995981, CASE WHEN ((ret#93995783 = NA) OR (ret#93995783 = null)) THEN null ELSE cast(ret#93995783 as float) END AS ret#93995983, CASE WHEN ((resret#93995784 = NA) OR (resret#93995784 = null)) THEN null ELSE cast(resret#93995784 as float) END AS resret#93995985, CASE WHEN ((retnet#93995785 = NA) OR (retnet#93995785 = null)) THEN null ELSE cast(retnet#93995785 as float) END AS retnet#93995987, CASE WHEN ((turnover#93995786 = NA) OR (turnover#93995786 = null)) THEN null ELSE cast(turnover#93995786 as float) END AS turnover#93995989, CASE WHEN ((numcos#93995787 = NA) OR (numcos#93995787 = null)) THEN null ELSE cast(numcos#93995787 as float) END AS numcos#93996017, CASE WHEN ((benchmark#93995788 = NA) OR (benchmark#93995788 = null)) THEN null ELSE cast(benchmark#93995788 as float) END AS benchmark#93996071, CASE WHEN ((excess_ret#93995789 = NA) OR (excess_ret#93995789 = null)) THEN null ELSE cast(excess_ret#93995789 as float) END AS excess_ret#93996099, CASE WHEN ((excess_resret#93995790 = NA) OR (excess_resret#93995790 = null)) THEN null ELSE cast(excess_resret#93995790 as float) END AS excess_resret#93996103, CASE WHEN ((excess_retnet#93995791 = NA) OR (excess_retnet#93995791 = null)) THEN null ELSE cast(excess_retnet#93995791 as float) END AS excess_retnet#93996108]
+- FileScan csv [date#93995781,overall#93995782,ret#93995783,resret#93995784,retnet#93995785,turnover#93995786,numcos#93995787,benchmark#93995788,excess_ret#93995789,excess_resret#93995790,excess_retnet#93995791] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,overall:string,ret:string,resret:string,retnet:string,turnover:string,numcos:s...
,None)
(5) Scan csv
Output [11]: [date#93995781, overall#93995782, ret#93995783, resret#93995784, retnet#93995785, turnover#93995786, numcos#93995787, benchmark#93995788, excess_ret#93995789, excess_resret#93995790, excess_retnet#93995791]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/estimize_signal_history/estimizesignal_preearnings/longshort_overall.csv]
ReadSchema: struct<date:string,overall:string,ret:string,resret:string,retnet:string,turnover:string,numcos:string,benchmark:string,excess_ret:string,excess_resret:string,excess_retnet:string>
(6) Project [codegen id : 1]
Output [11]: [CASE WHEN (date#93995781 = null) THEN null ELSE cast(date#93995781 as date) END AS date#93995980, CASE WHEN ((overall#93995782 = NA) OR (overall#93995782 = null)) THEN null ELSE cast(overall#93995782 as int) END AS overall#93995981, CASE WHEN ((ret#93995783 = NA) OR (ret#93995783 = null)) THEN null ELSE cast(ret#93995783 as float) END AS ret#93995983, CASE WHEN ((resret#93995784 = NA) OR (resret#93995784 = null)) THEN null ELSE cast(resret#93995784 as float) END AS resret#93995985, CASE WHEN ((retnet#93995785 = NA) OR (retnet#93995785 = null)) THEN null ELSE cast(retnet#93995785 as float) END AS retnet#93995987, CASE WHEN ((turnover#93995786 = NA) OR (turnover#93995786 = null)) THEN null ELSE cast(turnover#93995786 as float) END AS turnover#93995989, CASE WHEN ((numcos#93995787 = NA) OR (numcos#93995787 = null)) THEN null ELSE cast(numcos#93995787 as float) END AS numcos#93996017, CASE WHEN ((benchmark#93995788 = NA) OR (benchmark#93995788 = null)) THEN null ELSE cast(benchmark#93995788 as float) END AS benchmark#93996071, CASE WHEN ((excess_ret#93995789 = NA) OR (excess_ret#93995789 = null)) THEN null ELSE cast(excess_ret#93995789 as float) END AS excess_ret#93996099, CASE WHEN ((excess_resret#93995790 = NA) OR (excess_resret#93995790 = null)) THEN null ELSE cast(excess_resret#93995790 as float) END AS excess_resret#93996103, CASE WHEN ((excess_retnet#93995791 = NA) OR (excess_retnet#93995791 = null)) THEN null ELSE cast(excess_retnet#93995791 as float) END AS excess_retnet#93996108]
Input [11]: [date#93995781, overall#93995782, ret#93995783, resret#93995784, retnet#93995785, turnover#93995786, numcos#93995787, benchmark#93995788, excess_ret#93995789, excess_resret#93995790, excess_retnet#93995791]
(7) InMemoryTableScan
Output [3]: [cap#93995984, date#93995982, numcos#93996098]
Arguments: [cap#93995984, date#93995982, numcos#93996098], [isnotnull(cap#93995984), (cast(cap#93995984 as string) = 0)]
(8) InMemoryRelation
Arguments: [date#93995982, cap#93995984, ret#93995986, resret#93995988, retnet#93996016, turnover#93996044, numcos#93996098, coverage#93996100, benchmark#93996105, excess_ret#93996110, excess_resret#93996113, excess_retnet#93996137], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN (date#93995806 = null) THEN null ELSE cast(date#93995806 as date) END AS date#93995982, CASE WHEN ((cap#93995807 = NA) OR (cap#93995807 = null)) THEN null ELSE cast(cap#93995807 as float) END AS cap#93995984, CASE WHEN ((ret#93995808 = NA) OR (ret#93995808 = null)) THEN null ELSE cast(ret#93995808 as float) END AS ret#93995986, CASE WHEN ((resret#93995809 = NA) OR (resret#93995809 = null)) THEN null ELSE cast(resret#93995809 as float) END AS resret#93995988, CASE WHEN ((retnet#93995810 = NA) OR (retnet#93995810 = null)) THEN null ELSE cast(retnet#93995810 as float) END AS retnet#93996016, CASE WHEN ((turnover#93995811 = NA) OR (turnover#93995811 = null)) THEN null ELSE cast(turnover#93995811 as float) END AS turnover#93996044, CASE WHEN ((numcos#93995812 = NA) OR (numcos#93995812 = null)) THEN null ELSE cast(numcos#93995812 as float) END AS numcos#93996098, CASE WHEN ((coverage#93995813 = NA) OR (coverage#93995813 = null)) THEN null ELSE cast(coverage#93995813 as float) END AS coverage#93996100, CASE WHEN ((benchmark#93995814 = NA) OR (benchmark#93995814 = null)) THEN null ELSE cast(benchmark#93995814 as float) END AS benchmark#93996105, CASE WHEN ((excess_ret#93995815 = NA) OR (excess_ret#93995815 = null)) THEN null ELSE cast(excess_ret#93995815 as float) END AS excess_ret#93996110, CASE WHEN ((excess_resret#93995816 = NA) OR (excess_resret#93995816 = null)) THEN null ELSE cast(excess_resret#93995816 as float) END AS excess_resret#93996113, CASE WHEN ((excess_retnet#93995817 = NA) OR (excess_retnet#93995817 = null)) THEN null ELSE cast(excess_retnet#93995817 as float) END AS excess_retnet#93996137]
+- FileScan csv [date#93995806,cap#93995807,ret#93995808,resret#93995809,retnet#93995810,turnover#93995811,numcos#93995812,coverage#93995813,benchmark#93995814,excess_ret#93995815,excess_resret#93995816,excess_retnet#93995817] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,cap:string,ret:string,resret:string,retnet:string,turnover:string,numcos:strin...
,None)
(9) Scan csv
Output [12]: [date#93995806, cap#93995807, ret#93995808, resret#93995809, retnet#93995810, turnover#93995811, numcos#93995812, coverage#93995813, benchmark#93995814, excess_ret#93995815, excess_resret#93995816, excess_retnet#93995817]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/estimize_signal_history/estimizesignal_preearnings/longshort_cap.csv]
ReadSchema: struct<date:string,cap:string,ret:string,resret:string,retnet:string,turnover:string,numcos:string,coverage:string,benchmark:string,excess_ret:string,excess_resret:string,excess_retnet:string>
(10) Project [codegen id : 1]
Output [12]: [CASE WHEN (date#93995806 = null) THEN null ELSE cast(date#93995806 as date) END AS date#93995982, CASE WHEN ((cap#93995807 = NA) OR (cap#93995807 = null)) THEN null ELSE cast(cap#93995807 as float) END AS cap#93995984, CASE WHEN ((ret#93995808 = NA) OR (ret#93995808 = null)) THEN null ELSE cast(ret#93995808 as float) END AS ret#93995986, CASE WHEN ((resret#93995809 = NA) OR (resret#93995809 = null)) THEN null ELSE cast(resret#93995809 as float) END AS resret#93995988, CASE WHEN ((retnet#93995810 = NA) OR (retnet#93995810 = null)) THEN null ELSE cast(retnet#93995810 as float) END AS retnet#93996016, CASE WHEN ((turnover#93995811 = NA) OR (turnover#93995811 = null)) THEN null ELSE cast(turnover#93995811 as float) END AS turnover#93996044, CASE WHEN ((numcos#93995812 = NA) OR (numcos#93995812 = null)) THEN null ELSE cast(numcos#93995812 as float) END AS numcos#93996098, CASE WHEN ((coverage#93995813 = NA) OR (coverage#93995813 = null)) THEN null ELSE cast(coverage#93995813 as float) END AS coverage#93996100, CASE WHEN ((benchmark#93995814 = NA) OR (benchmark#93995814 = null)) THEN null ELSE cast(benchmark#93995814 as float) END AS benchmark#93996105, CASE WHEN ((excess_ret#93995815 = NA) OR (excess_ret#93995815 = null)) THEN null ELSE cast(excess_ret#93995815 as float) END AS excess_ret#93996110, CASE WHEN ((excess_resret#93995816 = NA) OR (excess_resret#93995816 = null)) THEN null ELSE cast(excess_resret#93995816 as float) END AS excess_resret#93996113, CASE WHEN ((excess_retnet#93995817 = NA) OR (excess_retnet#93995817 = null)) THEN null ELSE cast(excess_retnet#93995817 as float) END AS excess_retnet#93996137]
Input [12]: [date#93995806, cap#93995807, ret#93995808, resret#93995809, retnet#93995810, turnover#93995811, numcos#93995812, coverage#93995813, benchmark#93995814, excess_ret#93995815, excess_resret#93995816, excess_retnet#93995817]
(11) Filter [codegen id : 1]
Input [3]: [cap#93995984, date#93995982, numcos#93996098]
Condition : (isnotnull(cap#93995984) AND (cast(cap#93995984 as string) = 0))
(12) Project [codegen id : 1]
Output [2]: [date#93995982, numcos#93996098]
Input [3]: [cap#93995984, date#93995982, numcos#93996098]
(13) Union
(14) Exchange
Input [2]: [date#93995980, numcos#93996017]
Arguments: rangepartitioning(date#93995980 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7505019]
(15) Sort [codegen id : 2]
Input [2]: [date#93995980, numcos#93996017]
Arguments: [date#93995980 ASC NULLS FIRST], true, 0
(16) CollectLimit
Input [2]: [date#93995980, numcos#93996017]
Arguments: 1000000