== Physical Plan ==
CollectLimit (16)
+- InMemoryTableScan (1)
      +- InMemoryRelation (2)
            +- * Sort (15)
               +- Exchange (14)
                  +- Union (13)
                     :- InMemoryTableScan (3)
                     :     +- InMemoryRelation (4)
                     :           +- * Project (6)
                     :              +- Scan csv (5)
                     +- * Project (12)
                        +- * Filter (11)
                           +- InMemoryTableScan (7)
                                 +- InMemoryRelation (8)
                                       +- * Project (10)
                                          +- Scan csv (9)
(1) InMemoryTableScan
Output [2]: [date#93981201, numcos#93981218]
Arguments: [date#93981201, numcos#93981218]
(2) InMemoryRelation
Arguments: [date#93981201, numcos#93981218], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(2) Sort [date#93981201 ASC NULLS FIRST], true, 0
+- Exchange rangepartitioning(date#93981201 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7503866]
+- Union
:- InMemoryTableScan [date#93981201, numcos#93981218]
: +- InMemoryRelation [date#93981201, overall#93981202, ret#93981203, resret#93981205, retnet#93981210, turnover#93981214, numcos#93981218, benchmark#93981222, excess_ret#93981225, excess_resret#93981230, excess_retnet#93981234], StorageLevel(disk, memory, deserialized, 1 replicas)
: +- *(1) Project [CASE WHEN (date#93980882 = null) THEN null ELSE cast(date#93980882 as date) END AS date#93981201, CASE WHEN ((overall#93980883 = NA) OR (overall#93980883 = null)) THEN null ELSE cast(overall#93980883 as int) END AS overall#93981202, CASE WHEN ((ret#93980884 = NA) OR (ret#93980884 = null)) THEN null ELSE cast(ret#93980884 as float) END AS ret#93981203, CASE WHEN ((resret#93980885 = NA) OR (resret#93980885 = null)) THEN null ELSE cast(resret#93980885 as float) END AS resret#93981205, CASE WHEN ((retnet#93980886 = NA) OR (retnet#93980886 = null)) THEN null ELSE cast(retnet#93980886 as float) END AS retnet#93981210, CASE WHEN ((turnover#93980887 = NA) OR (turnover#93980887 = null)) THEN null ELSE cast(turnover#93980887 as float) END AS turnover#93981214, CASE WHEN ((numcos#93980888 = NA) OR (numcos#93980888 = null)) THEN null ELSE cast(numcos#93980888 as float) END AS numcos#93981218, CASE WHEN ((benchmark#93980889 = NA) OR (benchmark#93980889 = null)) THEN null ELSE cast(benchmark#93980889 as float) END AS benchmark#93981222, CASE WHEN ((excess_ret#93980890 = NA) OR (excess_ret#93980890 = null)) THEN null ELSE cast(excess_ret#93980890 as float) END AS excess_ret#93981225, CASE WHEN ((excess_resret#93980891 = NA) OR (excess_resret#93980891 = null)) THEN null ELSE cast(excess_resret#93980891 as float) END AS excess_resret#93981230, CASE WHEN ((excess_retnet#93980892 = NA) OR (excess_retnet#93980892 = null)) THEN null ELSE cast(excess_retnet#93980892 as float) END AS excess_retnet#93981234]
: +- FileScan csv [date#93980882,overall#93980883,ret#93980884,resret#93980885,retnet#93980886,turnover#93980887,numcos#93980888,benchmark#93980889,excess_ret#93980890,excess_resret#93980891,excess_retnet#93980892] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,overall:string,ret:string,resret:string,retnet:string,turnover:string,numcos:s...
+- *(1) Project [date#93981206, numcos#93981229]
+- *(1) Filter (isnotnull(cap#93981207) AND (cast(cap#93981207 as string) = 0))
+- InMemoryTableScan [cap#93981207, date#93981206, numcos#93981229], [isnotnull(cap#93981207), (cast(cap#93981207 as string) = 0)]
+- InMemoryRelation [date#93981206, cap#93981207, ret#93981213, resret#93981217, retnet#93981221, turnover#93981226, numcos#93981229, coverage#93981233, benchmark#93981236, excess_ret#93981239, excess_resret#93981242, excess_retnet#93981256], StorageLevel(disk, memory, deserialized, 1 replicas)
+- *(1) Project [CASE WHEN (date#93980905 = null) THEN null ELSE cast(date#93980905 as date) END AS date#93981206, CASE WHEN ((cap#93980906 = NA) OR (cap#93980906 = null)) THEN null ELSE cast(cap#93980906 as float) END AS cap#93981207, CASE WHEN ((ret#93980907 = NA) OR (ret#93980907 = null)) THEN null ELSE cast(ret#93980907 as float) END AS ret#93981213, CASE WHEN ((resret#93980908 = NA) OR (resret#93980908 = null)) THEN null ELSE cast(resret#93980908 as float) END AS resret#93981217, CASE WHEN ((retnet#93980909 = NA) OR (retnet#93980909 = null)) THEN null ELSE cast(retnet#93980909 as float) END AS retnet#93981221, CASE WHEN ((turnover#93980910 = NA) OR (turnover#93980910 = null)) THEN null ELSE cast(turnover#93980910 as float) END AS turnover#93981226, CASE WHEN ((numcos#93980911 = NA) OR (numcos#93980911 = null)) THEN null ELSE cast(numcos#93980911 as float) END AS numcos#93981229, CASE WHEN ((coverage#93980912 = NA) OR (coverage#93980912 = null)) THEN null ELSE cast(coverage#93980912 as float) END AS coverage#93981233, CASE WHEN ((benchmark#93980913 = NA) OR (benchmark#93980913 = null)) THEN null ELSE cast(benchmark#93980913 as float) END AS benchmark#93981236, CASE WHEN ((excess_ret#93980914 = NA) OR (excess_ret#93980914 = null)) THEN null ELSE cast(excess_ret#93980914 as float) END AS excess_ret#93981239, CASE WHEN ((excess_resret#93980915 = NA) OR (excess_resret#93980915 = null)) THEN null ELSE cast(excess_resret#93980915 as float) END AS excess_resret#93981242, CASE WHEN ((excess_retnet#93980916 = NA) OR (excess_retnet#93980916 = null)) THEN null ELSE cast(excess_retnet#93980916 as float) END AS excess_retnet#93981256]
+- FileScan csv [date#93980905,cap#93980906,ret#93980907,resret#93980908,retnet#93980909,turnover#93980910,numcos#93980911,coverage#93980912,benchmark#93980913,excess_ret#93980914,excess_resret#93980915,excess_retnet#93980916] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,cap:string,ret:string,resret:string,retnet:string,turnover:string,numcos:strin...
,None), [date#93981201 ASC NULLS FIRST]
(3) InMemoryTableScan
Output [2]: [date#93981201, numcos#93981218]
Arguments: [date#93981201, numcos#93981218]
(4) InMemoryRelation
Arguments: [date#93981201, overall#93981202, ret#93981203, resret#93981205, retnet#93981210, turnover#93981214, numcos#93981218, benchmark#93981222, excess_ret#93981225, excess_resret#93981230, excess_retnet#93981234], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN (date#93980882 = null) THEN null ELSE cast(date#93980882 as date) END AS date#93981201, CASE WHEN ((overall#93980883 = NA) OR (overall#93980883 = null)) THEN null ELSE cast(overall#93980883 as int) END AS overall#93981202, CASE WHEN ((ret#93980884 = NA) OR (ret#93980884 = null)) THEN null ELSE cast(ret#93980884 as float) END AS ret#93981203, CASE WHEN ((resret#93980885 = NA) OR (resret#93980885 = null)) THEN null ELSE cast(resret#93980885 as float) END AS resret#93981205, CASE WHEN ((retnet#93980886 = NA) OR (retnet#93980886 = null)) THEN null ELSE cast(retnet#93980886 as float) END AS retnet#93981210, CASE WHEN ((turnover#93980887 = NA) OR (turnover#93980887 = null)) THEN null ELSE cast(turnover#93980887 as float) END AS turnover#93981214, CASE WHEN ((numcos#93980888 = NA) OR (numcos#93980888 = null)) THEN null ELSE cast(numcos#93980888 as float) END AS numcos#93981218, CASE WHEN ((benchmark#93980889 = NA) OR (benchmark#93980889 = null)) THEN null ELSE cast(benchmark#93980889 as float) END AS benchmark#93981222, CASE WHEN ((excess_ret#93980890 = NA) OR (excess_ret#93980890 = null)) THEN null ELSE cast(excess_ret#93980890 as float) END AS excess_ret#93981225, CASE WHEN ((excess_resret#93980891 = NA) OR (excess_resret#93980891 = null)) THEN null ELSE cast(excess_resret#93980891 as float) END AS excess_resret#93981230, CASE WHEN ((excess_retnet#93980892 = NA) OR (excess_retnet#93980892 = null)) THEN null ELSE cast(excess_retnet#93980892 as float) END AS excess_retnet#93981234]
+- FileScan csv [date#93980882,overall#93980883,ret#93980884,resret#93980885,retnet#93980886,turnover#93980887,numcos#93980888,benchmark#93980889,excess_ret#93980890,excess_resret#93980891,excess_retnet#93980892] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,overall:string,ret:string,resret:string,retnet:string,turnover:string,numcos:s...
,None)
(5) Scan csv
Output [11]: [date#93980882, overall#93980883, ret#93980884, resret#93980885, retnet#93980886, turnover#93980887, numcos#93980888, benchmark#93980889, excess_ret#93980890, excess_resret#93980891, excess_retnet#93980892]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/estimize_signal_history/estimizesignal_postearnings/longshort_overall.csv]
ReadSchema: struct<date:string,overall:string,ret:string,resret:string,retnet:string,turnover:string,numcos:string,benchmark:string,excess_ret:string,excess_resret:string,excess_retnet:string>
(6) Project [codegen id : 1]
Output [11]: [CASE WHEN (date#93980882 = null) THEN null ELSE cast(date#93980882 as date) END AS date#93981201, CASE WHEN ((overall#93980883 = NA) OR (overall#93980883 = null)) THEN null ELSE cast(overall#93980883 as int) END AS overall#93981202, CASE WHEN ((ret#93980884 = NA) OR (ret#93980884 = null)) THEN null ELSE cast(ret#93980884 as float) END AS ret#93981203, CASE WHEN ((resret#93980885 = NA) OR (resret#93980885 = null)) THEN null ELSE cast(resret#93980885 as float) END AS resret#93981205, CASE WHEN ((retnet#93980886 = NA) OR (retnet#93980886 = null)) THEN null ELSE cast(retnet#93980886 as float) END AS retnet#93981210, CASE WHEN ((turnover#93980887 = NA) OR (turnover#93980887 = null)) THEN null ELSE cast(turnover#93980887 as float) END AS turnover#93981214, CASE WHEN ((numcos#93980888 = NA) OR (numcos#93980888 = null)) THEN null ELSE cast(numcos#93980888 as float) END AS numcos#93981218, CASE WHEN ((benchmark#93980889 = NA) OR (benchmark#93980889 = null)) THEN null ELSE cast(benchmark#93980889 as float) END AS benchmark#93981222, CASE WHEN ((excess_ret#93980890 = NA) OR (excess_ret#93980890 = null)) THEN null ELSE cast(excess_ret#93980890 as float) END AS excess_ret#93981225, CASE WHEN ((excess_resret#93980891 = NA) OR (excess_resret#93980891 = null)) THEN null ELSE cast(excess_resret#93980891 as float) END AS excess_resret#93981230, CASE WHEN ((excess_retnet#93980892 = NA) OR (excess_retnet#93980892 = null)) THEN null ELSE cast(excess_retnet#93980892 as float) END AS excess_retnet#93981234]
Input [11]: [date#93980882, overall#93980883, ret#93980884, resret#93980885, retnet#93980886, turnover#93980887, numcos#93980888, benchmark#93980889, excess_ret#93980890, excess_resret#93980891, excess_retnet#93980892]
(7) InMemoryTableScan
Output [3]: [cap#93981207, date#93981206, numcos#93981229]
Arguments: [cap#93981207, date#93981206, numcos#93981229], [isnotnull(cap#93981207), (cast(cap#93981207 as string) = 0)]
(8) InMemoryRelation
Arguments: [date#93981206, cap#93981207, ret#93981213, resret#93981217, retnet#93981221, turnover#93981226, numcos#93981229, coverage#93981233, benchmark#93981236, excess_ret#93981239, excess_resret#93981242, excess_retnet#93981256], CachedRDDBuilder(org.apache.spark.sql.execution.columnar.DefaultCachedBatchSerializer@208e3fd9,StorageLevel(disk, memory, deserialized, 1 replicas),*(1) Project [CASE WHEN (date#93980905 = null) THEN null ELSE cast(date#93980905 as date) END AS date#93981206, CASE WHEN ((cap#93980906 = NA) OR (cap#93980906 = null)) THEN null ELSE cast(cap#93980906 as float) END AS cap#93981207, CASE WHEN ((ret#93980907 = NA) OR (ret#93980907 = null)) THEN null ELSE cast(ret#93980907 as float) END AS ret#93981213, CASE WHEN ((resret#93980908 = NA) OR (resret#93980908 = null)) THEN null ELSE cast(resret#93980908 as float) END AS resret#93981217, CASE WHEN ((retnet#93980909 = NA) OR (retnet#93980909 = null)) THEN null ELSE cast(retnet#93980909 as float) END AS retnet#93981221, CASE WHEN ((turnover#93980910 = NA) OR (turnover#93980910 = null)) THEN null ELSE cast(turnover#93980910 as float) END AS turnover#93981226, CASE WHEN ((numcos#93980911 = NA) OR (numcos#93980911 = null)) THEN null ELSE cast(numcos#93980911 as float) END AS numcos#93981229, CASE WHEN ((coverage#93980912 = NA) OR (coverage#93980912 = null)) THEN null ELSE cast(coverage#93980912 as float) END AS coverage#93981233, CASE WHEN ((benchmark#93980913 = NA) OR (benchmark#93980913 = null)) THEN null ELSE cast(benchmark#93980913 as float) END AS benchmark#93981236, CASE WHEN ((excess_ret#93980914 = NA) OR (excess_ret#93980914 = null)) THEN null ELSE cast(excess_ret#93980914 as float) END AS excess_ret#93981239, CASE WHEN ((excess_resret#93980915 = NA) OR (excess_resret#93980915 = null)) THEN null ELSE cast(excess_resret#93980915 as float) END AS excess_resret#93981242, CASE WHEN ((excess_retnet#93980916 = NA) OR (excess_retnet#93980916 = null)) THEN null ELSE cast(excess_retnet#93980916 as float) END AS excess_retnet#93981256]
+- FileScan csv [date#93980905,cap#93980906,ret#93980907,resret#93980908,retnet#93980909,turnover#93980910,numcos#93980911,coverage#93980912,benchmark#93980913,excess_ret#93980914,excess_resret#93980915,excess_retnet#93980916] Batched: false, DataFilters: [], Format: CSV, Location: InMemoryFileIndex(1 paths)[file:/srv/plusamp/data/default/ea-market/output/estimize_signal_histor..., PartitionFilters: [], PushedFilters: [], ReadSchema: struct<date:string,cap:string,ret:string,resret:string,retnet:string,turnover:string,numcos:strin...
,None)
(9) Scan csv
Output [12]: [date#93980905, cap#93980906, ret#93980907, resret#93980908, retnet#93980909, turnover#93980910, numcos#93980911, coverage#93980912, benchmark#93980913, excess_ret#93980914, excess_resret#93980915, excess_retnet#93980916]
Batched: false
Location: InMemoryFileIndex [file:/srv/plusamp/data/default/ea-market/output/estimize_signal_history/estimizesignal_postearnings/longshort_cap.csv]
ReadSchema: struct<date:string,cap:string,ret:string,resret:string,retnet:string,turnover:string,numcos:string,coverage:string,benchmark:string,excess_ret:string,excess_resret:string,excess_retnet:string>
(10) Project [codegen id : 1]
Output [12]: [CASE WHEN (date#93980905 = null) THEN null ELSE cast(date#93980905 as date) END AS date#93981206, CASE WHEN ((cap#93980906 = NA) OR (cap#93980906 = null)) THEN null ELSE cast(cap#93980906 as float) END AS cap#93981207, CASE WHEN ((ret#93980907 = NA) OR (ret#93980907 = null)) THEN null ELSE cast(ret#93980907 as float) END AS ret#93981213, CASE WHEN ((resret#93980908 = NA) OR (resret#93980908 = null)) THEN null ELSE cast(resret#93980908 as float) END AS resret#93981217, CASE WHEN ((retnet#93980909 = NA) OR (retnet#93980909 = null)) THEN null ELSE cast(retnet#93980909 as float) END AS retnet#93981221, CASE WHEN ((turnover#93980910 = NA) OR (turnover#93980910 = null)) THEN null ELSE cast(turnover#93980910 as float) END AS turnover#93981226, CASE WHEN ((numcos#93980911 = NA) OR (numcos#93980911 = null)) THEN null ELSE cast(numcos#93980911 as float) END AS numcos#93981229, CASE WHEN ((coverage#93980912 = NA) OR (coverage#93980912 = null)) THEN null ELSE cast(coverage#93980912 as float) END AS coverage#93981233, CASE WHEN ((benchmark#93980913 = NA) OR (benchmark#93980913 = null)) THEN null ELSE cast(benchmark#93980913 as float) END AS benchmark#93981236, CASE WHEN ((excess_ret#93980914 = NA) OR (excess_ret#93980914 = null)) THEN null ELSE cast(excess_ret#93980914 as float) END AS excess_ret#93981239, CASE WHEN ((excess_resret#93980915 = NA) OR (excess_resret#93980915 = null)) THEN null ELSE cast(excess_resret#93980915 as float) END AS excess_resret#93981242, CASE WHEN ((excess_retnet#93980916 = NA) OR (excess_retnet#93980916 = null)) THEN null ELSE cast(excess_retnet#93980916 as float) END AS excess_retnet#93981256]
Input [12]: [date#93980905, cap#93980906, ret#93980907, resret#93980908, retnet#93980909, turnover#93980910, numcos#93980911, coverage#93980912, benchmark#93980913, excess_ret#93980914, excess_resret#93980915, excess_retnet#93980916]
(11) Filter [codegen id : 1]
Input [3]: [cap#93981207, date#93981206, numcos#93981229]
Condition : (isnotnull(cap#93981207) AND (cast(cap#93981207 as string) = 0))
(12) Project [codegen id : 1]
Output [2]: [date#93981206, numcos#93981229]
Input [3]: [cap#93981207, date#93981206, numcos#93981229]
(13) Union
(14) Exchange
Input [2]: [date#93981201, numcos#93981218]
Arguments: rangepartitioning(date#93981201 ASC NULLS FIRST, 200), ENSURE_REQUIREMENTS, [id=#7503866]
(15) Sort [codegen id : 2]
Input [2]: [date#93981201, numcos#93981218]
Arguments: [date#93981201 ASC NULLS FIRST], true, 0
(16) CollectLimit
Input [2]: [date#93981201, numcos#93981218]
Arguments: 1000000