Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
12345678 ... 405
Topics (14173)
Replies Last Post Views
How does spark sql evaluate case statements? by kant kodali
3
by kant kodali
Is there any way to set the location of the history for the spark-shell per session? by Yeikel
3
by ZHANG Wei
Get Size of a column in Bytes for a Pyspark Dataframe by anbutech
1
by Yeikel
Re:RE: Going it alone. by jane thorpe
9
by Uwe@Moosheimer.com
unsubscribe by Jiang, Lan
2
by Rong, Jialei
wot no toggle ? by jane thorpe
6
by Mich Talebzadeh
[Spark SQL] AnalysisException: cannot resolve '`column_name`' given input columns by Joshua Conlin
0
by Joshua Conlin
[Spark Core]: Does an executor only cache the partitions it requires for its computations or always the full RDD? by zwithouta
1
by ZHANG Wei
[Pyspark] - Spark uses all available memory; unrelated to size of dataframe by Daniel Stojanov
2
by jane thorpe
Question about how parquet files are read and processed by Yeikel
1
by Kelvin Qin
Re: Going it alone. by jane thorpe
6
by Kelvin Qin
Going it alone. by jane thorpe
1
by Yeikel
Question on writing batch synchronized incremental graph algorithms by Kaan Sancak
0
by Kaan Sancak
Spark interrupts S3 request backoff by Lian Jiang
2
by Gabor Somogyi
Spark Streaming not working by mailfordebu
8
by maasg
Driver pods stuck in running state indefinitely by Prudhvi Chennuru (CO...
3
by ZHANG Wei
covid 19 Data [DISCUSSION] by jane thorpe
3
by jane thorpe
COVID 19 data by jane thorpe
0
by jane thorpe
Fwd: How to import PySpark into Jupyter by Yasir Elgohary
1
by Akchhaya S
Serialization or internal functions? by Yeikel
4
by Vadim Semenov-3
[Spark MLlib]: Multiple input dataframes and non-linear ML pipeline by Qingsheng Ren
0
by Qingsheng Ren
[Spark MLlib]: Multiple input dataframes and non-linear ML pipeline by Qingsheng Ren
0
by Qingsheng Ren
Can you view thread dumps on spark UI if job finished by Ruijing Li
3
by Zahid Rahman
Spark Streaming on Compact Kafka topic - consumers 1 message per partition per batch by sd.hrishi
2
by sd.hrishi
How to handle Blank values in Array of struct elements in pyspark by anbutech
0
by anbutech
IDE suitable for Spark by Zahid Rahman
7
by Som Lima
Scala version compatibility by Andrew Melo
6
by Koert Kuipers
Spark Union Breaks Caching Behaviour by Yi Huang
0
by Yi Huang
Lifecycle of a map function by Vadim Vararu
0
by Vadim Vararu
Fwd: HDFS file hdfs://127.0.0.1:9000/hdfs/spark/examples/README.txt by jane thorpe
2
by jane thorpe
spark-submit exit status on k8s by Marshall Markham
7
by Marshall Markham
pandas_udf is very slow by Lian Jiang
3
by Gourav Sengupta
Security vulnerabilities due to Jackson Databind by simonhampe
0
by simonhampe
Spark, read from Kafka stream failing AnalysisException by Sumit Agrawal
1
by Tathagata Das
(float(9)/5)*x + 32) when x = 12.8 by jane thorpe
0
by jane thorpe
12345678 ... 405