Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1 ... 3456789 ... 398
Topics (13916)
Replies Last Post Views
[Structured Streaming] Robust watermarking calculation with future timestamps by Anastasios Zouzias
0
by Anastasios Zouzias
[DISCUSS] Remove sorting of fields in PySpark SQL Row construction by Bryan Cutler
5
by Bryan Cutler
Temporary tables for Spark SQL by Laurent Bastien Corb...
0
by Laurent Bastien Corb...
RE:How to use spark-on-k8s pod template? by sora
0
by sora
Driver OutOfMemoryError in MapOutputTracker$.serializeMapStatuses for 40 TB shuffle. by harelglik
9
by abeboparebop
how to limit tasks num when read hive with orc by lk_spark
0
by lk_spark
What is directory "/path/_spark_metadata" for? by Mark Zhao
1
by Bin Fan
Using Percentile in Spark SQL by Tzahi File
5
by Jerry Vinokurov
Why Spark generates Java code and not Scala? by Bartosz Konieczny
3
by Marcin Tustin
spark streaming exception by Amit Sharma
2
by Akshay Bhardwaj
Re: Build customized resource manager by Klaus Ma
2
by Klaus Ma
announce: spark-postgres 3 released by Nicolas Paris-2
0
by Nicolas Paris-2
[pyspark 2.3.0] Task was denied committing errors by rishishah.star
2
by rishishah.star
How to use spark-on-k8s pod template? by sora
1
by jdavidmitchell
Can reduced parallelism lead to no shuffle spill? by V0lleyBallJunki3
2
by V0lleyBallJunki3
What's the deal with --proxy-user? by Jeff Evans
0
by Jeff Evans
Working failed to connect to master in Spark Apache by Ashish Mittal
0
by Ashish Mittal
'requirement failed: OneHotEncoderModel expected x categorical values for input column label, but the input column had metadata specifying n values.' by Mina Aslani
1
by Mina Aslani
static dataframe to streaming by aka.fe2s
0
by aka.fe2s
A question about skew join hint by zhangliyun
0
by zhangliyun
Avro file question by Sam-2
2
by ayan guha
Fwd: Delta with intelligent upsett by ayan guha
3
by Burak Yavuz-2
XGBoost Spark One Model Per Worker Integration by grp
0
by grp
Best practices for data like file storage by Patrick McCarthy-2
0
by Patrick McCarthy-2
pyspark - memory leak leading to OOM after submitting 100 jobs? by Paul Wais
4
by Holden Karau
[Spark SQL]: Dataframe group by potential bug (Scala) by ludwiggj
0
by ludwiggj
[Spark Streaming] Apply multiple ML pipelines(Models) to the same stream by spicoflorin
0
by spicoflorin
Need help regarding logging / log4j.properties by mailfordebu
1
by Roland Johann
Low-level behavior of Exchange by Joe Naegele
0
by Joe Naegele
Fwd: Recover RFormula Column Names by Andrew Redd
3
by Alessandro Solimando
Iterative Streaming with Spark by vibhatha
0
by vibhatha
Deleting columns within nested arrays/structs? by Jeff Evans
0
by Jeff Evans
MultiObjectDeleteException by Prudhvi Chennuru (CO...
0
by Prudhvi Chennuru (CO...
Spark Cluster over yarn cluster monitoring by Chetan Khatri
3
by Chetan Khatri
Spark - configuration setting doesn't work by Chetan Khatri
3
by Chetan Khatri
1 ... 3456789 ... 398