Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (13748)
Replies Last Post Views
RE: PySpark Pandas UDF by gal.benshlomo
6
by Gourav Sengupta
Is there a merge API available for writing DataFrame by Sivaprasanna
1
by ayan guha
Explode/Flatten Map type Data Using Pyspark by anbutech
3
by ayan guha
error , saving dataframe , LEGACY_PASS_PARTITION_BY_AS_OPTIONS by asma zgolli
3
by Russell Spitzer
[Structured Streaming] Robust watermarking calculation with future timestamps by Anastasios Zouzias
0
by Anastasios Zouzias
[DISCUSS] Remove sorting of fields in PySpark SQL Row construction by Bryan Cutler
5
by Bryan Cutler
Temporary tables for Spark SQL by Laurent Bastien Corb...
0
by Laurent Bastien Corb...
RE:How to use spark-on-k8s pod template? by sora
0
by sora
Driver OutOfMemoryError in MapOutputTracker$.serializeMapStatuses for 40 TB shuffle. by harelglik
9
by abeboparebop
how to limit tasks num when read hive with orc by lk_spark
0
by lk_spark
Is RDD thread safe? by Chang Chen
0
by Chang Chen
What is directory "/path/_spark_metadata" for? by Mark Zhao
1
by Bin Fan
Using Percentile in Spark SQL by Tzahi File
5
by Jerry Vinokurov
Why Spark generates Java code and not Scala? by Bartosz Konieczny
3
by Marcin Tustin
spark streaming exception by Amit Sharma
2
by Akshay Bhardwaj
Re: Build customized resource manager by Klaus Ma
2
by Klaus Ma
announce: spark-postgres 3 released by Nicolas Paris-2
0
by Nicolas Paris-2
[pyspark 2.3.0] Task was denied committing errors by rishishah.star
2
by rishishah.star
How to use spark-on-k8s pod template? by sora
1
by jdavidmitchell
Can reduced parallelism lead to no shuffle spill? by V0lleyBallJunki3
2
by V0lleyBallJunki3
What's the deal with --proxy-user? by Jeff Evans
0
by Jeff Evans
Working failed to connect to master in Spark Apache by Ashish Mittal
0
by Ashish Mittal
'requirement failed: OneHotEncoderModel expected x categorical values for input column label, but the input column had metadata specifying n values.' by Mina Aslani
1
by Mina Aslani
static dataframe to streaming by aka.fe2s
0
by aka.fe2s
A question about skew join hint by zhangliyun
0
by zhangliyun
Avro file question by Sam-2
2
by ayan guha
Fwd: Delta with intelligent upsett by ayan guha
3
by Burak Yavuz-2
XGBoost Spark One Model Per Worker Integration by grp
0
by grp
Best practices for data like file storage by Patrick McCarthy-2
0
by Patrick McCarthy-2
pyspark - memory leak leading to OOM after submitting 100 jobs? by Paul Wais
4
by Holden Karau
[Spark SQL]: Dataframe group by potential bug (Scala) by ludwiggj
0
by ludwiggj
[Spark Streaming] Apply multiple ML pipelines(Models) to the same stream by spicoflorin
0
by spicoflorin
Need help regarding logging / log4j.properties by Debabrata Ghosh
1
by Roland Johann
Low-level behavior of Exchange by Joe Naegele
0
by Joe Naegele
Fwd: Recover RFormula Column Names by Andrew Redd
3
by Alessandro Solimando