Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1234 ... 391
Topics (13652)
Replies Last Post Views
Low cache hit ratio when running Spark on Alluxio by Jerry Yan
1
by Bin Fan
Can I set the Alluxio WriteType in Spark applications? by Mark Zhao
1
by Bin Fan
spark 2.x design docs by Kamal7.Kumar
4
by Yeikel
Incorrect results in left_outer join in DSv2 implementation with filter pushdown - spark 2.3.2 by Shubham Chaurasia
0
by Shubham Chaurasia
(no subject) by geoHeil
0
by geoHeil
unsubscribe by Mario Amatucci
0
by Mario Amatucci
intermittent Kryo serialization failures in Spark by Jerry Vinokurov
4
by Vadim Semenov-3
custom rdd - do I need a hadoop input format? by Marcelo Valle
3
by Marcelo Valle
How to Integrate Spark mllib Streaming Training Models To Spark Structured Streaming by Praful Rana
0
by Praful Rana
How to integrates MLeap to Spark Structured Streaming by Praful Rana
0
by Praful Rana
how can I dynamic parse json in kafka when using Structured Streaming by lk_spark
2
by lk_spark
Conflicting PySpark Storage Level Defaults? by grp
2
by grp
Can anyone suggest what is wrong with my spark job here? by Shyam P
0
by Shyam P
Unable to verify in-transit encryption by G R
0
by G R
Classloading issues when using connectors with Uber jars with improper Shading in single Spark job by Sharma, Praneet
1
by Sharma, Praneet
Monitor Spark Applications by raman gugnani
3
by Alex Landa
[Spark SQL]: Does Union operation followed by drop duplicate follows "keep first" by Abhinesh Hada
4
by Dhaval Patel
Partitioning query by ☼ R Nair (रविशंकर ना...
0
by ☼ R Nair (रविशंकर ना...
Cluster sizing by Riccardo Ferrari
0
by Riccardo Ferrari
Exception when reading multiline JSON file by Kumaresh AK
1
by kevin.r.mellott
Inconsistent dataset behavior between file and in-memory versions by Dean Arnold
0
by Dean Arnold
Spark Kafka Streaming making progress but there is no data to be consumed by Charles vinodh
7
by Charles vinodh
script running in jupyter 6-7x faster than spark submit by Dhrubajyoti Hati
14
by AbdealiJK
Access all of the custom streaming query listeners that were registered to spark session by Natalie Ruiz
1
by Gabor Somogyi
question about pyarrow.Table to pyspark.DataFrame conversion by Artem Kozhevnikov
1
by Bryan Cutler
Deadlock using Barrier Execution by csmith
0
by csmith
Custom encoders and udf's by jelmer
0
by jelmer
[ANNOUNCE] Announcing Apache Spark 2.3.4 by Kazuaki Ishizaki
0
by Kazuaki Ishizaki
Problem upgrading from 2.3.1 to 2.4.3 with gradle by Nathan Kronenfeld-2
0
by Nathan Kronenfeld-2
Re: read image or binary files / spark 2.3 by Peter Liu
1
by Peter Liu
OOM Error by Ankit Khettry
9
by Ankit Khettry
how to refresh the loaded non-streaming dataframe for each steaming batch ? by Shyam P
4
by Shyam P
Question on streaming job wait and re-run by David Zhou
0
by David Zhou
[Spark Streaming Kafka 0-10] - What was the reason for adding "spark-executor-" prefix to group id in executor configurations by Sethupathi T
5
by Sethupathi T
DataSourceV2: pushFilters() is not invoked for each read call - spark 2.3.2 by Shubham Chaurasia
1
by Hyukjin Kwon
1234 ... 391