Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (12766)
Replies Last Post Views
Spark DataSets and multiple write(.) calls by Rico B.
4
by Rico B.
streaming pdf by Nicolas Paris-2
4
by Jörn Franke
[Spark SQL] [Spark 2.4.0] v1 -> struct(v1.e) fails by François Sarradin
1
by kathleen li
PySpark Streaming and Secured Kafka. by Ramaswamy, Muthurama...
0
by Ramaswamy, Muthurama...
[Spark Structured Streaming]: Read kafka offset from a timestamp by puneetloya
1
by Jungtaek Lim
exhaustive list of configuration options by Shiyuan
0
by Shiyuan
Regression of external shuffle service spark 2.3 vs spark 2.2 by igor.berman
0
by igor.berman
Pre build for apache 2.4 broken by b-moisson
0
by b-moisson
CVE-2018-17190: Unsecured Apache Spark standalone executes user code by Sean Owen
0
by Sean Owen
Delta Logic in Spark by Mahender Sarangam
0
by Mahender Sarangam
Equivalent of emptyDataFrame in StructuredStreaming by arunodhaya80
2
by arunodhaya80
[Spark Shell on AWS K8s Cluster]: Is there more documentation regarding how to run spark-shell on k8s cluster? by Zhang, Yuqi
10
by Zhang, Yuqi
java.lang.ClassCastException: org.apache.spark.sql.catalyst.expressions.GenericRowWithSchema cannot be cast to Case class by Daniel Zhang
2
by Rico B.
programmatically set hadoop_conf_dir for spark by 崔苗(数据与人工智能产品开发部)
0
by 崔苗(数据与人工智能产品开发部)
spark in jupyter cannot find a class in a jar by Lian Jiang
0
by Lian Jiang
writing to local files on a worker by lordjoe
4
by lordjoe
How to address seemingly low core utilization on a spark workload? by Vitaliy Pisarev
11
by Thakrar, Jayesh
Testing Apache Spark applications by Omer.Ozsakarya
3
by Lars Albertsson
Using cosinSimilarity method for getting pairwise documents similarity by Soheil Pourbafrani
0
by Soheil Pourbafrani
Using columnSimilarity with threshold result in greater than one by Soheil Pourbafrani
0
by Soheil Pourbafrani
Re: How to address seemingly low core utilization on a spark workload? by Vitaliy Pisarev
0
by Vitaliy Pisarev
Measure Serialization / De-serialization Time by Jack Kolokasis
0
by Jack Kolokasis
[Spark SQL] [Spark 2.4.0] Performance regression when reading parquet files from S3 by Yann Moisan
0
by Yann Moisan
[SPARK-SQL] Writing partitioned parquet requires huge amounts of memory by Lienhart, Pierre (DI...
0
by Lienhart, Pierre (DI...
Read Avro Data using Spark Streaming by Divya Narayan
2
by smikesh
[Spark SQL] Does Spark group small files by Yann Moisan
2
by Lienhart, Pierre (DI...
[ANNOUNCE] Apache Toree 0.3.0-incubating Released by Luciano Resende
0
by Luciano Resende
[ANNOUNCE] Apache Bahir 2.2.2 Released by Luciano Resende
0
by Luciano Resende
[ANNOUNCE] Apache Bahir 2.1.3 Released by Luciano Resende
0
by Luciano Resende
inferred schemas for spark streaming from a Kafka source by Colin Williams-2
0
by Colin Williams-2
Failed to convert java.sql.Date to String by luby
0
by luby
Bucketing by Sai Kiran Kodukula
0
by Sai Kiran Kodukula
question about barrier execution mode in Spark 2.4.0 by Joe-2
0
by Joe-2
Questions on Python support with Spark by Arijit Tarafdar
1
by Patrick McCarthy-2
FW: Spark2 and Hive metastore by Ирина Шершукова
1
by Sergey B.