Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1234567 ... 346
Topics (12100)
Replies Last Post Views
AccumulatorV2 vs AccumulableParam (V1) by Sergey Zhemzhitsky
2
by Sergey Zhemzhitsky
SparkContext taking time after adding jars and asking yarn for resources by neeravsalaria
0
by neeravsalaria
question on collect_list or say aggregations in general in structured streaming 2.3.0 by kant kodali
3
by kant kodali
native-lzo library not available by Fawze Abujaber
3
by ayan guha
Re: Read or save specific blocks of a file by Zois Theodoros
1
by ayan guha
[Structured streaming, V2] commit on ContinuousReader by xjrk
0
by xjrk
org.apache.spark.shuffle.FetchFailedException: Too large frame: by Pralabh Kumar
1
by Pralabh Kumar
MappingException - org.apache.spark.mllib.classification.LogisticRegressionModel.load by Mina Aslani
0
by Mina Aslani
Uncaught exception in thread heartbeat-receiver-event-loop-thread by Shiyuan
2
by ccherng
ConcurrentModificationException by ccherng
0
by ccherng
Problem in persisting file in S3 using Spark: xxx file does not exist Exception by Marco Mistroni
2
by Marco Mistroni
Running apps over a VPN by Christopher Piggott
0
by Christopher Piggott
(no subject) by Filippo Balicchia
0
by Filippo Balicchia
how to trace sparkDriver context creation for pyspark by Mihai Iacob
0
by Mihai Iacob
ML Linear and Logistic Regression - Poor Performance by Zois Theodoros
3
by Irving Duran
spark.executor.extraJavaOptions inside application code by Agostino Calamita
1
by Vadim Semenov-2
smarter way to "forget" DataFrame definition and stick to its values by Valery Khamenya
1
by JayeshLalwani
Dataset Caching and Unpersisting by Daniele Foroni
0
by Daniele Foroni
what is the query language used for graphX? by kant kodali
0
by kant kodali
[Spark scheduling] Spark schedules single task although rdd has 48 partitions? by Paul Borgmans
0
by Paul Borgmans
[Spark Streaming]: Does DStream workload run over Spark SQL engine? by Khaled Zaouk
1
by Saisai Shao
Poor performance reading Hive table made of sequence files by Patrick McCarthy
0
by Patrick McCarthy
keep getting empty table while using saveAsTable() to save DataFrame as table by nicholasl
0
by nicholasl
Filter one dataset based on values from another by lsn24
2
by lsn24
Fast Unit Tests by marcos rebelo
2
by Yeikel Santana
all calculations finished, but "VCores Used" value remains at its max by Valery Khamenya
1
by Felix Cheung
[Spark 2.x Core] .collect() size limit by klrmowse
11
by klrmowse
Dataframe vs dataset by MidwestMike
4
by MidwestMike
spark.python.worker.reuse not working as expected by 880f0464
0
by 880f0464
UnresolvedException: Invalid call to dataType on unresolved object by 880f0464
0
by 880f0464
Spark launcher listener not getting invoked k8s Spark 2.3 by purna m
1
by Marcelo Vanzin
NullPointerException when scanning HBase table by Huiliang Zhang
0
by Huiliang Zhang
re: spark streaming / AnalysisException on collect() by Peter Liu
0
by Peter Liu
[Spark on Google Kubernetes Engine] Properties File Error by Eric Wang
4
by Yinan Li
Best practices to keep multiple version of schema in Spark by unk1102
0
by unk1102
1234567 ... 346