Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (12555)
Topic | Replies | Last Post
Spark Streaming - Kafka. java.lang.IllegalStateException: This consumer has already been closed. by Guillermo Ortiz Fern... | 3 | by Cody Koeninger
Default Java Opts Standalone by Evelyn Bayes | 1 | by Sonal Goyal
Spark Structured Streaming checkpointing with S3 data source by sherif98 | 0 | by sherif98
Spark code to write to MySQL and Hive by ryandam.9 | 4 | by Sonal Goyal
Long term arbitrary stateful processing - best practices by monohusche | 0 | by monohusche
Is there a plan for official spark-avro/spark-orc read/write library using Data Source V2 by yxchen | 0 | by yxchen
Plans for Session Windows? by msukmanowsky | 5 | by Arun Mahadevan
Parallelism: behavioural difference in version 1.2 and 2.1!? by jeevan.ks | 2 | by jeevan.ks
Which Py4J version goes with Spark 2.3.1? by Aakash Basu-2 | 1 | by Gourav Sengupta
Spark udf from external jar without enabling Hive by Swapnil Chougule | 0 | by Swapnil Chougule
java.lang.OutOfMemoryError: Java heap space - Spark driver. by Guillermo Ortiz Fern... | 0 | by Guillermo Ortiz Fern...
Pitfalls of partitioning by host? by Patrick McCarthy-2 | 11 | by Patrick McCarthy-2
RDD Collect Issue by Aakash Basu-2 | 0 | by Aakash Basu-2
Slow Query Plan Generation by Rosbrook, Andrew J | 2 | by Rosbrook, Andrew J
How do I generate current UTC timestamp in raw spark sql? by kant kodali | 1 | by Nikita Goyal
How to use 'insert overwrite [local] directory' correctly? by Bang Xiao | 3 | by Xiao Li
Re: About the question of Spark Structured Streaming window output by maasg | 5 | by zrc@zjdex.com
How to deal with context dependent computing? by JF Chen | 3 | by devjyoti patra
Spark Structured Streaming using S3 as data source by sherif98 | 2 | by sherif98
java.io.NotSerializableException: org.apache.spark.sql.TypedColumn by zzcclp | 0 | by zzcclp
Fw:multiple group by action by 崔苗 | 1 | by rxin
Handling Very Large volume(500TB) data using spark by Great Info | 0 | by Great Info
Caching small Rdd's take really long time and Spark seems frozen by Guillermo Ortiz | 4 | by Sonal Goyal
About the question of Spark Structured Streaming window output by zrc@zjdex.com | 0 | by zrc@zjdex.com
How to merge multiple rows by msbreuer | 2 | by Patrick McCarthy-2
No space left on device by lordjoe | 4 | by Gourav Sengupta
: Failed to create file system watcher service: User limit of inotify instances reached or too many open files by Polisetti, Venkata S... | 0 | by Polisetti, Venkata S...
CBO not predicting cardinality on partition columns for Parquet tables by rajat mishra | 0 | by rajat mishra
Insert a pyspark dataframe in postgresql by dimitris plakas | 0 | by dimitris plakas
Structured Streaming on Kubernetes by Krishna Kalyan | 5 | by puneetloya
Spark with Scala : understanding closures or best way to take udf registrations' code out of main and put in utils by aastha | 0 | by aastha
Unsubscribe by Happy每一天 | 0 | by Happy每一天
Why repartitionAndSortWithinPartitions slower than MapReducer by 周浥尘 | 2 | by Koert Kuipers
Two different Hive instances running by Fabio Wada | 2 | by Vaibhav Kulkarni
Refresh broadcast variable when it isn't the value. by Guillermo Ortiz Fern... | 0 | by Guillermo Ortiz Fern...