Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (14463)
Replies Last Post
[ANNOUNCE] Announcing Apache Spark 3.0.1 by 郑瑞峰
2
by Wenchen Fan
Missing / Duplicate Data when Spark retries by Ruijing Li
2
by Ruijing Li
Spark 3.0 using S3 taking long time for some set of TPC DS Queries by Rao, Abhishek (Nokia...
8
by Rao, Abhishek (Nokia...
subscribe user@spark.apache.org by Joan
0
by Joan
arbitrary state handling in python API by Georg Heiler (TU Vie...
0
by Georg Heiler (TU Vie...
Query about Spark by Ankur Das
6
by Ankur Das
Elastic Search sink showing -1 for numOutputRows by jainshasha
3
by jainshasha
Keeping track of how long something has been in a queue by Hamish Whittal
2
by Jungtaek Lim-2
Spark Application REST API, looking for a way to kill specific task or executor by Ivan Petrov
2
by Ivan Petrov
Spark Streaming Checkpointing by andraskolbert
2
by andraskolbert
Iterating all columns in a pyspark dataframe by Devi P.V
1
by srowen
Merging Parquet Files by Tzahi File
3
by Michael Segel
Adding isolation level when reading from DB2 with spark.read by Filipa Sousa
3
by Jörg Strebel
value col is not a member of org.apache.spark.rdd.RDD by dwgw
0
by dwgw
Error while getting RDD partitions for a parquet dataframe in Spark 3 by Albert Butterscotch
0
by Albert Butterscotch
Adding Partioned Field to The File by Tzahi File
0
by Tzahi File
In driver, can I gc myArray after get a rdd by sparkContext.parallelize(myArray,100) by maqy
0
by maqy
[Spark Kafka Structured Streaming] Adding partition and topic to the kafka dynamically by Amit Joshi
4
by Amit Joshi
Connecting to Oracle Autonomous Data warehouse (ADW) from Spark via JDBC by Mich Talebzadeh
13
by kuassi.mensah
Kotlin for Apache Spark 1.0.0-preview released by Maria Khalusova
0
by Maria Khalusova
Some sort of chaos monkey for spark jobs, do we have it? by Ivan Petrov
0
by Ivan Petrov
Export subset of Oracle database by pduflot
0
by pduflot
Unsubscribe by Annabel Melongo
3
by Annabel Melongo
Referencing a scala/java PipelineStage from pyspark - constructor issues with HasInputCol by Aviad Klein
6
by srowen
Stream to Stream joins by Hamish Whittal
0
by Hamish Whittal
Delay starting jobs by Chris Thomas
3
by Ido Friedman
Structured Streaming metric for count of delayed/late data by GOEL Rajat
6
by GOEL Rajat
RDD which was checkpointed is not checkpointed by Ivan Petrov
7
by abeboparebop
Ability to have CountVectorizerModel vocab as empty by purijatin
2
by purijatin
Re: Spark3 on k8S reading encrypted data from HDFS with KMS in HA by Prashant Sharma
1
by msumbul
About how to read spark source code with a good way by 1266
3
by Jack Kolokasis
Out of scope RDDs not getting cleaned up by jainbhavya53
0
by jainbhavya53
How to migrate DataSourceV2 into Spark 3.0.0 by rafaelkyrdan
0
by rafaelkyrdan
Driver Information by Amit Sharma
0
by Amit Sharma
Block fetching fails due to change in local address by Samik Raychaudhuri
0
by Samik Raychaudhuri