Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (14463)
Topic, Replies, Last Post
Is there any possibility to avoid double computation in case of RDD checkpointing by Ivan Petrov
0
by Ivan Petrov
Spark - Scala-Java interoperablity by Ramesh Mathikumar
1
by srowen
Fwd: Time stamp in Kafka by khajaasmath786
0
by khajaasmath786
Appropriate checkpoint interval in a spark streaming application by sheelstera
1
by sheelstera
ThriftServer LDAP doesn't work by ravi6c2
0
by ravi6c2
Where do the executors get my app jar from? by James Yu
5
by James Yu
Kafka spark structure streaming out of memory issue by km.santanu
1
by Srinivas V
help on use case - spark parquet processing by manjay
1
by Amit Sharma
How can I use pyspark to upsert one row without replacing entire table by Siavash Namvar
5
by Siavash Namvar
Spark ShutdownHook through python jobs. by Shriraj Bhardwaj
0
by Shriraj Bhardwaj
[SPARK-STRUCTURED-STREAMING] IllegalStateException: Race while writing batch 4 by Amit Joshi
1
by Jungtaek Lim-2
[Spark SQL]: Rationale for access modifiers and qualifiers in Spark by 김민우
0
by 김민우
Spark Streaming with Kafka and Python by Hamish Whittal
2
by srowen
S3 read/write from PySpark by Daniel Stojanov
4
by Stephen Coy
Support for group aggregate pandas UDF in streaming aggregation for SPARK 3.0 python by Aesha Dhar Roy
0
by Aesha Dhar Roy
Spark Structured streaming 2.4 - Kill and deploy in yarn by khajaasmath786
0
by khajaasmath786
regexp_extract regex for extracting the columns from string by anbutech
2
by Enrico Minack
Spark streaming receivers by Dark Crusader
3
by Russell Spitzer
Streaming AVRO data in console: java.lang.ArrayIndexOutOfBoundsException by dwgw
0
by dwgw
[Spark-Kafka-Streaming] Verifying the approach for multiple queries by Amit Joshi
1
by tianlangstudio
Spark batch job chaining by Amit Sharma
2
by Jun Zhu-2
[SPARK-SQL] How to return GenericInternalRow from spark udf by Amit Joshi
1
by srowen
join doesn't work by nt
0
by nt
Understanding Spark execution plans by Daniel Stojanov
0
by Daniel Stojanov
Multi insert with join in Spark SQL by moqi
0
by moqi
Tab delimited csv import and empty columns by Stephen Coy
5
by Stephen Coy
Comments conventions in Spark distribution official examples by Fuad Efendi
1
by srowen
Async API to save RDDs? by Antonin Delpeuch (li...
0
by Antonin Delpeuch (li...
file importing / hibernate by nt
0
by nt
Renaming a DataFrame column makes Spark lose partitioning information by Antoine Wendlinger
2
by Antoine Wendlinger
Pyspark: Issue using sql in foreachBatch sink by mmuru
2
by mmuru
What is an "analytics engine"? by Boris Gershfield
1
by tianlangstudio
DataSource API v2 & Spark-SQL by Lavelle, Shawn
2
by Lavelle, Shawn
CVE-2020-9480: Apache Spark RCE vulnerability in auth-enabled standalone master by Sean Owen
1
by Sean Owen
[Spark SQL]: Can't write DataFrame after using explode function on multiple columns. by hesouol
5
by hesouol