Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1234 ... 426
Topics (14894)
Replies Last Post Views
Python level of knowledge for Spark and PySpark by Ashok Kumar-2
3
by Mich Talebzadeh
Overirde Jackson jar - Spark Submit by khajaasmath786
0
by khajaasmath786
Spark2.4 json Jackson errors by khajaasmath786
0
by khajaasmath786
Spark Session error with 30s by khajaasmath786
6
by khajaasmath786
[Spark Core][Advanced]: Problem with data locality when running Spark query with local nature on apache Hadoop by Mohamadreza Rostami
2
by Russell Spitzer
Spark and Bintray's shutdown by Florian CASTELAIN
3
by daunnc
Tasks are skewed to one executor by andraskolbert
14
by Gourav Sengupta
Automated setup of a multi-node cluster for Apache Spark by Dhruv Kumar
1
by Hariharan
Dynamic Allocation Backlog Property in Spark on Kubernetes by Ranju Jain
6
by ranju goel
[Spark SQL]:to calculate distance between four coordinates(Latitude1, Longtitude1, Latitude2, Longtitude2) in the pysaprk dataframe by Rao Bandaru
6
by ayan guha
Spark Hbase Hive error in EMR by khajaasmath786
0
by khajaasmath786
GPU job in Spark 3 by Martin Somers
4
by srowen
possible bug by Weiand, Markus, NMA-...
12
by Mich Talebzadeh
How to use spark steaming data to plot live line chart by Muhammed Favas
3
by Mich Talebzadeh
Why is Spark 3.0.x faster than Spark 3.1.x by maziyar
13
by Mich Talebzadeh
Evaluating Apache Spark with Data Orchestration using TPC-DS by Bin Fan-2
0
by Bin Fan-2
Big Broadcast Hash Join with Dynamic Partition Pruning gives wrong results by Tomas Bartalos
0
by Tomas Bartalos
How to adapt PySpark to optimize handling of large no. of partitions? by iashiq5
0
by iashiq5
Apache ML Agorithm Solution by SRITHALAM, ANUPAMA (...
3
by Mich Talebzadeh
jar incompatibility with Spark 3.1.1 for structured streaming with kafka by Mich Talebzadeh
18
by Mich Talebzadeh
Spark performance over S3 by Tzahi File
6
by Boris Litvak
Invite Spark community as Pulsar Summit NA 2021 Community Partner by Dianjin Wang
1
by Dianjin Wang
Mesos + Spark users going forward? by Sean Owen
2
by dmcwhorter
Data Lakes using Spark by Boris Litvak
0
by Boris Litvak
Tuning spark job to make count faster. by Krishna Chakka
1
by srowen
unsubscribe by Latha Appanna
0
by Latha Appanna
Spark Structured Streaming with PySpark throwing error in execution by Mich Talebzadeh
3
by Mich Talebzadeh
Ordering pushdown for Spark Datasources by Kohki Nishio
3
by Mich Talebzadeh
[SPARK SQL] Sometimes spark does not scale down on k8s by dmn42
1
by dmn42
Spark doesn't add _SUCCESS file when 'partitionBy' is used by Eric Beabes
0
by Eric Beabes
Spark structured streaming + offset management in kafka + kafka headers by AliGouta
8
by Gabor Somogyi
Writing to Google Cloud Storage with v2 algorithm safe? by Jacek Laskowski
4
by Jacek Laskowski
Source.getBatch and schema vs qe.analyzed.schema? by Jacek Laskowski
2
by Jacek Laskowski
PySpark functions for various sources and sinks by Mich Talebzadeh
0
by Mich Talebzadeh
FW: Email to Spark Org please by Williams, David (Ris...
7
by srowen
1234 ... 426