Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
12345 ... 411
Topics (14385)
Replies Last Post Views
How to map DataSet row to Struct in java? by anuragDada
0
by anuragDada
Spark memory distribution by dben
0
by dben
test by Suat Toksöz
1
by WranglingData
Apache Spark- Help with email library by sn.noufal
2
by Suat Toksöz
Spark 3 connect to Hive 1.2 by Ashika Umanga
2
by Suat Toksöz
Guidance by Suat Toksöz
0
by Suat Toksöz
http://spark.apache.org/docs/2.3.0/api/python/pyspark.sql.html#module-pyspark.sql.functions, v2.3.0.2.6.5.0-292 by Bredenkamp, Ben B
1
by Bredenkamp, Ben B
Kafka with Spark Streaming work on local but it doesn't work in Standalone mode by Davide Curcio
1
by Gabor Somogyi
spark exception by Amit Sharma
1
by Russell Spitzer
How to introduce reset logic when aggregating/joining streaming dataframe with static dataframe for spark streaming by spark-learner
0
by spark-learner
Unable to run bash script when using spark-submit in cluster mode. by Nasrulla Khan Haris
1
by Nasrulla Khan Haris
[Spark 3.0.0] Job fails with NPE - worked in Spark 2.4.4 by Neelesh Salian
0
by Neelesh Salian
Future timeout by Amit Sharma
7
by murat migdisoglu
Spark Job Fails with Unknown Error writing to S3 from AWS EMR by koti reddy
1
by Shriraj Bhardwaj
Spark DataFrame Creation by Mark Bidewell
2
by Andrew Melo
How to optimize the configuration and/or code to solve the cache overloading issue? by spark-learner
0
by spark-learner
spark job delay when starting by Bulldog20630405
0
by Bulldog20630405
java.lang.ClassNotFoundException: com.hortonworks.spark.cloud.commit.PathOutputCommitProtoco by murat migdisoglu
4
by Gourav Sengupta
Refreshing static data with streaming data at regular Intervals by mailfordebu
0
by mailfordebu
Using pyspark with Spark 2.4.3 a MultiLayerPerceptron model givens inconsistent outputs if a large amount of data is fed into it and at least one of the model outputs is fed to a Python UDF. by Ben Smith
3
by Ben Smith
Needed some best practices to integrate Spark with HBase by mailfordebu
1
by YogeshGovi
Insert overwrite using select within same table by Utkarsh Jain
1
by Umesh Bansal
Garbage collection issue by Amit Sharma
3
by Russell Spitzer
Insert overwrite using select with in same table by Utkarsh Jain
0
by Utkarsh Jain
persistent tables in DataSource api V2 by fansparker
6
by fansparker
Spark Streaming - Set Parallelism and Optimize driver by forece85
4
by Russell Spitzer
Spark UI by venkatadevarapu
3
by ArtemisDev
How to monitor the throughput and latency of the combineByKey transformation in Spark 3? by felipe.o.gutierrez
0
by felipe.o.gutierrez
Does Spark support column scan pruning for array of structs? by Haijia Zhou
0
by Haijia Zhou
Spark Structured Streaming keep on consuming usercache by spark-learner
1
by Piyush Acharya
Spark ETL use case by codingkapoor
0
by codingkapoor
Spark Deployment Strategy by codingkapoor
0
by codingkapoor
Spark 3.0 with Hadoop 2.6 HDFS/Hive by Ashika Umanga
4
by DB Tsai-3
Overwrite Mode not Working Correctly in spark 3.0.0 by anbutech
2
by anbutech
Schedule/Orchestrate spark structured streaming job by anbutech
1
by Piyush Acharya
12345 ... 411