Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (14578)
Replies Last Post Views
How to introduce reset logic when aggregating/joining streaming dataframe with static dataframe for spark streaming by spark-learner
0
by spark-learner
Unable to run bash script when using spark-submit in cluster mode. by Nasrulla Khan Haris
1
by Nasrulla Khan Haris
[Spark 3.0.0] Job fails with NPE - worked in Spark 2.4.4 by Neelesh Salian
0
by Neelesh Salian
Future timeout by Amit Sharma-2
7
by murat migdisoglu
Spark Job Fails with Unknown Error writing to S3 from AWS EMR by koti reddy
1
by Shriraj Bhardwaj
Spark DataFrame Creation by Mark Bidewell
2
by Andrew Melo
How to optimize the configuration and/or code to solve the cache overloading issue? by spark-learner
0
by spark-learner
spark job delay when starting by Bulldog20630405
0
by Bulldog20630405
java.lang.ClassNotFoundException: com.hortonworks.spark.cloud.commit.PathOutputCommitProtoco by murat migdisoglu
4
by Gourav Sengupta
Refreshing static data with streaming data at regular Intervals by Debabrata Ghosh
0
by Debabrata Ghosh
Using pyspark with Spark 2.4.3 a MultiLayerPerceptron model gives inconsistent outputs if a large amount of data is fed into it and at least one of the model outputs is fed to a Python UDF. by Ben Smith
3
by Ben Smith
Needed some best practices to integrate Spark with HBase by Debabrata Ghosh
1
by YogeshGovi
Insert overwrite using select within same table by Utkarsh Jain
1
by Umesh Bansal
Garbage collection issue by Amit Sharma-2
3
by Russell Spitzer
Insert overwrite using select with in same table by Utkarsh Jain
0
by Utkarsh Jain
persistent tables in DataSource api V2 by fansparker
6
by fansparker
Spark Streaming - Set Parallelism and Optimize driver by forece85
4
by Russell Spitzer
Spark UI by venkatadevarapu
3
by Artemis User
How to monitor the throughput and latency of the combineByKey transformation in Spark 3? by felipe.o.gutierrez
0
by felipe.o.gutierrez
Does Spark support column scan pruning for array of structs? by Haijia Zhou
0
by Haijia Zhou
Spark Structured Streaming keeps on consuming usercache by spark-learner
1
by Piyush Acharya
Spark ETL use case by codingkapoor
0
by codingkapoor
Spark Deployment Strategy by codingkapoor
0
by codingkapoor
Spark 3.0 with Hadoop 2.6 HDFS/Hive by Ashika Umanga
4
by DB Tsai-3
Overwrite Mode not Working Correctly in spark 3.0.0 by anbutech
2
by anbutech
Schedule/Orchestrate spark structured streaming job by anbutech
1
by Piyush Acharya
OOM while processing read/write to S3 using Spark Structured Streaming by Rachana Srivastava
3
by Piyush Acharya
subscribe by Piyush Acharya
0
by Piyush Acharya
Are there some pitfalls in my spark structured streaming code which causes slow response after several hours running? by spark-learner
1
by Jörn Franke
File not found exceptions on S3 while running spark jobs by Nagendra Darla
4
by Hulio andres
Spark 3.0.0 spark.read.json never completes by Sanjeev Mishra
1
by JasonLee
How To Access Hive 2 Through JDBC Using Kerberos by Daniel de Oliveira M...
5
by Daniel de Oliveira M...
Issue in parallelization of CNN model using spark by Mukhtaj Khan
9
by Mukhtaj Khan
“Pyspark.zip does not exist” using Spark in cluster mode with Yarn by Davide Curcio
1
by Hulio andres
Using spark.jars conf to override jars present in spark default classpath by nupurshukla
4
by Russell Spitzer