Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1234567 ... 414
Topics (14463)
Replies Last Post Views
Lazy Spark Structured Streaming by Phillip Henry
4
by Jungtaek Lim-2
PySpark documentation main page by Hyukjin Kwon
0
by Hyukjin Kwon
Spark events log behavior in interactive vs batch job by Sriram Ganesh
0
by Sriram Ganesh
[Spark ML] existence of Matrix Factorization ALS algorithm's log version by James Yuan
2
by James Yuan
how to copy from one cassandra cluster to another by Amit Sharma
1
by Russell Spitzer
Is possible to give options when reading semistructured files using SQL Syntax? by Daniel de Oliveira M...
0
by Daniel de Oliveira M...
Load ML Pipeline model with UDF & Custom Transformer on Spark local mode by ihainan
0
by ihainan
Spark Stremaing - Dstreams - Removing RDD by forece85
0
by forece85
Secrets in Spark apps by Dávid Szakállas
0
by Dávid Szakállas
How to map DataSet row to Struct in java? by anuragDada
0
by anuragDada
How to map DataSet row to Struct in java? by anuragDada
0
by anuragDada
Spark memory distribution by dben
0
by dben
test by Suat Toksöz
1
by WranglingData
Apache Spark- Help with email library by sn.noufal
2
by Suat Toksöz
Spark 3 connect to Hive 1.2 by Ashika Umanga
2
by Suat Toksöz
Guidance by Suat Toksöz
0
by Suat Toksöz
http://spark.apache.org/docs/2.3.0/api/python/pyspark.sql.html#module-pyspark.sql.functions, v2.3.0.2.6.5.0-292 by Bredenkamp, Ben B
1
by Bredenkamp, Ben B
Kafka with Spark Streaming work on local but it doesn't work in Standalone mode by Davide Curcio
1
by Gabor Somogyi
spark exception by Amit Sharma
1
by Russell Spitzer
How to introduce reset logic when aggregating/joining streaming dataframe with static dataframe for spark streaming by spark-learner
0
by spark-learner
Unable to run bash script when using spark-submit in cluster mode. by Nasrulla Khan Haris
1
by Nasrulla Khan Haris
[Spark 3.0.0] Job fails with NPE - worked in Spark 2.4.4 by Neelesh Salian
0
by Neelesh Salian
Future timeout by Amit Sharma
7
by murat migdisoglu
Spark Job Fails with Unknown Error writing to S3 from AWS EMR by koti reddy
1
by Shriraj Bhardwaj
Spark DataFrame Creation by Mark Bidewell
2
by Andrew Melo
How to optimize the configuration and/or code to solve the cache overloading issue? by spark-learner
0
by spark-learner
spark job delay when starting by Bulldog20630405
0
by Bulldog20630405
java.lang.ClassNotFoundException: com.hortonworks.spark.cloud.commit.PathOutputCommitProtoco by murat migdisoglu
4
by Gourav Sengupta
Refreshing static data with streaming data at regular Intervals by Debabrata Ghosh
0
by Debabrata Ghosh
Using pyspark with Spark 2.4.3 a MultiLayerPerceptron model givens inconsistent outputs if a large amount of data is fed into it and at least one of the model outputs is fed to a Python UDF. by Ben Smith
3
by Ben Smith
Needed some best practices to integrate Spark with HBase by Debabrata Ghosh
1
by YogeshGovi
Insert overwrite using select within same table by Utkarsh Jain
1
by Umesh Bansal
Garbage collection issue by Amit Sharma
3
by Russell Spitzer
Insert overwrite using select with in same table by Utkarsh Jain
0
by Utkarsh Jain
persistent tables in DataSource api V2 by fansparker
6
by fansparker
1234567 ... 414