Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1234 ... 429
Topics (14999)
Replies Last Post Views
Missing stack function from SQL functions API by Dávid Szakállas
0
by Dávid Szakállas
sparkml random forest classifier not learning (at all) compared to H2O implementation (on same data)? by Reed Villanueva
1
by Reed Villanueva
class KafkaCluster related errors by kiranbiswal
6
by Mich Talebzadeh
Need help to create database and integration woth Spark App in local machine by Himanshu Soni
0
by Himanshu Soni
Re: Spark-sql can replace Hive ? by ayan guha
0
by ayan guha
Apply window function on data consumed from Kafka topic by Muhammed Favas
0
by Muhammed Favas
NoSuchMethodError: org.apache.spark.network.util.AbstractFileRegion.transferred by XiaoboGu
5
by mirkel
Distributing a FlatMap across a Spark Cluster by Tom Barber
30
by Tom Barber-2
Spark Standalone Authentication and Encryption by N, Bharath
0
by N, Bharath
Problem in Restoring ML Pipeline with UDF by Artemis User
1
by srowen
addPyFile error: NotADirectoryError: [Errno 20] Not a directory by Gourav Sengupta-2
0
by Gourav Sengupta-2
Petastorm vs horovod vs tensorflowonspark vs spark_tensorflow_distributor by Gourav Sengupta-2
2
by Gourav Sengupta-2
Max of multiple columns of a row in spark by kushagra deep-2
1
by kushagra deep-2
RepartitionByCassandraReplica API Support on K8s by ranju goel
0
by ranju goel
[Spark SQL][Intermediate][How to] Custom transformation to datasource V2 write apis by Sivabalan
0
by Sivabalan
Kube estimate for Spark by Subash Prabanantham
2
by femibyte
Questions about `CreateViewCommand` by Zhun Wang
0
by Zhun Wang
Reading Large File in Pyspark by Sukanya Sarma
2
by Gourav Sengupta-2
Missing module spark-hadoop-cloud in Maven central by Erik Torres
3
by Steve Loughran-2
[apache spark] Does Spark 2.4.8 have issues with ServletContextHandler by Kanchan Kauthale
3
by Kanchan Kauthale
S3 Access Issues - Spark by khajaasmath786
1
by Badrinath Patchikoll...
[ANNOUNCE] Apache Spark 3.1.2 released by Dongjoon Hyun-2
2
by Xiao Li-2
Spark Structured Streaming by sheelstera
2
by sheelstera
Reading parquet files in parallel on the cluster by Eric Beabes
8
by Boris Litvak
spark sql StackOverflowError by Deemo
1
by Mich Talebzadeh
Load Share point list(file) data to impala table using pyspark by Rao Bandaru
1
by Rao Bandaru
Profiling options for PandasUDF (2.4.7 on yarn) by Patrick McCarthy-2
0
by Patrick McCarthy-2
mqtt module by jianxu
0
by jianxu
can not find module of mqtt under pyspark.streaming by jianxu
0
by jianxu
Calculate average from Spark stream by Giuseppe Ricci
13
by Mich Talebzadeh
Accumulators and other important metrics for your job by Hamish Whittal
0
by Hamish Whittal
NullPointerException in SparkSession while reading Parquet files on S3 by Eric Beabes
1
by YEONWOO BAEK
Spark query performance of cached data affected by RDD lineage by fwy
3
by fwy
Re: About Spark executs sqlscript by Mich Talebzadeh
0
by Mich Talebzadeh
Spark Prometheus Metrics for Executors Not Working by paulp-2
1
by Luca Canali
1234 ... 429