Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
12345678 ... 426
Topics (14898)
Replies Last Post Views
Unsubscribe by hodgesz
0
by hodgesz
Spark SQL Macros by hbutani
0
by hbutani
[Spark SQL] - Not able to consume Kafka topics by Rathore, Yashasvini
2
by Rathore, Yashasvini
Spark SQL Dataset and BigDecimal by Ivan Petrov
3
by Khalid Mammadov
Bursting Your On-Premises Data Lake Analytics and AI Workloads on AWS by Bin Fan-2
0
by Bin Fan-2
how to serve data over JDBC using simplest setup by Scott Ribe
7
by Lalwani, Jayesh
understanding spark shuffle file re-use better by Koert Kuipers
4
by Mandloi87-2
PySpark registerJavaUDAF doesn't accept UDAF Aggregator (Spark 3) by Grégory Dugernier
0
by Grégory Dugernier
KafkaUtils module not found on spark 3 pyspark by aupres
1
by Jungtaek Lim-2
Using Custom Scala Spark ML Estimator in PySpark by Harsh
3
by Harsh
vm.swappiness value for Spark on Kubernetes by Jahar Tyagi
1
by srowen
Using DataFrame to Read Avro files by VenkateshDurai
0
by VenkateshDurai
K8S spark-submit Loses Successful Driver Completion by Marshall Markham
1
by Attila Zsolt Piros
[SPARK-SQL] Does Spark 3.0 support parquet predicate pushdown for array of structs? by Haijia Zhou-2
0
by Haijia Zhou-2
How to handle spark state which is growing too big even with timeout set. by Robin Kuttaiah
1
by Jungtaek Lim-2
Does Spark 3.0 support parquet predicate pushdown for array of structures by Haijia Zhou-2
0
by Haijia Zhou-2
Spark Kubernetes 3.0.1 | podcreationTimeout not working by Ranju Jain
2
by Attila Zsolt Piros
Spark structured streaming with periodical persist and unpersist by act_coder
0
by act_coder
Unsubscribe by Sunil Prabhakara
0
by Sunil Prabhakara
Trigger on GroupStateTimeout with no new data in group by Abhishek Gupta
0
by Abhishek Gupta
Spark as an application server cache by javaguy44
0
by javaguy44
Issue with accessing S3 from EKS spark pod by Rishabh Jain
4
by Rishabh Jain
unsubscribe by Ricardo Sardenberg
0
by Ricardo Sardenberg
Testing ETL with Spark using Pytest by Mich Talebzadeh
6
by Mich Talebzadeh
Announcing Hyperspace v0.4.0 - an indexing subsystem for Apache Spark™ by imback82
0
by imback82
Getting : format(target_id, ".", name), value) .. error by shahabm
0
by shahabm
Converting RelationalGroupedDataSet to DataFrame by Soheil Pourbafrani
1
by Stéphane Verlet-2
Data source v2 streaming sinks does not support Update mode by Eric Beabes
18
by Eric Beabes
Databricks Spark Parallelism and Shuffle Partitions by Erica Lin
1
by Subhash Sriram
Exporting all Executor Metrics in Prometheus format in K8s cluster by Dávid Szakállas
0
by Dávid Szakállas
Spark Event Log Forwarding and Offset Tracking by raymond.tan
2
by raymond.tan
Large Scheduler Delay Causing Performance Issue in Spark Application by Akshat Bordia
0
by Akshat Bordia
Flink 1.11.3从Kafka提取数据到Hive问题求助 by 邮件帮助中心
0
by 邮件帮助中心
[Spark on Kubernetes] Spark Application dependency management Question. by xgong
0
by xgong
Poor performance caused by coalesce to 1 by James Yu
7
by Silvio Fiorito
12345678 ... 426