Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
12345678 ... 411
Topics (14385)
Replies Last Post Views
apache-spark mongodb dataframe issue by Mannat Singh
2
by Mannat Singh
Data Explosion and repartition before group bys by lsn24
0
by lsn24
Spark 3 pod template for the driver by msumbul
2
by msumbul
Spark 3 pod template for the driver by Michel Sumbul
0
by Michel Sumbul
Getting PySpark Partitions Locations by Tzahi File
4
by srowen
Where are all the jars gone ? by Anwar AliKhan
5
by Anwar AliKhan
Error: Vignette re-building failed. Execution halted by Anwar AliKhan
3
by Anwar AliKhan
Arrow RecordBatches to Spark Dataframe by Tanveer Ahmad - EWI
0
by Tanveer Ahmad - EWI
High Availability for spark streaming application running in kubernetes by shensonj
0
by shensonj
LynxKite is now open-source by Daniel Darabos-2
1
by Daniel Darabos-2
Found jars in /assembly/target/scala-2.12/jars by Anwar AliKhan
0
by Anwar AliKhan
[Spark Structured Streaming] predicate pushdown in custom connector source. by Rahul Kumar
0
by Rahul Kumar
How to disable pushdown predicate in spark 2.x query by Mohit
1
by Xiao Li-2
Documentation on SupportsReportStatistics Outdated? by Micah Kornfield
0
by Micah Kornfield
Using hadoop-cloud_2.12 jars by Rahij Ramsharan
2
by Rahij Ramsharan
Reg - Why Apache Hadoop need to be Installed separately for Running Apache Sparkā€¦? by Praveen Kumar Ramach...
0
by Praveen Kumar Ramach...
we control spark file names before we write them - should we opensource it? by ilaimalka
4
by ilaimalka
Spark Thrift Server in Kubernetes deployment by Subash K
1
by Rao, Abhishek (Nokia...
Unsubscribe by Punna Yenumala
1
by Wesley-2
Hey good looking toPandas () by Anwar AliKhan
6
by Anwar AliKhan
Kafka Zeppelin integration by ilavalasr
1
by Alex Ott
[pyspark 2.3+] read/write huge data with smaller block size (128MB per block) by rishishah.star
2
by rishishah.star
Reading TB of JSON file by Chetan Khatri
11
by Chetan Khatri
Re: [ANNOUNCE] Apache Spark 3.0.0 by Gourav Sengupta
3
by Jungtaek Lim-2
Custom Metrics by bryan.jeffrey@gmail....
0
by bryan.jeffrey@gmail....
GPU Acceleration for spark-3.0.0 by charles_cai
3
by Bobby Evans-2
Unsubscribe by Angel Angel
6
by Jeff Evans
how to know what happen between tasks launch by lk_spark
0
by lk_spark
Check point storage and its redundancy by shensonj
0
by shensonj
Spark dataframe creation through already distributed in-memory data sets by Tanveer Ahmad - EWI
0
by Tanveer Ahmad - EWI
GroupBy issue while running K-Means - Dataframe by Deepak Sharma
0
by Deepak Sharma
Broadcast join data reuse by tcondie
2
by gypsysunny
[2.4.5 Standalone Master]: Idle cores not being allocated by krchia
0
by krchia
[spark-structured-streaming] [stateful] by Srinivas V
0
by Srinivas V
Accessing Teradata DW data from Spark by Mich Talebzadeh
1
by Gourav Sengupta
12345678 ... 411