Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (14283)
Topic | Replies | Last Post
[spark-structured-streaming] [stateful] by Srinivas V
0
by Srinivas V
Accessing Teradata DW data from Spark by Mich Talebzadeh
1
by Gourav Sengupta
Structured Streaming using File Source - How to handle live files by ArtemisDev
2
by Gourav Sengupta
Spark ml how to extract split points from trained decision tree mode by AaronLee-2
5
by AaronLee-2
Arrow RecordBatches/Pandas Dataframes to (Arrow enabled) Spark Dataframe conversion in streaming fashion by Tanveer Ahmad - EWI
2
by Tanveer Ahmad - EWI
Does Spark SQL support GRANT/REVOKE operations on Tables? by Nasrulla Khan Haris
2
by Bill Glennon
Issue with pyspark query by Tzahi File
0
by Tzahi File
how can i write spark addListener metric to kafka by a s
1
by Tathagata Das
[spark-structured-streaming] [kafka] consume topics from multiple Kafka clusters by Srinivas V
4
by Srinivas V
[PySpark CrossValidator] Dropping column randCol before fitting model by Ablaye F.
0
by Ablaye F.
Out of memory causing due to high number of spark submissions in FIFO mode by sunil_pp
0
by sunil_pp
Using Spark Accumulators with Structured Streaming by Eric Beabes
28
by Eric Beabes
Add python library with native code by Stone Zhong-2
6
by Patrick McCarthy-2
unsubscribe by Arkadiy Ver
2
by Wesley-2
NoClassDefFoundError: scala/Product$class by charles_cai
5
by charles_cai
Spark :- Update record in partition. by Sunil kalra
1
by ayan guha
[pyspark 2.3+] Add scala library to pyspark app and use to derive columns by rishishah.star
0
by rishishah.star
Unsubscribe by Sunil Prabhakara
0
by Sunil Prabhakara
How to set Description in UI SQL tab by gpatcham
1
by VP
[PySpark] Tagging descriptions by rishishah.star
8
by rishishah.star
[Spark RDD] Persisting Spark RDDs across spark contexts/applications - options by Boris Litvak
1
by Bin Fan
WARN ProcfsMetricsGetter: Exception when trying to compute pagesize, as a result reporting of ProcessTree metrics is stopped by YuqingWan
0
by YuqingWan
[PySpark 2.3+] Reading parquet entire path vs a set of file paths by rishishah.star
1
by rishishah.star
Spark stage stuck by Manjunath Shetty H
0
by Manjunath Shetty H
Join on Condition provide at run time by Chetan Khatri
0
by Chetan Khatri
Spark Security by wilbertseoane
6
by srowen
Using existing distribution for join when subset of keys by Patrick Woody
3
by imback82
Apache Spark Machine Learning Unleashed Book Review author: Jillur Quddus by patrice molinchaeux
0
by patrice molinchaeux
[bug] Scala reflection "assertion failed: class Byte" in Dataset.toJSON by Brandon Vincent
0
by Brandon Vincent
Dataframe to nested json document by Chidananda Unchi
3
by neerajbhadani
Unsubscribe by Sunil Prabhakara
0
by Sunil Prabhakara
Spark dataframe hdfs vs s3 by Dark Crusader
11
by Anwar AliKhan
[pyspark 2.3+] Dedupe records by rishishah.star
3
by Anwar AliKhan
Different execution results with wholestage codegen on and off by Pasha Finkelshteyn
2
by Pasha Finkelshteyn
CSV parsing issue by elango vaidyanathan
4
by elango vaidyanathan