Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1 ... 45678910 ... 327
Topics (11438)
Replies Last Post Views
Spark Data Frame. PreSorded partitions by Николай Ижиков
1
by MidwestMike
How to kill a query job when using spark thrift-server? by KevinZwx
0
by KevinZwx
Loading a large parquet file how much memory do I need by Alexander Czech
7
by Gourav Sengupta
Using MatrixFactorizationModel as a feature extractor by Corey Nolet
1
by Corey Nolet
Cosine Similarity between documents - Rows by Donni Khan
1
by Yao
[Spark ML] Compatibility between features and models by Ming Ma
0
by Ming Ma
Spark Streaming Kinesis Missing Records by Richard Moorhead
0
by Richard Moorhead
SparkSQL not support CharType by 163
1
by Jörn Franke
build spark source code by MidwestMike
1
by Jörn Franke
Hive From Spark: Jdbc VS sparkContext by Nicolas Paris
24
by Nicolas Paris
Spark Streaming Kerberos Issue by khajaasmath786
6
by geoHeil
newbie: how to partition data on file system. What are best practices? by Andy Davidson
0
by Andy Davidson
Spark Stremaing Hive Dynamic Partitions Issue by khajaasmath786
0
by khajaasmath786
Spark Writing to parquet directory : java.io.IOException: Disk quota exceeded by Chetan Khatri
2
by Vadim Semenov
unsubscribe by HanPan
0
by HanPan
Caching dataframes and overwrite by MidwestMike
0
by MidwestMike
What do you pay attention to when validating Spark jobs? by Holden Karau
1
by lucas.gary@gmail.com
Process large JSON file without causing OOM by Alec Swan
9
by Alec Swan
Spark/Parquet/Statistics question by djiang
1
by Rabin Banerjee
Parquet Filter pushdown not working and statistics are not generating for any column with Spark 1.6 CDH 5.7 by Rabin Banerjee
0
by Rabin Banerjee
Dynamic data ingestion into SparkSQL - Interesting question by Aakash Basu-2
3
by Aakash Basu-2
Long running Spark Job Status on Remote Submission by Harsh Choudhary
0
by Harsh Choudhary
PySpark 2.2.0, Kafka 0.10 DataFrames by salemi
2
by Shixiong(Ryan) Zhu
Re: How to print plan of Structured Streaming DataFrame by Shixiong(Ryan) Zhu
0
by Shixiong(Ryan) Zhu
Parquet files from spark not readable in Cascading by Vikas Gandham-2
2
by Vikas Gandham-2
Kryo not registered class by Angel Francisco Orta
1
by Vadim Semenov
spark streaming part files in hive partition by khajaasmath786
1
by khajaasmath786
[Spark SQL]: DataFrame schema resulting in NullPointerException by chitralverma
0
by chitralverma
Multiple transformations without recalculating or caching by Fernando Pereira
3
by Phillip Henry
Spark 2.1.2 Spark Streaming checkpoint interval not respected by Shing Hing Man-2
0
by Shing Hing Man-2
Spark based Data Warehouse by ashish rawat
16
by lucas.gary@gmail.com
SpecificColumnarIterator has grown past JVM limit of 0xFFF by Md. Rezaul Karim
0
by Md. Rezaul Karim
History server and non-HDFS filesystems by Paul Mackles
0
by Paul Mackles
Spark Streaming in Wait mode by khajaasmath786
0
by khajaasmath786
Union of streaming dataframes by JayeshLalwani
0
by JayeshLalwani
1 ... 45678910 ... 327