Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (13344)
Topic by Author / Replies / Last Post by
The following Java MR code works for small dataset but throws(arrayindexoutofBound) error for large dataset by Balakumar iyer S
1
by maasg
Generic Dataset[T] Query by SNEHASISH DUTTA
1
by ramannanda9@gmail.co...
[ANNOUNCE] Announcing Apache Spark 2.4.3 by Xiao Li
0
by Xiao Li
Train ML models on each partition by Qian He
1
by Dillon Dukek
Re: Static partitioning in partitionBy() by Burak Yavuz-2
2
by Gourav Sengupta
pyspark on pycharm error by karan alang
0
by karan alang
IllegalArgumentException: Timestamp format must be yyyy-mm-dd hh:mm:ss[.fffffffff] while using spark-sql-2.4.1v to read data from oracle by Shyam P
0
by Shyam P
HiveTableRelation Assertion Error | Joining Stream with Hive table by manjun8
0
by manjun8
How does org.apache.spark.sql.catalyst.util.MapData support hash lookup? by Shawn Yang
0
by Shawn Yang
Create table from Avro-generated parquet files? by Coolbeth, Matthew
0
by Coolbeth, Matthew
Running Spark 2.4 on K8S by Bigg Ben
1
by Bigg Ben
ThriftServer gc over exceed and memory problem by shicheng31604@gmail....
0
by shicheng31604@gmail....
Dynamic metric names by Sergey Zhemzhitsky
4
by Roberto Coluccio
Spark SQL met "Block broadcast_xxx not found" by Xilang Yan
2
by Jacek Laskowski
Spark structured streaming watermarks on nested attributes by Joe Ammann
4
by Joe Ammann
Performance Decrease in spark by yuvraj singh
1
by Gourav Sengupta
Image Grep by swastik mittal
0
by swastik mittal
Anaconda installation with Pyspark on cloudera managed server by rishishah.star
14
by Gourav Sengupta
Deep Learning with Spark, what is your experience? by Riccardo Ferrari
9
by Gourav Sengupta
K8S spark submit for spark 2.4 by Ben Chukwumobi (CONT...
0
by Ben Chukwumobi (CONT...
K8S Spark submit by Ben Chukwumobi (CONT...
0
by Ben Chukwumobi (CONT...
batch processing in spark by swastik mittal
1
by uncleGen
write files of a specific size by kumar.rajat20del
2
by Alonso
Request for a working example of using Pregel API in GraphX using Spark Scala by Basavaraj
0
by Basavaraj
This MapR-DB Spark Connector with Secondary Indexes by Mich Talebzadeh
1
by Mich Talebzadeh
pySpark - pandas UDF and binaryType by Nicolas Paris-2
4
by Gourav Sengupta
error when running decisiontree in java by Serena S Yuan
0
by Serena S Yuan
Howto force spark to honor parquet partitioning by Tomas Bartalos
1
by Gourav Sengupta
Spark SQL JDBC teradata syntax error by khajaasmath786
1
by Gourav Sengupta
[MLlib][Beginner][Debug]: Logistic Regression model always predicts the same value by Josue Lopes
0
by Josue Lopes
Spark SQL Teradata load is very slow by khajaasmath786
1
by Shyam P
Update / Delete records in Parquet by Chetan Khatri
5
by Chetan Khatri
Getting EOFFileException while reading from sequence file in spark by Prateek Rajput
3
by Prateek Rajput
Spark 2.4.1 on Kubernetes - DNS resolution of driver fails by Olivier Girardot-2
2
by Olivier Girardot-2
What is Spark context cleaner in structured streaming by Akshay Bhardwaj
1
by kanchan tewary