Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
123456 ... 389
Topics (13588)
Replies Last Post Views
Can pyspark use --archives to upload self-defined module than --py-files? by zenglong chen
0
by zenglong chen
concat function nesting function, column printing failed by 李斌松
1
by 李斌松
[Pyspark 2.4] Large number of row groups in parquet files created using spark by rishishah.star
0
by rishishah.star
[Spark SQL] dependencies to use test helpers by james.pirz
0
by james.pirz
How to get Peak CPU Utilization Rate in Spark by Pralabh Kumar
0
by Pralabh Kumar
Spark 2.3 Dataframe Grouby operation throws IllegalArgumentException on Large dataset by Balakumar iyer S
3
by Chris Teoh
Long-Running Spark application doesn't clean old shuffle data correctly by colon3l_landa
4
by colon3l_landa
Apache Spark Log4j logging applicationId by Luxabor
0
by Luxabor
Avro large binary read memory problem by Nicolas Paris-2
2
by Nicolas Paris-2
spark dataset.cache is not thread safe by Amit Sharma
1
by Amit Sharma
(no subject) by Hieu Nguyen
0
by Hieu Nguyen
NoSuchMethodError: org.apache.spark.network.util.AbstractFileRegion.transferred by XiaoboGu
4
by Stephen Boesch
Spark SaveMode by Richard
4
by Mich Talebzadeh
How to get loss per iteration in Spark MultilayerPerceptronClassificationModel? by Shamshad Ansari
0
by Shamshad Ansari
Spark dataset to explode json string by Richard
6
by Richard
Spark ImportError: No module named XXX by zenglong chen
0
by zenglong chen
Unsubscribe by Aslan Bekirov
0
by Aslan Bekirov
Looking for a developer to help us with a small ETL project using Spark and Kubernetes by Information Technolo...
1
by sebastian.piu
Usage of PyArrow in Spark by AbdealiJK
3
by Bryan Cutler
spark standalone mode problem about executor add and removed again and again! by zenglong chen
2
by Riccardo Ferrari
Binding spark workers to a network interface by Supun Kamburugamuve
0
by Supun Kamburugamuve
Re: Release Apache Spark 2.4.4 before 3.0.0 by Dongjoon Hyun-2
7
by Joevu
[Beginner] Run compute on large matrices and return the result in seconds? by Gautham Acharya
8
by Gautham Acharya
event log directory(spark-history) filled by large .inprogress files for spark streaming applications by raman gugnani
2
by Shahid K. I.
CPU:s per task by Magnus Nilsson-2
0
by Magnus Nilsson-2
[PySpark] [SparkR] Is it possible to invoke a PySpark function with a SparkR DataFrame? by Fiske, Danny
1
by Felix Cheung
Sorting tuples with byte key and byte value by Supun Kamburugamuve
2
by Supun Kamburugamuve
spark python script importError problem by zenglong chen
1
by Patrick McCarthy-2
Parse RDD[Seq[String]] to DataFrame with types. by Guillermo Ortiz Fern...
0
by Guillermo Ortiz Fern...
Spark 2.4 scala 2.12 Regular Expressions Approach by anbutech
0
by anbutech
unsubscribe by paras301
1
by Raj Adyanthaya
How to use HDFS >3.1.1 with spark 2.3.3 to output parquet files to S3? by Alexander Czech-2
0
by Alexander Czech-2
Spark CSV Quote only NOT NULL by Anil Kulkarni
5
by Swetha Ramaiah
write csv does not handle \r correctly by Nicolas Paris-2
0
by Nicolas Paris-2
timestamp column orc problem with hive by Nicolas Paris-2
0
by Nicolas Paris-2
123456 ... 389