Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1234 ... 331
Topics (11584)
Replies Last Post Views
Does Pyspark Support Graphx? by XiaoboGu
2
by XiaoboGu
can we do self join on streaming dataset in 2.2.0? by kant kodali
0
by kant kodali
Can spark handle this scenario? by Lian Jiang
11
by Lian Jiang
Java Heap Space Error by Vinay Muttineni
0
by Vinay Muttineni
"Too Large DataFrame" shuffle Fetch Failed exception in Spark SQL (SPARK-16753) (SPARK-9862)(SPARK-5928)(TAGs - Spark SQL, Intermediate Level, Debug) by Ashutosh Ranjan
0
by Ashutosh Ranjan
Does the classloader used by spark blocks the I/O calls from UDF's? by kant kodali
0
by kant kodali
[spark-sql] Custom Query Execution listener via conf properties by kurian vs
1
by Marcelo Vanzin
pyspark+spacy throwing pickling exception by Selvam Raman
2
by Selvam Raman
Pyspark UDF/map fucntion throws pickling exception by Selvam Raman
1
by Selvam Raman
Run Multiple Spark jobs. Reduce Execution time. by akshay naidu
4
by akshay naidu
stdout: org.apache.spark.sql.AnalysisException: nondeterministic expressions are only allowed in by kant kodali
0
by kant kodali
[Structured Streaming] Avoiding multiple streaming queries by Priyank Shrivastava
3
by Tathagata Das
Spark structured streaming: periodically refresh static data frame by Appu K
4
by Tathagata Das
[Spark-Core]port opened by the SparkDriver is vulnerable to flooding attacks by sandeep-katta
0
by sandeep-katta
SparkR test script issue: unable to run run-tests.h on spark 2.2 by chandan prakash
3
by chandan prakash
read parallel processing spark-cassandra by cyberjog
0
by cyberjog
not able to read git info from Scala Test Suite by karan alang
0
by karan alang
[Spark GraphX pregel] default value for EdgeDirection not consistent between programming guide and API documentation by Ramon Bejar Torres
0
by Ramon Bejar Torres
Inefficient state management in stream to stream join in 2.3 by Yogesh
0
by Yogesh
Why python cluster mode is not supported in standalone cluster? by Ashwin Sai Shankar
0
by Ashwin Sai Shankar
org.apache.kafka.clients.consumer.OffsetOutOfRangeException by Mina Aslani
1
by dcam
Spark 2.2.1 EMR 5.11.1 Encrypted S3 bucket overwriting parquet file by Stephen Robinson
0
by Stephen Robinson
Retrieve batch metadata via the spark monitoring api by Hendrik Dev
0
by Hendrik Dev
can udaf's return complex types? by kant kodali
1
by matteuan
[Spark-Listener] [How-to] Listen only to specific events by Naved Alam
0
by Naved Alam
Spark on K8s with Romana by Jenna Hoole
1
by Yinan Li
optimize hive query to move a subset of data from one partition table to another table by amit kumar singh
3
by devjyoti patra
Schema - DataTypes.NullType by jgp
4
by jgp
Efficient way to compare the current row with previous row contents by Debabrata Ghosh
4
by geoHeil
Spark sortByKey is not lazy evaluated by sandudi
0
by sandudi
[pyspark] structured streaming deployment & monitoring recommendation by Bram
0
by Bram
Unsubscribe by Archit Thakur
2
by Sandeep Varma
saveAsTable does not respect spark.sql.warehouse.dir by Lian Jiang
2
by Lian Jiang
Spark Dataframe and HIVE by ☼ R Nair (रविशंकर ना...
24
by Patrick Alwell
Spark cannot find tables in Oracle database by Lian Jiang
4
by Lian Jiang
1234 ... 331