Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1234567 ... 312
Topics (10896)
Replies Last Post Views
Reading ORC file - fine on 1.6; GC timeout on 2+ by Nick Chammas
0
by Nick Chammas
Spark-SQL collect function by hsden85
2
by hsden85
Kerberos impersonation of a Spark Context at runtime by matd
0
by matd
any support to use Spark UDF in HIVE by Manohar753
0
by Manohar753
In-order processing using spark streaming by scorpio
1
by JayeshLalwani
Refreshing a persisted RDD by JayeshLalwani
2
by hsden85
large spark job hang on with many active stages/jobs by yzhang20170501
0
by yzhang20170501
Benchmark of XGBoost, Vowpal Wabbit and Spark ML on Criteo 1TB Dataset by pklemenkov
0
by pklemenkov
Quantile Discretizer on multiple Groups of same Dataframe. by kalyanischakravarthi...
0
by kalyanischakravarthi...
"java.lang.IllegalStateException: There is no space for new record" in GraphFrames by rok
0
by rok
Has anyone used CoreNLP from stanford for sentiment analysis in Spark? It does not work as desired for me. by Gaurav1809
0
by Gaurav1809
Problem in adding a contact in Skype by wtsspencer
0
by wtsspencer
Spark Stanadlone mode PipelineModel.save not storing the trained by parameswarnc
0
by parameswarnc
javaRDD to collectasMap throuwa ava.lang.NegativeArraySizeException by Manohar753
0
by Manohar753
Contact Help Desk Phone Number of Gmail by maccnacc
0
by maccnacc
cluster setup for Spark streaming and scheduled batch jobs and adhoc queries by anna
0
by anna
Spark issue with large data by kumarbharath
0
by kumarbharath
Spark Structured Streaming with Kafka convert json string to dataframe or Json by Shashank734
2
by Shashank734
WrappedArray to row of relational Db by vaibhavrtk
0
by vaibhavrtk
Pyspark reading parquet from HDFS by phonchi
0
by phonchi
Partitions - Distribute By - MapPartitions by Balaji Krishnan
1
by Balaji Krishnan
pyspark-Failed to run first by Congrui Yi
7
by rvalero
Off heap memory settings and Tungsten by geoHeil
0
by geoHeil
Could Dataset not register Chinese table name? by evil
0
by evil
Spark 2.1.0 hanging while writing a table in HDFS in parquet format by gae123
0
by gae123
In an executor, are the Python worker memory and the MemoryOverhead overlapping? by o_rayer
0
by o_rayer
Is there a way to tell if a receiver is a Reliable Receiver? by Justin Pihony
0
by Justin Pihony
Search and Replace issue by nischay21
0
by nischay21
Spark Streaming. Real-time save data and visualize on dashboard by tencas
3
by Balaji Krishnan
How to store 10M records in HDFS to speed up further filtering? by MoTao
0
by MoTao
Shall I use Apache Zeppelin for data analytics & visualization? by Gaurav1809
0
by Gaurav1809
Spark Streaming: java.io.InvalidClassException: scala.concurrent.duration.Duration; local class incompatible: by min
0
by min
Spark API authentication by Sergey
0
by Sergey
How to deploy a Spark client outside of the cdh cluster by min
2
by min
Reading Large SequenceFile into RDD Results in Imbalance Task by phonchi
0
by phonchi
1234567 ... 312