Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1 ... 303304305306307308309 ... 312
Topics (10896)
Replies Last Post Views
updateStateByKey Question by craigv
0
by craigv
Problems while moving from 0.8.0 to 0.8.1 by Archit Thakur
0
by Archit Thakur
Inaccurate Estimates from LinearRegressionWithSGD by herbps10
1
by sowen
How to create RDD over hashmap? by Manoj Samel
14
by Manoj Samel
Stalling during large iterative PySpark jobs by Jeremy Freeman
4
by Jeremy Freeman
Spark Scheduler by Sai Prasanna
2
by Sai Prasanna
how to set SPARK_WORKER_INSTANCES and SPARK_WORKER_CORES otpimally by Chen Jin
2
by Chen Jin
Division of work between master, worker, executor and driver by Manoj Samel
4
by Mark Hamstra
GroupByKey implementation. by Archit Thakur
0
by Archit Thakur
Moving from java.util.HashMap to org.apache.spark.util.AppendOnlyMap.scala by Archit Thakur
0
by Archit Thakur
ClassNotFoundException with simple Spark job on cluster by zhanif
2
by Archit Thakur
Does foreach operation increase rdd lineage? by guojc
5
by Mark Hamstra
Spark connecting to wrong Filesystem.uri by Mskh
0
by Mskh
subscribe by Shafaq
0
by Shafaq
SparkStreaming not read hadoop configuration from its sparkContext on Stand Alone app? by robin_up
0
by robin_up
Running spark driver inside a servlet by Kapil Malik
2
by Kapil Malik
Suggestion for ec2 script by Mingyu Kim
2
by Mingyu Kim
Submitting job to Yarn's ResourceManager by DB Tsai
1
by Tom Graves
Non-deterministic behavior in spark by od
7
by Ognen Duzlevski-2
Exception when running ALS (MLlib) by samuel281
0
by samuel281
.intersection() method on RDDs? by Andrew Ash
15
by Andrew Ash
JavaKMeans:OutOfMemoryError: Java heap space by buring
0
by buring
Is SparkContext.stop() optional or required? by Mingyu Kim
2
by Mingyu Kim
Giraph Vs SPARK by suman bharadwaj
6
by Matei Zaharia
Too many RDD partititons ??? by Manoj Samel
1
by Jey Kottalam
Problem with newAPIHadoopFile by chadi jaber
1
by chadi jaber
Exception in thread "DAGScheduler" java.lang.OutOfMemoryError: GC overhead limit exceeded by Manoj Samel
3
by Kal El
Advices if your worker die often by Guillaume Pitel
4
by yadid
Time window size in Spark Streaming by Ricky Ho
0
by Ricky Ho
data within batchduration in RDD of a Dstream reliable? by aecc
0
by aecc
Options for connecting with Shark from BI Tools by manish.gforce
0
by manish.gforce
Handling occasional bad data ... by Manoj Samel
3
by Manoj Samel
Running K-Means on a cluster setup by Kal El
11
by Mayur Rustagi
Location and memory allocations for master / worker nodes by Manoj Samel
1
by Prashant Sharma
DStream foreachRdd not working in standalone cluster mode by Sourav Chandra
6
by Sourav Chandra
1 ... 303304305306307308309 ... 312