Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (12834)
Topic, Replies, Last Post
Is SparkContext.stop() optional or required? by Mingyu Kim
2
by Mingyu Kim
Giraph Vs SPARK by suman bharadwaj
6
by Matei Zaharia
Too many RDD partitions ??? by Manoj Samel
1
by Jey Kottalam
Problem with newAPIHadoopFile by chadi jaber
1
by chadi jaber
Exception in thread "DAGScheduler" java.lang.OutOfMemoryError: GC overhead limit exceeded by Manoj Samel
3
by Kal El
Advice if your workers die often by Guillaume Pitel
4
by yadid
Time window size in Spark Streaming by Ricky Ho
0
by Ricky Ho
data within batchduration in RDD of a Dstream reliable? by aecc
0
by aecc
Options for connecting with Shark from BI Tools by manish.gforce
0
by manish.gforce
Handling occasional bad data ... by Manoj Samel
3
by Manoj Samel
Running K-Means on a cluster setup by Kal El
11
by Mayur Rustagi
Location and memory allocations for master / worker nodes by Manoj Samel
1
by Prashant Sharma
DStream foreachRdd not working in standalone cluster mode by Sourav Chandra
6
by Sourav Chandra
DStream foreachRdd not working in standalone cluster mode by souravchandra
0
by souravchandra
Windows submit JavaSparkPi through mvn:Initial job has not accepted any resources by buring
0
by buring
Spark does not retry failed tasks initiated by hadoop by Aureliano Buendia
2
by Aureliano Buendia
Using persistent hdfs on spark ec2 instances by Aureliano Buendia
7
by Aureliano Buendia
KRYO usage details: Need Help by suman bharadwaj
2
by suman bharadwaj
Spark streaming on YARN? by Mike Percy
11
by Mike Percy
Running make-distribution.sh .. compilation errors in streaming/api/java/JavaPairDStream.scala by Manoj Samel
4
by Manoj Samel
Quality of documentation (rant) by od
15
by Aureliano Buendia
How to use cluster for large set of linux files by Manoj Samel
5
by Matei Zaharia
How to perform multi dimensional reduction in spark? by Aureliano Buendia
2
by Evan R. Sparks
why is it so slow to run sbt/sbt assembly in my machine? by dachuan
4
by dachuan
OOM - Help Optimizing Local Job by Brad Ruderman
9
by Tathagata Das
make-distribution.sh error org.apache.hadoop#hadoop-client;2.0.0: not found by Manoj Samel
1
by Manoj Samel
a newbie trying to compile and execute examples from 0.9.0-incubating-SNAPSHOT by Alonso
0
by Alonso
Memory Exception by nazar
0
by nazar
use Pipe to run perl (output "utf-8" Chinese textfile), the result is wrong by taozhou2
0
by taozhou2
TorrentBroadcast + persist = bug by losmi83
4
by losmi83
Forcing RDD computation with something else than count() ? by Guillaume Pitel
5
by Guillaume Pitel
Lazy evaluation of RDD data transformation by DB Tsai
3
by Reynold Xin
Re: reading LZO compressed file in spark by Andrew Ash
6
by rajeev
Read multiple HDFS files in parallel? by robin_up
1
by robin_up
spark.default.parallelism by od
4
by Andrew Ash