Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org. Messages posted here will be sent to this mailing list.
Topics (12464)
Replies Last Post
sbt assembly error by Yiou Li
5
by Azuryy Yu
choose the number of partition according to the number of nodes by Mina
2
by Mina
GC overhead limit exceeded by Sai Prasanna
11
by Nick Chammas
Error when run Spark on mesos by felix
8
by scox
Spark packaging by Pradeep baji
3
by Arpit Tak
Pyspark job not starting by lobdellb
2
by gbrouwer5151
Error reading HDFS file using spark 0.9.0 / hadoop 2.2.0 - incompatible protobuf 2.5 and 2.4.1 by Prasad
22
by Arpit Tak-2
Why these operations are slower than the equivalent on Hadoop? by Yanzhe Chen
10
by Eugen Cepoi
Proper caching method by Mina
5
by Arpit Tak
what is a partition? how it works? by Mina
0
by Mina
Spark resilience by Ian Ferreira
7
by Aaron Davidson
Could I improve Spark performance partitioning elements in a RDD? by Mina
0
by Mina
partitioning of small data sets by Diana Carroll
4
by YouPeng Yang
java.net.SocketException: Network is unreachable while connecting to HBase by amit
1
by amit
groupByKey returns a single partition in a RDD? by Mina
1
by wxhsdp
storage.MemoryStore estimated size 7 times larger than real by wxhsdp
6
by wxhsdp
Multi-tenant? by Ian Ferreira
2
by Ian Ferreira
Can't run a simple spark application with 0.9.1 by Paul Schooss
1
by Paul Schooss
How to stop system info output in spark shell by Wei Da
3
by Nick Chammas
Problem with KryoSerializer by yh18190
0
by yh18190
can't sc.paralellize in Spark 0.7.3 spark-shell by Walrus theCat
4
by Walrus theCat
scheduler question by Mohit Jaggi
0
by Mohit Jaggi
Streaming job having Cassandra query : OutOfMemoryError by sonyjv
0
by sonyjv
Twitter4j 4.0.1 compatibility by codeRunner
0
by codeRunner
standalone vs YARN by ishaaq
2
by Surendranauth Hirama...
blinkdb status by pti
0
by pti
Lost an executor error - Jobs fail by Praveen R-2
5
by Aaron Davidson
process_local vs node_local by Nathan Kronenfeld
3
by Nathan Kronenfeld
Unsubscribe by Chhaya Vishwakarma
0
by Chhaya Vishwakarma
shuffle vs performance by Mina
0
by Mina
Use combineByKey and StatCount by Jaonary Rabarisoa
2
by Cheng Lian
using Kryo with pyspark? by Diana Carroll
1
by Matei Zaharia
RDD.tail() by Philip Ogren
2
by Matei Zaharia
Measure the Total Network I/O, Cpu and Memory Consumed by Spark Job by yxzhao
2
by yxzhao
cannot exec. job: "TaskSchedulerImpl: Initial job has not accepted any resources" by ge ko
1
by Praveen R-2