Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
1 ... 317318319320321322323 ... 350
Topics (12236)
Replies Last Post Views
(no subject) by Sai Prasanna
0
by Sai Prasanna
Spark Example Project, runnable on EMR, open sourced by Alex Dean
2
by Alex Dean
join with inputs co-partitioned? by Mina
0
by Mina
confused by reduceByKey usage by 诺铁
6
by 诺铁
Errors occurred while compiling module 'spark-streaming-zeromq' (IntelliJ IDEA 13.0.2) by zgalic
8
by imatespl
distinct on huge dataset by Kane
18
by Mayur Rustagi
Read/Write data to hive with Spark for ETL by egustavson
0
by egustavson
Continuously running non-streaming jobs by Jim Carroll
3
by Daniel Darabos
Spark program thows OutOfMemoryError by Qin Wei
3
by YouPeng Yang
strange StreamCorruptedException by Lukas Nalezenec
0
by Lukas Nalezenec
Shark: ClassNotFoundException org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat by ge ko
3
by Gerd Koenig
Shark: class java.io.IOException: Cannot run program "/bin/java" by ge ko
2
by Gerd Koenig
what is the difference between element and partition? by Mina
1
by wxhsdp
groupByKey(None) returns partitions according to the keys? by Mina
1
by wxhsdp
using saveAsNewAPIHadoopFile with OrcOutputFormat by Brock Bose
2
by MLnick
Create cache fails on first time by Arpit Tak-2
1
by Andre Bois-Crettez
graph.reverse & Pregel API by Bogdan Ghidireac
2
by Bogdan Ghidireac
sbt assembly error by Yiou Li
5
by Azuryy Yu
choose the number of partition according to the number of nodes by Mina
2
by Mina
GC overhead limit exceeded by Sai Prasanna
11
by Nick Chammas
Error when run Spark on mesos by felix
8
by scox
Spark packaging by Pradeep baji
3
by Arpit Tak
Pyspark job not starting by lobdellb
2
by gbrouwer5151
Error reading HDFS file using spark 0.9.0 / hadoop 2.2.0 - incompatible protobuf 2.5 and 2.4.1 by Prasad
22
by Arpit Tak-2
Why these operations are slower than the equivalent on Hadoop? by Yanzhe Chen
10
by Eugen Cepoi
Proper caching method by Mina
5
by Arpit Tak
what is a partition? how it works? by Mina
0
by Mina
Spark resilience by Ian Ferreira
7
by Aaron Davidson
Could I improve Spark performance partitioning elements in a RDD? by Mina
0
by Mina
partitioning of small data sets by Diana Carroll
4
by YouPeng Yang
java.net.SocketException: Network is unreachable while connecting to HBase by amit
1
by amit
groupByKey returns a single partition in a RDD? by Mina
1
by wxhsdp
storage.MemoryStore estimated size 7 times larger than real by wxhsdp
6
by wxhsdp
Multi-tenant? by Ian Ferreira
2
by Ian Ferreira
Can't run a simple spark application with 0.9.1 by Paul Schooss
1
by Paul Schooss
1 ... 317318319320321322323 ... 350