Apache Spark User List

This forum is an archive for the mailing list user@spark.apache.org (more options) Messages posted here will be sent to this mailing list.
12345 ... 315
Topics (11019)
Replies Last Post Views
Nested RDD operation by Daniel O' Shaughness...
4
by jgp
Uses of avg hash probe metric in HashAggregateExec? by Jacek Laskowski
0
by Jacek Laskowski
ConcurrentModificationException using Kafka Direct Stream by Harsh
8
by Harsh
Spark Executor - jaas.conf with useTicketCache=true by Hugo Reinwald
0
by Hugo Reinwald
[Timer-0:WARN] Logging$class: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient resources by jgp
1
by Riccardo Ferrari
Configuration for unit testing and sql.shuffle.partitions by peay
3
by Vadim Semenov
Re: Chaining Spark Streaming Jobs by Sunita
3
by Michael Armbrust
Builder Pattern used by Spark source code architecture by Patrick-2
0
by Patrick-2
Size exceeds Integer.MAX_VALUE issue with RandomForest by rpulluru
2
by rpulluru
[SPARK-SQL] Does spark-sql have Authorization built in? by Arun Khetarpal
3
by Arun Khetarpal
spark 2.1.1 ml.LogisticRegression with large feature set cause Kryo serialization failed: Buffer overflow by haibo wu
0
by haibo wu
Spark 2.1.1 Driver OOM when use interaction for large scale Sparse Vector by haibo wu
0
by haibo wu
PLs assist: trying to FlatMap a DataSet / partially OT by Marco Mistroni
2
by Marco Mistroni
spark.streaming.receiver.maxRate by Margusja
3
by Akhil Das-2
RDD order preservation through transformations by johan.grande.ext
13
by johan.grande.ext
Should I use Dstream or Structured Stream to transfer data from source to sink and then back from sink to source? by kant kodali
2
by kant kodali
[SS] Any way to optimize memory consumption of SS? by KevinZwx
4
by KevinZwx
cannot cast to double from spark row by khajaasmath786
1
by Ram Sriharsha
[SS]How to add a column with custom system time? by KevinZwx
9
by Michael Armbrust
[Structured Streaming] Multiple sources best practice/recommendation by JG Perrin
1
by Michael Armbrust
compile error: No classtag available while calling RDD.zip() by 沈志宏
2
by 沈志宏
Re-sharded kinesis stream starts generating warnings after kinesis shard numbers were doubled by Mikhailau, Alex
0
by Mikhailau, Alex
how sequence of chained jars in spark.(driver/executor).extraClassPath matters by Richard Xin-2
0
by Richard Xin-2
Minimum cost flow problem solving in Spark by Swapnil Shinde
0
by Swapnil Shinde
HiveThriftserver does not seem to respect partitions by Yana
0
by Yana
[Spark Dataframe] How can I write a correct filter so the Hive table partitions are pruned correctly by Patrick Duin
0
by Patrick Duin
spark streaming executor number still increase by zhan8610189
0
by zhan8610189
Multiple Sources found for csv by jeffsaremi
1
by jeffsaremi
Continue reading dataframe from file despite errors by jeffsaremi
3
by jeffsaremi
Queries with streaming sources must be executed with writeStream.start() by kant kodali
7
by kant kodali
How can I Upgrade Spark 1.6 to 2.x in Cloudera QuickStart VM 5.7 by Gaurav1809
0
by Gaurav1809
How do I create a JIRA issue and associate it with a PR that I created for a bug in master? by Mikhailau, Alex
0
by Mikhailau, Alex
How to run "merge into" ACID transaction hive query using hive java api? by hokam chauhan
0
by hokam chauhan
Re: Why do checkpoints work the way they do? by Dmitry Naumenko
1
by Hugo Reinwald
How does spark work? by 陈卓
3
by Jules Damji
12345 ... 315