Kafka Spark Streaming Python


I am new to this technology and am taking my first steps with it.
The goal is to build an online application in PySpark that accepts a binary stream divided into topics by Kafka, processes it with Spark Streaming, and saves the result as DataFrames to HDFS.
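
To make the question concrete, here is a rough sketch of the kind of pipeline I have in mind. It is not a tested solution. It assumes Spark Structured Streaming (rather than the older DStream API) with Spark's built-in "kafka" source, which needs the spark-sql-kafka-0-10 connector package matching the Spark version to be supplied at submit time. The function names, broker address, topic names, and HDFS paths are all hypothetical placeholders.

```python
def kafka_source_options(bootstrap_servers, topics):
    """Build the option dict for Spark's built-in 'kafka' streaming source.

    bootstrap_servers and topics are placeholders supplied by the caller.
    """
    return {
        "kafka.bootstrap.servers": bootstrap_servers,
        "subscribe": ",".join(topics),   # comma-separated list of topics
        "startingOffsets": "latest",     # only read records arriving from now on
    }


def run_pipeline(bootstrap_servers, topics, output_path, checkpoint_path):
    """Read binary records from Kafka and append them to Parquet files.

    Hypothetical helper: output_path and checkpoint_path would be HDFS
    locations such as 'hdfs:///some/dir' (assumed, not from a real cluster).
    """
    # Imported here so the option helper above can be used without Spark.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("KafkaToHdfs").getOrCreate()

    # The Kafka source yields a streaming DataFrame whose 'key' and 'value'
    # columns are binary, alongside topic, partition, offset and timestamp.
    raw = (
        spark.readStream
        .format("kafka")
        .options(**kafka_source_options(bootstrap_servers, topics))
        .load()
    )

    # Keep the payload as raw bytes; any decoding or parsing would go here.
    records = raw.select("topic", "partition", "offset", "timestamp", "value")

    # Write each micro-batch as Parquet, one sub-directory per Kafka topic.
    query = (
        records.writeStream
        .format("parquet")
        .option("path", output_path)
        .option("checkpointLocation", checkpoint_path)
        .partitionBy("topic")
        .outputMode("append")
        .start()
    )
    query.awaitTermination()
```

With this layout, a script submitted through spark-submit would call something like run_pipeline("localhost:9092", ["topic_a", "topic_b"], "hdfs:///data/stream_out", "hdfs:///checkpoints/stream_out"), where every argument is an assumed value. Parquet is used here only because it can be read back directly as a DataFrame; another file format could be substituted.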

Could anyone please point me in the right direction?