Pyspark and searching items from data structures

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

Pyspark and searching items from data structures

Esa Heikkinen-2

Hi

 

I would want to build pyspark-application, which searches sequential items or events of time series from csv-files.

 

What are the best data structures for this purpose ? Dataframe of pyspark or pandas, or RDD or SQL or something else ?

 

---

Esa