Passing Hive Context to FPGrowth.


Sbf xyz
Hi,

I am using Apache Spark 2.2 and the MLlib library in Python. I have to pass a Hive context to the FPGrowth algorithm. To do that, I converted a DataFrame to an RDD, but I am struggling with some pickling errors. After going through Stack Overflow, it seems we need to convert the RDD to a PipelinedRDD. Could anyone suggest how that could be done?


Thanks.
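
For reference, a minimal sketch of one way this conversion is often done. It assumes the DataFrame has an array column holding each transaction's items; the column name `items` and the helper names `to_transaction` and `mine_frequent_itemsets` are hypothetical and not taken from the post. In PySpark, calling `.map()` on `df.rdd` already returns a PipelinedRDD, so no separate conversion step should be needed. One common cause of pickling errors is a mapped function that references the SparkContext or HiveContext, so the function below touches only the row.

```python
def to_transaction(row, column="items"):
    """Turn one row into a plain list of unique, non-null items.

    FPGrowth expects each transaction to contain no duplicate items.
    """
    unique_items = []
    for item in row[column]:
        if item is not None and item not in unique_items:
            unique_items.append(item)
    return unique_items


def mine_frequent_itemsets(df, column="items", min_support=0.3):
    # Imported lazily so to_transaction can be tried without Spark installed.
    from pyspark.mllib.fpm import FPGrowth

    # df.rdd is an RDD of Row objects; mapping over it yields a PipelinedRDD.
    # The mapped function uses only the row and a string, never the context.
    transactions = df.rdd.map(lambda row: to_transaction(row, column))
    model = FPGrowth.train(transactions, minSupport=min_support)
    return model.freqItemsets().collect()
```

Here `df` would be the DataFrame read through the Hive context, for example the result of a `sql(...)` query; the context itself stays on the driver and is never passed into the mapped function.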