Hi
Can you please mention the spark version, give us the code for setting up spark session, and the operation you are talking about? It will be good to know the amount of memory that your system has as well and number of executors you are using per system
In general I have faced issues when doing group by or running aggregates over datasets which are more than 2 GB but my system has lower ram.
Regards
Gourav
Hi, I am facing this issue while using toPandas() and Pyarrow simultaneously.