Want to save each spark batch run stats (start, end, ID etc) and watermark ( Last processed timestamp from external data source).
We have tried Hive JDBC, but it is very slow due MR jobs it will trigger. Cant save to normal Hive tables as it will create lots of small files in HDFS.
Please suggest what is the recommended way to do this ? Any pointers will be helpful
Thanks and regards