Quantcast

[Spark SQL & Presto] : Spark SQL & Presto's performance about read hive

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate

[Spark SQL & Presto] : Spark SQL & Presto's performance about read hive

JWang
This post has NOT been accepted by the mailing list yet.
Hi,
I did a test on SparkSQL &Presto to read about 30,000 files (1,953,764,027 records) stored in hive, then count the records number as the output.Spark got the worse performance (SparkSQL Duration : 24s ; Presto  Duration : 8s).Would it be sensible?  why ?
Thanks
Loading...