Login  Register

Re: Low Performance of full scan.

Posted by xm_zzc on Oct 29, 2018; 4:02pm
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/Low-Performance-of-full-scan-tp62859p66812.html

Hi Ravindra:
    I re-test my test cases mentioned above with Spark 2.3.2 + CarbonData
master branch, the query performance of carbondata are almost the same as
the parquet:

*Test result:**
  SQL1:    Parquet:      4.6s       4s         3.8s  
           CarbonData:   4.7s       3.6s       3.5s  
  SQL2:    Parquet:      9s     8s      8s    
           CarbonData:   9s     8s      8s

  The query performance of CarbonData has improved a lot (SQL1: 12s to 4s,
SQL2: 18 to 8s) while the query performance of parquet has also improved
(SQL2: 10s to 8s). That's great.
  But I saw the test result you mentioned in
'http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/CarbonData-Performance-Optimization-td62950.html',
the query performance of carbondata were almost better than the parquet. I
want to know how you tested those cases? And are there other optimizations
that have not been merged yet?

Regards,
Zhichao.



--
Sent from: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/