Apache CarbonData Dev Mailing List archive

Re: Low Performance of full scan.

Posted by xm_zzc on
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/Low-Performance-of-full-scan-tp62859p66812.html

Hi Ravindra:
I re-test my test cases mentioned above with Spark 2.3.2 + CarbonData
master branch, the query performance of carbondata are almost the same as
the parquet:

*Test result:**
SQL1: Parquet: 4.6s 4s 3.8s
CarbonData: 4.7s 3.6s 3.5s
SQL2: Parquet: 9s 8s 8s
CarbonData: 9s 8s 8s

The query performance of CarbonData has improved a lot (SQL1: 12s to 4s,
SQL2: 18 to 8s) while the query performance of parquet has also improved
(SQL2: 10s to 8s). That's great.
But I saw the test result you mentioned in
'http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/CarbonData-Performance-Optimization-td62950.html',
the query performance of carbondata were almost better than the parquet. I
want to know how you tested those cases? And are there other optimizations
that have not been merged yet?

Regards,
Zhichao.

--
Sent from: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/