http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/DISCUSSION-About-the-future-of-hive-integration-tp24064.html
Hi, community:
My purpose in creating the "hive integration" module is not just to make CarbonData queryable from Hive. I want CarbonData to become a common file format in the Hadoop ecosystem, like ORC and Parquet.
In the Hadoop ecosystem, the common file formats for data warehouses are ORC and Parquet. My company alone has thousands of ORC tables.
Can CarbonData become a common file format? I think yes!
The concrete things we can do in CarbonData 1.3.0 are as follows:
3. Improve batch query performance: a query without an index should be as fast as Parquet (in my tests, a query without an index is slower than Parquet; the bottleneck is task initialization)
4. Reduce the time it takes to load the index; it is too slow the first time
If I have missed anything, please add it.
This is just my vision. What do you think?