xuchuanyin created CARBONDATA-2204:
--------------------------------------
Summary: Access tablestatus file too many times during query
Key: CARBONDATA-2204
URL:
https://issues.apache.org/jira/browse/CARBONDATA-2204 Project: CarbonData
Issue Type: Improvement
Components: data-query
Affects Versions: 1.3.0
Reporter: xuchuanyin
Assignee: xuchuanyin
# Problems
Currently in carbondata, a single query will access tablestatus file 7 times, which will definitely slow down the query performance especially when this file is in remote cluster since reading this file is purely client side operation.
# Steps to reproduce
1. Add logger in `AtomicFileOperationsImpl.openForRead` and printout the file name to read.
2. Run a query on carbondata table. Here I ran `TestLoadDataGeneral.test("test data loading CSV file without extension name")`.
3. Observe the output log and search the keyword 'tablestatus'.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)