[jira] [Created] (CARBONDATA-2204) Access tablestatus file too many times during query



Akash R Nilugal (Jira)
xuchuanyin created CARBONDATA-2204:
--------------------------------------

             Summary: Access tablestatus file too many times during query
                 Key: CARBONDATA-2204
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2204
             Project: CarbonData
          Issue Type: Improvement
          Components: data-query
    Affects Versions: 1.3.0
            Reporter: xuchuanyin
            Assignee: xuchuanyin


# Problems

Currently in CarbonData, a single query reads the tablestatus file 7 times. This slows down query performance, especially when the file resides on a remote cluster, because reading it is a purely client-side operation.

 

# Steps to reproduce

1. Add a log statement in `AtomicFileOperationsImpl.openForRead` that prints the name of the file being read.

2. Run a query on a CarbonData table. Here I ran `TestLoadDataGeneral.test("test data loading CSV file without extension name")`.

3. Inspect the output log and search for the keyword 'tablestatus'.
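
Step 3 can be scripted rather than done by eye. The following is only a minimal sketch: `TableStatusReadCounter` and `countMatches` are hypothetical names and not part of CarbonData. It also assumes the logger added in step 1 writes exactly one line per `openForRead` call and that the line contains the file name.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;

// Hypothetical helper, not part of CarbonData: counts log lines that mention a keyword.
class TableStatusReadCounter {

  // Returns the number of lines that contain the keyword (case-sensitive match).
  static long countMatches(List<String> logLines, String keyword) {
    return logLines.stream().filter(line -> line.contains(keyword)).count();
  }

  public static void main(String[] args) throws IOException {
    if (args.length < 1) {
      System.err.println("usage: java TableStatusReadCounter <log-file>");
      return;
    }
    // Assumption: the test output was redirected to a UTF-8 text file.
    List<String> lines = Files.readAllLines(Paths.get(args[0]), StandardCharsets.UTF_8);
    System.out.println("tablestatus reads: " + countMatches(lines, "tablestatus"));
  }
}
```

If the one-line-per-call assumption holds, the printed count equals the number of times the file was opened during the run, which makes it easy to compare before and after any change.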

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)