[jira] [Created] (CARBONDATA-4182) Performance issue when multiple loads are happening to the same table for the same interval with 2 MVs.

Akash R Nilugal (Jira)
suyash yadav created CARBONDATA-4182:
----------------------------------------

             Summary: Performance issue when multiple loads are happening to the same table for the same interval with 2 MVs.
                 Key: CARBONDATA-4182
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-4182
             Project: CarbonData
          Issue Type: Improvement
          Components: core
    Affects Versions: 2.1.0
         Environment: Apache CarbonData 2.1.0
            Reporter: suyash yadav


Hi Team,

We need your help to resolve a performance issue that we are facing. Please see below the details of the table structure and schema implemented at our end:

1. We have 25 tables and 2 MVs created for these tables, for hour and day granularity.
2. One table can have more than 1 CSV for the same interval, and whenever multiple CSVs are input to the same table, they are loaded sequentially.
3. For different tables, data loading is parallel, but whenever 2 CSVs are there for the same table, the load for that table is sequential: one CSV first, then the other.
4. We have observed that loading 1 minute of CSV data into the table takes approximately 345 to 60 minutes, which is creating a huge backlog.
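
To make the load pattern in points 2 and 3 concrete, here is a minimal Python sketch of it: parallel across tables, sequential within one table. It is only an illustration of the scheduling, not our actual loader. The names load_csv, load_table_sequentially and run_interval are hypothetical, and load_csv is a placeholder for the real load into the table.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def load_csv(table, csv_path):
    # Hypothetical placeholder for the real load of one CSV into one table.
    return f"{table}<-{csv_path}"


def load_table_sequentially(table, csv_paths):
    # CSVs for the same table are loaded one after another, in arrival order.
    return [load_csv(table, path) for path in csv_paths]


def run_interval(pending):
    # pending: list of (table, csv_path) pairs that arrived for one interval.
    by_table = defaultdict(list)
    for table, path in pending:
        by_table[table].append(path)

    # One worker per table: different tables load in parallel,
    # while each table's own CSVs stay sequential.
    with ThreadPoolExecutor(max_workers=max(1, len(by_table))) as pool:
        futures = {
            table: pool.submit(load_table_sequentially, table, paths)
            for table, paths in by_table.items()
        }
        return {table: future.result() for table, future in futures.items()}
```

With this pattern, the time for one interval is bounded by the table that has the most CSVs queued, since its loads cannot overlap.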

We need your help to resolve this performance issue, as the data is of no use if 15 minutes of data takes more than 15 minutes to load. Users are not going to wait and will get fed up with this slowness.
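
The backlog concern can be put as simple arithmetic. The sketch below assumes one sequential loader per table and a fixed load time per batch. The function name backlog_minutes and the example figures are illustrative assumptions, not measurements from our cluster.

```python
def backlog_minutes(interval_min, load_min, intervals):
    # How far behind (in minutes) the loader is after `intervals` batches,
    # when a batch arrives every `interval_min` minutes and each batch
    # takes `load_min` minutes to load. No backlog if loads keep up.
    return max(0, (load_min - interval_min) * intervals)


# Illustrative figures only: a 15-minute batch that takes 20 minutes to load
# falls 5 minutes further behind with every interval.
print(backlog_minutes(15, 20, 4))
```

Whenever the load time exceeds the arrival interval, the lag grows linearly and never recovers on its own.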

Kindly advise.

Regards
Suyash Yadav



--
This message was sent by Atlassian Jira
(v8.3.4#803005)