This post was updated on .
Hi Community,
Currently we have datamaps like,* default datamaps* which are block and blocklet and *coarse grained datamaps* like bloom, and *fine grained datamaps* like lucene which helps in better pruning during query. What if we introduce another kind of datamap which can hold blockletId as index? Initial level, we call it as index which will work as a child table to the main table like we have MV in our current code. Yes, lets introduce the secondary index to carbon table which will be the child table to main table and it can be created on column like we create lucene datamap, where we give index columns to create index. In a similar way, we create secondary index on column, so indexes on these column will be blocklet IDs which will help in better pruning and faster query when we have a filter query on the index column. Currenlty we will take it as index table and then later part we will make it inline to datamap interface. So design document is attached in JIRA, please give your suggestion/inputs. JIRA Link: CARBONDATA-3680 <https://issues.apache.org/jira/browse/CARBONDATA-3680> Thanks & Regards, Indhumathi M |
+1
Regards, Ravindra. On Wed, 5 Feb 2020 at 8:03 PM, Indhumathi M <[hidden email]> wrote: > Hi Community, > > Currently we have datamaps like,* default datamaps* which are block and > blocklet and *coarse grained datamaps* like bloom, and *fine grained > datamaps* like lucene > which helps in better pruning during query. What if we introduce another > kind of datamap which can hold blockletId as index? Initial level, we call > it as index which > will work as a child table to the main table like we have MV in our current > code. > > Yes, lets introduce the secondary index to carbon table which will be the > child table to main table and it can be created on column like we create > lucene datamap, > where we give index columns to create index. In a similar way, we create > secondary index on column, so indexes on these column will be blocklet IDs > which will > help in better pruning and faster query when we have a filter query on the > index column. > > Currenlty we will take it as index table and then later part we will make > it inline to datamap interface. > > So design document is attached in JIRA, please give your suggestion/inputs. > > JIRA Link: CARBONDATA-3680 > <https://issues.apache.org/jira/browse/CARBONDATA-3680> > > Thanks & Regards, > Indhumathi M > Thanks & Regards, Ravi |
+1
On Wed, 5 Feb, 2020, 8:02 pm Ravindra Pesala, <[hidden email]> wrote: > +1 > > Regards, > Ravindra. > > On Wed, 5 Feb 2020 at 8:03 PM, Indhumathi M <[hidden email]> > wrote: > > > Hi Community, > > > > Currently we have datamaps like,* default datamaps* which are block and > > blocklet and *coarse grained datamaps* like bloom, and *fine grained > > datamaps* like lucene > > which helps in better pruning during query. What if we introduce another > > kind of datamap which can hold blockletId as index? Initial level, we > call > > it as index which > > will work as a child table to the main table like we have MV in our > current > > code. > > > > Yes, lets introduce the secondary index to carbon table which will be the > > child table to main table and it can be created on column like we create > > lucene datamap, > > where we give index columns to create index. In a similar way, we create > > secondary index on column, so indexes on these column will be blocklet > IDs > > which will > > help in better pruning and faster query when we have a filter query on > the > > index column. > > > > Currenlty we will take it as index table and then later part we will make > > it inline to datamap interface. > > > > So design document is attached in JIRA, please give your > suggestion/inputs. > > > > JIRA Link: CARBONDATA-3680 > > <https://issues.apache.org/jira/browse/CARBONDATA-3680> > > > > Thanks & Regards, > > Indhumathi M > > > -- > Thanks & Regards, > Ravi > |
+1
-Regards Kumar Vishal On Wed, 5 Feb 2020 at 8:08 PM, Ajantha Bhat <[hidden email]> wrote: > +1 > > On Wed, 5 Feb, 2020, 8:02 pm Ravindra Pesala, <[hidden email]> > wrote: > > > +1 > > > > Regards, > > Ravindra. > > > > On Wed, 5 Feb 2020 at 8:03 PM, Indhumathi M <[hidden email]> > > wrote: > > > > > Hi Community, > > > > > > Currently we have datamaps like,* default datamaps* which are block and > > > blocklet and *coarse grained datamaps* like bloom, and *fine grained > > > datamaps* like lucene > > > which helps in better pruning during query. What if we introduce > another > > > kind of datamap which can hold blockletId as index? Initial level, we > > call > > > it as index which > > > will work as a child table to the main table like we have MV in our > > current > > > code. > > > > > > Yes, lets introduce the secondary index to carbon table which will be > the > > > child table to main table and it can be created on column like we > create > > > lucene datamap, > > > where we give index columns to create index. In a similar way, we > create > > > secondary index on column, so indexes on these column will be blocklet > > IDs > > > which will > > > help in better pruning and faster query when we have a filter query on > > the > > > index column. > > > > > > Currenlty we will take it as index table and then later part we will > make > > > it inline to datamap interface. > > > > > > So design document is attached in JIRA, please give your > > suggestion/inputs. > > > > > > JIRA Link: CARBONDATA-3680 > > > <https://issues.apache.org/jira/browse/CARBONDATA-3680> > > > > > > Thanks & Regards, > > > Indhumathi M > > > > > -- > > Thanks & Regards, > > Ravi > > >
kumar vishal
|
In reply to this post by Indhumathi
+1
Thanks Kunal Kapoor On Wed, Feb 5, 2020, 5:33 PM Indhumathi M <[hidden email]> wrote: > Hi Community, > > Currently we have datamaps like,* default datamaps* which are block and > blocklet and *coarse grained datamaps* like bloom, and *fine grained > datamaps* like lucene > which helps in better pruning during query. What if we introduce another > kind of datamap which can hold blockletId as index? Initial level, we call > it as index which > will work as a child table to the main table like we have MV in our current > code. > > Yes, lets introduce the secondary index to carbon table which will be the > child table to main table and it can be created on column like we create > lucene datamap, > where we give index columns to create index. In a similar way, we create > secondary index on column, so indexes on these column will be blocklet IDs > which will > help in better pruning and faster query when we have a filter query on the > index column. > > Currenlty we will take it as index table and then later part we will make > it inline to datamap interface. > > So design document is attached in JIRA, please give your suggestion/inputs. > > JIRA Link: CARBONDATA-3680 > <https://issues.apache.org/jira/browse/CARBONDATA-3680> > > Thanks & Regards, > Indhumathi M > |
+1
Thanks for proposing this :) Regards, Jacky ------------------ 原始邮件 ------------------ 发件人: "Kunal Kapoor"<[hidden email]>; 发送时间: 2020年2月6日(星期四) 凌晨2:13 收件人: "dev"<[hidden email]>; 主题: Re: [Discussion] Support Secondary Index on Carbon Table +1 Thanks Kunal Kapoor On Wed, Feb 5, 2020, 5:33 PM Indhumathi M <[hidden email]> wrote: > Hi Community, > > Currently we have datamaps like,* default datamaps* which are block and > blocklet and *coarse grained datamaps* like bloom, and *fine grained > datamaps* like lucene > which helps in better pruning during query. What if we introduce another > kind of datamap which can hold blockletId as index? Initial level, we call > it as index which > will work as a child table to the main table like we have MV in our current > code. > > Yes, lets introduce the secondary index to carbon table which will be the > child table to main table and it can be created on column like we create > lucene datamap, > where we give index columns to create index. In a similar way, we create > secondary index on column, so indexes on these column will be blocklet IDs > which will > help in better pruning and faster query when we have a filter query on the > index column. > > Currenlty we will take it as index table and then later part we will make > it inline to datamap interface. > > So design document is attached in JIRA, please give your suggestion/inputs. > > JIRA Link: CARBONDATA-3680 > <https://issues.apache.org/jira/browse/CARBONDATA-3680> > > Thanks & Regards, > Indhumathi M > |
In reply to this post by Indhumathi
+1
I have a suggestion. Comparatively, query hint will be better to avoid queries push downed to SI table. Example: SELECT /*disable_si*/ * FROM main_table WHERE name='abc' Regards, Zhi Liu -- Sent from: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/ |
In reply to this post by Indhumathi
+1
----- Best Regards David Cai -- Sent from: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/
Best Regards
David Cai |
+1
Regards Manish Gupta On Thu, 6 Feb 2020 at 1:50 PM, David CaiQiang <[hidden email]> wrote: > +1 > > > > ----- > Best Regards > David Cai > -- > Sent from: > http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/ > |
Free forum by Nabble | Edit this page |