Apache CarbonData Dev Mailing List archive

[Discussion] Support Secondary Index on Carbon Table

Classic

List

Threaded

9 messages Options

Indhumathi

[Discussion] Support Secondary Index on Carbon Table

This post was updated on .

Hi Community,

Currently we have datamaps like,* default datamaps* which are block and
blocklet and *coarse grained datamaps* like bloom, and *fine grained
datamaps* like lucene which helps in better pruning during query. What if we
introduce another kind of datamap which can hold blockletId as index? Initial level,
we call it as index which will work as a child table to the main table like we have
MV in our current code.

Yes, lets introduce the secondary index to carbon table which will be the
child table to main table and it can be created on column like we create
lucene datamap, where we give index columns to create index. In a similar way,
we create secondary index on column, so indexes on these column will be blocklet IDs
which will help in better pruning and faster query when we have a filter query on the
index column.

Currenlty we will take it as index table and then later part we will make
it inline to datamap interface.

So design document is attached in JIRA, please give your suggestion/inputs.

JIRA Link: CARBONDATA-3680
<https://issues.apache.org/jira/browse/CARBONDATA-3680>

Thanks & Regards,
Indhumathi M

ravipesala

Re: [Discussion] Support Secondary Index on Carbon Table

+1

Regards,
Ravindra.

On Wed, 5 Feb 2020 at 8:03 PM, Indhumathi M <[hidden email]> wrote:

> Hi Community,
>
> Currently we have datamaps like,* default datamaps* which are block and
> blocklet and *coarse grained datamaps* like bloom, and *fine grained
> datamaps* like lucene
> which helps in better pruning during query. What if we introduce another
> kind of datamap which can hold blockletId as index? Initial level, we call
> it as index which
> will work as a child table to the main table like we have MV in our current
> code.
>
> Yes, lets introduce the secondary index to carbon table which will be the
> child table to main table and it can be created on column like we create
> lucene datamap,
> where we give index columns to create index. In a similar way, we create
> secondary index on column, so indexes on these column will be blocklet IDs
> which will
> help in better pruning and faster query when we have a filter query on the
> index column.
>
> Currenlty we will take it as index table and then later part we will make
> it inline to datamap interface.
>
> So design document is attached in JIRA, please give your suggestion/inputs.
>
> JIRA Link: CARBONDATA-3680
> <https://issues.apache.org/jira/browse/CARBONDATA-3680>
>
> Thanks & Regards,
> Indhumathi M
>

--
Thanks & Regards,
Ravi

Ajantha Bhat

Re: [Discussion] Support Secondary Index on Carbon Table

+1

On Wed, 5 Feb, 2020, 8:02 pm Ravindra Pesala, <[hidden email]> wrote:

> +1
>
> Regards,
> Ravindra.
>
> On Wed, 5 Feb 2020 at 8:03 PM, Indhumathi M <[hidden email]>
> wrote:
>
> > Hi Community,
> >
> > Currently we have datamaps like,* default datamaps* which are block and
> > blocklet and *coarse grained datamaps* like bloom, and *fine grained
> > datamaps* like lucene
> > which helps in better pruning during query. What if we introduce another
> > kind of datamap which can hold blockletId as index? Initial level, we
> call
> > it as index which
> > will work as a child table to the main table like we have MV in our
> current
> > code.
> >
> > Yes, lets introduce the secondary index to carbon table which will be the
> > child table to main table and it can be created on column like we create
> > lucene datamap,
> > where we give index columns to create index. In a similar way, we create
> > secondary index on column, so indexes on these column will be blocklet
> IDs
> > which will
> > help in better pruning and faster query when we have a filter query on
> the
> > index column.
> >
> > Currenlty we will take it as index table and then later part we will make
> > it inline to datamap interface.
> >
> > So design document is attached in JIRA, please give your
> suggestion/inputs.
> >
> > JIRA Link: CARBONDATA-3680
> > <https://issues.apache.org/jira/browse/CARBONDATA-3680>
> >
> > Thanks & Regards,
> > Indhumathi M
> >
> --
> Thanks & Regards,
> Ravi
>

kumarvishal09

Re: [Discussion] Support Secondary Index on Carbon Table

+1
-Regards
Kumar Vishal

On Wed, 5 Feb 2020 at 8:08 PM, Ajantha Bhat <[hidden email]> wrote:

> +1
>
> On Wed, 5 Feb, 2020, 8:02 pm Ravindra Pesala, <[hidden email]>
> wrote:
>
> > +1
> >
> > Regards,
> > Ravindra.
> >
> > On Wed, 5 Feb 2020 at 8:03 PM, Indhumathi M <[hidden email]>
> > wrote:
> >
> > > Hi Community,
> > >
> > > Currently we have datamaps like,* default datamaps* which are block and
> > > blocklet and *coarse grained datamaps* like bloom, and *fine grained
> > > datamaps* like lucene
> > > which helps in better pruning during query. What if we introduce
> another
> > > kind of datamap which can hold blockletId as index? Initial level, we
> > call
> > > it as index which
> > > will work as a child table to the main table like we have MV in our
> > current
> > > code.
> > >
> > > Yes, lets introduce the secondary index to carbon table which will be
> the
> > > child table to main table and it can be created on column like we
> create
> > > lucene datamap,
> > > where we give index columns to create index. In a similar way, we
> create
> > > secondary index on column, so indexes on these column will be blocklet
> > IDs
> > > which will
> > > help in better pruning and faster query when we have a filter query on
> > the
> > > index column.
> > >
> > > Currenlty we will take it as index table and then later part we will
> make
> > > it inline to datamap interface.
> > >
> > > So design document is attached in JIRA, please give your
> > suggestion/inputs.
> > >
> > > JIRA Link: CARBONDATA-3680
> > > <https://issues.apache.org/jira/browse/CARBONDATA-3680>
> > >
> > > Thanks & Regards,
> > > Indhumathi M
> > >
> > --
> > Thanks & Regards,
> > Ravi
> >
>

kumar vishal

kunalkapoor

Re: [Discussion] Support Secondary Index on Carbon Table

In reply to this post by Indhumathi

+1

Thanks
Kunal Kapoor

On Wed, Feb 5, 2020, 5:33 PM Indhumathi M <[hidden email]> wrote:

Jacky Li

回复： [Discussion] Support Secondary Index on Carbon Table

+1 
Thanks for proposing this :)

Regards,
Jacky

------------------ 原始邮件 ------------------
发件人: "Kunal Kapoor"<[hidden email]>;
发送时间: 2020年2月6日(星期四) 凌晨2:13
收件人: "dev"<[hidden email]>;

主题: Re: [Discussion] Support Secondary Index on Carbon Table

+1

Thanks
Kunal Kapoor

On Wed, Feb 5, 2020, 5:33 PM Indhumathi M <[hidden email]> wrote:

> Hi Community,
>
> Currently we have datamaps like,* default datamaps* which are block and
> blocklet and *coarse grained datamaps* like bloom, and *fine grained
> datamaps* like lucene
> which helps in better pruning during query. What if we introduce another
> kind of datamap which can hold blockletId as index? Initial level, we call
> it as index which
> will work as a child table to the main table like we have MV in our current
> code.
>
> Yes, lets introduce the secondary index to carbon table which will be the
> child table to main table and it can be created on column like we create
> lucene datamap,
> where we give index columns to create index. In a similar way, we create
> secondary index on column, so indexes on these column will be blocklet IDs
> which will
> help in better pruning and faster query when we have a filter query on the
> index column.
>
> Currenlty we will take it as index table and then later part we will make
> it inline to datamap interface.
>
> So design document is attached in JIRA, please give your suggestion/inputs.
>
> JIRA Link: CARBONDATA-3680
> <https://issues.apache.org/jira/browse/CARBONDATA-3680>
>
> Thanks & Regards,
> Indhumathi M
>

niuge

Re: [Discussion] Support Secondary Index on Carbon Table

In reply to this post by Indhumathi

+1

I have a suggestion.

Comparatively, query hint will be better to avoid queries push downed to SI
table.
Example:
SELECT /*disable_si*/ * FROM main_table WHERE name='abc'

Regards,
Zhi Liu

--
Sent from: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/

David CaiQiang

Re: [Discussion] Support Secondary Index on Carbon Table

In reply to this post by Indhumathi

+1

-----
Best Regards
David Cai
--
Sent from: http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/

Best Regards
David Cai

manishgupta88

Re: [Discussion] Support Secondary Index on Carbon Table

+1

Regards
Manish Gupta

On Thu, 6 Feb 2020 at 1:50 PM, David CaiQiang <[hidden email]> wrote:

> +1
>
>
>
> -----
> Best Regards
> David Cai
> --
> Sent from:
> http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/
>