Apache CarbonData Dev Mailing List archive

Why INT type is stored like BIGINT?

Classic

List

Threaded

3 messages Options

cenyuhai

Dec 04, 2016; 7:58am

Why INT type is stored like BIGINT?

Hi, all:
I find that INT is is stored like BIGINT? Why?

ravipesala

Dec 04, 2016; 5:43pm

Re: Why INT type is stored like BIGINT?

Hi,

Since we use delta compression for measure types in carbondata , it stores
the data with least datatype as per the values in blocklet. So it does not
matter whether we store INT or BIGINT in carbondata files, it always use
least datatype to store.

Regards,
Ravi

On 4 December 2016 at 13:28, Sea <[hidden email]> wrote:

> Hi, all:
> I find that INT is is stored like BIGINT? Why?

--
Thanks & Regards,
Ravi

Jacky Li

Dec 05, 2016; 1:51pm

Re: Why INT type is stored like BIGINT?

Hi,

As Ravindra explained, when writing to file, INT is stored using least datatype adaptively according to the actual data in the column chunk, it could be byte, or short, or int. But during decoding and encoding, it does use long (bigint) as temporary structure. I am working on a patch to optimize this part.

Regards,
Jacky

> 在 2016年12月5日，上午1:43，Ravindra Pesala <[hidden email]> 写道：
>
> Hi,
>
> Since we use delta compression for measure types in carbondata , it stores
> the data with least datatype as per the values in blocklet. So it does not
> matter whether we store INT or BIGINT in carbondata files, it always use
> least datatype to store.
>
> Regards,
> Ravi
>
> On 4 December 2016 at 13:28, Sea <[hidden email]> wrote:
>
>> Hi, all:
>> I find that INT is is stored like BIGINT? Why?
>
>
>
>
> --
> Thanks & Regards,
> Ravi

... [show rest of quote]