
Re: White Space does not trim in CarbonData

Posted by Jacky Li on May 17, 2017; 6:51am
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/White-Space-does-not-trim-in-CarbonData-tp12736p12756.html

Hi,

Generally speaking, I think it is preferable to make the system more tolerant of input data, so that it is simpler to use from the user's point of view. I also think it is safe to align with other formats such as ORC and Parquet.

Regards,
Jacky

> On May 17, 2017, at 12:30 PM, manish gupta <[hidden email]> wrote:
>
> Hi Vin,
>
> The value is displayed as null because the data is not being trimmed. By
> default, CarbonData does not modify any data, but there is an option to
> verify your data.
>
> 1. While loading the data, set the property
> "BAD_RECORDS_LOGGER_ENABLE=true". This creates a bad record logger in
> which you can see which records are invalid.
>
> 2. Secondly, trimming of data is disabled by default in the code:
>
> parserSettings.setIgnoreLeadingWhitespaces(false);
> parserSettings.setIgnoreTrailingWhitespaces(false);
>
>
> We can provide an option during data load to trim whitespace, with a
> default value of false to keep the current behavior intact. For example:
> load data inpath 'D:/my_user2.csv' into table my_user_c
> OPTIONS('TRIM_WHITESPACE'='TRUE')
>
> @Dev team, please share your input on the second option.
>
> Regards
> Manish Gupta
>
>
> On Wed, May 17, 2017 at 7:47 AM, David Cai <[hidden email]> wrote:
>
>> I suggest trimming white space before converting a string value to a
>> numeric value.
>>
>>
>>
>> -----
>> Best Regards
>> David Cai
>>
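
To illustrate the behaviour discussed in this thread, here is a minimal Java sketch of why an untrimmed numeric field can end up as null, and how trimming before conversion avoids that. It is not CarbonData code: the class TrimBeforeParse, the method parseIntField and the trimWhitespace flag are hypothetical names chosen for this example, with the flag standing in for the proposed TRIM_WHITESPACE load option.

```java
// Hypothetical illustration only; not taken from the CarbonData code base.
class TrimBeforeParse {

    /**
     * Parses a raw CSV field as an Integer.
     * Returns null when the text is not a valid integer, which mimics
     * an unparsable value being displayed as null.
     *
     * @param raw            field text exactly as read from the file
     * @param trimWhitespace assumed stand-in for the proposed TRIM_WHITESPACE option
     */
    static Integer parseIntField(String raw, boolean trimWhitespace) {
        if (raw == null) {
            return null;
        }
        // Trim only when asked to, so the default (false) leaves the data untouched.
        String value = trimWhitespace ? raw.trim() : raw;
        try {
            return Integer.valueOf(value);
        } catch (NumberFormatException e) {
            // Integer.valueOf rejects leading or trailing spaces.
            return null;
        }
    }

    public static void main(String[] args) {
        String field = " 42 ";
        System.out.println(parseIntField(field, false)); // prints "null"
        System.out.println(parseIntField(field, true));  // prints "42"
    }
}
```

With the flag off, the field is passed through unchanged and fails to parse, which matches the current default described above. With it on, the surrounding spaces are removed first and the value converts normally.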