Manish Gupta created CARBONDATA-1062:
----------------------------------------
Summary: Data load fails if a column specified as sort column is of numeric data type
Key: CARBONDATA-1062
URL:
https://issues.apache.org/jira/browse/CARBONDATA-1062 Project: CarbonData
Issue Type: Bug
Reporter: Manish Gupta
Assignee: Manish Gupta
Priority: Minor
Fix For: 1.2.0
If a numeric data type column is specified as sort column and if it contains non numeric value then data load fails with the below error.
ERROR UnsafeBatchParallelReadMergeSorterImpl: pool-20-thread-1
java.lang.ClassCastException: java.lang.String cannot be cast to [B
at org.apache.carbondata.processing.newflow.sort.unsafe.UnsafeCarbonRowPage.addRow(UnsafeCarbonRowPage.java:89)
at org.apache.carbondata.processing.newflow.sort.unsafe.UnsafeCarbonRowPage.addRow(UnsafeCarbonRowPage.java:74)
at org.apache.carbondata.processing.newflow.sort.unsafe.UnsafeSortDataRows.addRowBatch(UnsafeSortDataRows.java:170)
at org.apache.carbondata.processing.newflow.sort.impl.UnsafeBatchParallelReadMergeSorterImpl$SortIteratorThread.call(UnsafeBatchParallelReadMergeSorterImpl.java:150)
at org.apache.carbondata.processing.newflow.sort.impl.UnsafeBatchParallelReadMergeSorterImpl$SortIteratorThread.call(UnsafeBatchParallelReadMergeSorterImpl.java:117)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
Steps to reproduce
------------------------------
CREATE TABLE test_sort_col (id INT, name STRING, age INT) STORED BY 'org.apache.carbondata.format' TBLPROPERTIES('SORT_COLUMNS'='id,age')
LOAD DATA local inpath '<CSV_file_path>' INTO TABLE test_sort_col
select * from test_sort_col
Data
-----------
id,name,age
1,Pallavi,25
2,Rahul,24
3,Prabhat,twenty six
7,Neha,25
2,Geetika,22
3,Sangeeta,26
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)