Posted by
Indhumathi on
Sep 14, 2018; 6:30am
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/DISCUSSION-Support-Binary-DataType-tp62435.html
Hello All,
I am working on supporting Binary DataType. Please find below
the scope and design approach for the same.
**Scope:**
1. Create Table DDL support for Binary Data type column.
2. Support loading of data for binary data type column [DataLoad && Insert
into DDL].
3. Support querying binary data type columns.
4. Describe formatted support to display the binary data type column.
**Proposed Solution:**
1. Implement a Binary converter BinaryFieldConverterImpl, which takes
CarbonRow as
input and converts data to Hex decoded ByteArray during
RowConverterImpl step.
2. Create a Column Page for Binary Datatype similar to BYTE_ARRAY page.
3. Encoding type for Binary data type is DIRECT_COMPRESS and compress data
using getLVFlattenedBytePage() and return encoded data.
While decoding, check whether if column is of BINARY datatype,
and decode it to a newBinaryPage().
4. For Querying Binary data type columns, implement a BinaryVectorFiller,
which fills byte array data to CarbonColumnVector.
Please provide your inputs and comments. Any suggestion from community is
most welcomed.
Regards,
Indhumathi
--
Sent from:
http://apache-carbondata-dev-mailing-list-archive.1130556.n5.nabble.com/