Login  Register

Re: Open Discussion:Apache CarbonData Roadmap

Posted by Jean-Baptiste Onofré on Aug 09, 2016; 5:00am
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/Open-Discussion-Apache-CarbonData-Roadmap-tp49p51.html

Hi Liang,

it sounds good.

Any plan to support Apache Beam (instead of Spark directly) ?

Regards
JB

On 08/09/2016 06:02 AM, chenliang613 wrote:

> HiI would like to start one discussion thread for Apache CarbonData
> Roadmap.Your any input and comments would be very appreciated!
> Apache CarbonData 0.1.0-incubating
> Support integration with Apache Spark1.5.2,1.6.1,1.6.2Support integration
> with Apache Hadoop 2.2 later versionColumnar data storeFully Index: it can
> significantly accelerate query performance and reduces the I/O scans and CPU
> resources, where there are filters in the query. it can also do skip scan in
> more finer grain unit (called blocklet) in task side scanning instead of
> scanning the whole file.Global Multi Dimensional Keys(MDK) based B+Tree
> Index for all non-measure columnsMin-Max Index for all columns:.Inverted
> index for all dimensionsOperable encoded data :Through supporting efficient
> compression and global encoding schemes, can query on compressed/encoded
> data, the data can be converted just before returning the results to the
> users, which is "late materialized".Column group: Allow multiple columns to
> form a column group that would be stored as row format. This reduces the row
> reconstruction cost at query time.Supports for various use cases with one
> single Data format : like interactive OLAP-style query, Sequential Access
> (big scan), Random Access (narrow scan).
> Apache CarbonData 0.2.0-incubating
> Support integration with Apache Spark 2.1Support Map data
> type(CARBONDATA-45)Support create carbondata table select from other
> datastore’s tableFor supporting more flexible data load, remove
> kettleSupport CarbonDataOutputFormat.RegardsLiang
>
>
>
> --
> View this message in context: http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/Open-Discussion-Apache-CarbonData-Roadmap-tp49.html
> Sent from the Apache CarbonData Mailing List archive mailing list archive at Nabble.com.
>

--
Jean-Baptiste Onofré
[hidden email]
http://blog.nanthrax.net
Talend - http://www.talend.com