Hi All,
The Apache CarbonData PMC team is happy to annouce the release of Apache CarbonData version 1.0.0-incubating. Apache CarbonData(incubating) is an indexed columnar data format for fast analytics on big data platform, e.g.Apache Hadoop, Apache Spark, etc. The release notes is available at: https://cwiki.apache.org/confluence/display/CARBONDATA/Apache+CarbonData+1.0.0-incubating The release artifacts are available at: https://www.apache.org/dyn/closer.lua/incubator/carbondata/1.0.0-incubating You can follow this document to use these artifacts: https://github.com/apache/incubator-carbondata/blob/master/docs/quick-start-guide.md You can find the latest CarbonData document and learn more at: http://carbondata.incubator.apache.org Thanks The Apache CarbonData team ---- DISCLAIMER Apache CarbonData is an effort undergoing incubation at the Apache Software Foundation (ASF), sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF. |
Firstly, configuration to *Apache CarbonData 1.0.0-incubating released*
and Thanks for the great works. Test about CarbonData 1.0.0-incubating found that this version is better in availability, reliability and performance than previous ones. Especially the performance of loading data improved significantly. As well as, new features such as supporting update/delete functionality, integration with Spark 2.x and removing kettle for loading data solution, etc. are really amazing. In order to improve the dictionary module performance, I will keep to work for continuous improvement and optimization about "Double Array Trie". I am from MEITUAN which is (one of) the biggest O2O internet company in China, Query scenario we are facing is very complex and diverse, and CarbonData is just match some of that. so we are in the process of making a thorough research and plan to deploy CarbonData on our production environment. On Mon, Jan 30, 2017 at 12:01 PM, Jacky Li <[hidden email]> wrote: > Hi All, > > The Apache CarbonData PMC team is happy to annouce the release of Apache > CarbonData version 1.0.0-incubating. > > Apache CarbonData(incubating) is an indexed columnar data format for fast > analytics on big data platform, e.g.Apache Hadoop, Apache Spark, etc. > > The release notes is available at: > https://cwiki.apache.org/confluence/display/CARBONDATA/ > Apache+CarbonData+1.0.0-incubating > > The release artifacts are available at: > https://www.apache.org/dyn/closer.lua/incubator/ > carbondata/1.0.0-incubating > > You can follow this document to use these artifacts: > https://github.com/apache/incubator-carbondata/blob/ > master/docs/quick-start-guide.md > > You can find the latest CarbonData document and learn more at: > http://carbondata.incubator.apache.org > > Thanks > The Apache CarbonData team > > ---- > > DISCLAIMER > > Apache CarbonData is an effort undergoing incubation at the Apache > > Software Foundation (ASF), sponsored by the Apache Incubator PMC. > > > > Incubation is required of all newly accepted projects until a further > > review indicates that the infrastructure, communications, and decision > > making process have stabilized in a manner consistent with other > > successful ASF projects. > > > > While incubation status is not necessarily a reflection of the > > completeness or stability of the code, it does indicate that the > > project has yet to be fully endorsed by the ASF. > > > > |
Administrator
|
Hi xiaoqiao
Very happy to see that you will keep contributing on CarbonData, "Double Array Trie" is really a good feature to improve dictionary part. Yes, CarbonData's goal is for solving complex and diversity scenarios. Please let us(community) know if you deploy CarbonData on scenario system meets any issues and requires any helps. Regards Liang 2017-02-06 0:39 GMT+08:00 Xiaoqiao He <[hidden email]>: > Firstly, configuration to *Apache CarbonData 1.0.0-incubating released* > and Thanks for the great works. > > Test about CarbonData 1.0.0-incubating found that this version is better in > availability, reliability and performance than previous > ones. Especially the performance of loading data improved significantly. > > As well as, new features such as supporting update/delete functionality, > integration with Spark 2.x and removing kettle for loading data solution, > etc. are really amazing. In order to improve the dictionary module > performance, I will keep to work for continuous improvement and > optimization about "Double Array Trie". > > I am from MEITUAN which is (one of) the biggest O2O internet company in > China, Query scenario we are facing is very complex and diverse, and > CarbonData is just match some of that. so we are in the process of making a > thorough research and plan to deploy CarbonData on our production > environment. > > > On Mon, Jan 30, 2017 at 12:01 PM, Jacky Li <[hidden email]> wrote: > > > Hi All, > > > > The Apache CarbonData PMC team is happy to annouce the release of Apache > > CarbonData version 1.0.0-incubating. > > > > Apache CarbonData(incubating) is an indexed columnar data format for fast > > analytics on big data platform, e.g.Apache Hadoop, Apache Spark, etc. > > > > The release notes is available at: > > https://cwiki.apache.org/confluence/display/CARBONDATA/ > > Apache+CarbonData+1.0.0-incubating > > > > The release artifacts are available at: > > https://www.apache.org/dyn/closer.lua/incubator/ > > carbondata/1.0.0-incubating > > > > You can follow this document to use these artifacts: > > https://github.com/apache/incubator-carbondata/blob/ > > master/docs/quick-start-guide.md > > > > You can find the latest CarbonData document and learn more at: > > http://carbondata.incubator.apache.org > > > > Thanks > > The Apache CarbonData team > > > > ---- > > > > DISCLAIMER > > > > Apache CarbonData is an effort undergoing incubation at the Apache > > > > Software Foundation (ASF), sponsored by the Apache Incubator PMC. > > > > > > > > Incubation is required of all newly accepted projects until a further > > > > review indicates that the infrastructure, communications, and decision > > > > making process have stabilized in a manner consistent with other > > > > successful ASF projects. > > > > > > > > While incubation status is not necessarily a reflection of the > > > > completeness or stability of the code, it does indicate that the > > > > project has yet to be fully endorsed by the ASF. > > > > > > > > > -- Regards Liang |
Hi,
I have data stored in S3 and use AWS Athena to do ad-hoc queries. How can I leverage CarbonData for my business, please? On Sun, Feb 5, 2017 at 5:27 PM, Liang Chen <[hidden email]> wrote: > Hi xiaoqiao > > Very happy to see that you will keep contributing on CarbonData, "Double > Array Trie" is really a good feature to improve dictionary part. > > Yes, CarbonData's goal is for solving complex and diversity scenarios. > Please let us(community) know if you deploy CarbonData on scenario system > meets any issues and requires any helps. > > Regards > Liang > > > 2017-02-06 0:39 GMT+08:00 Xiaoqiao He <[hidden email]>: > > > Firstly, configuration to *Apache CarbonData 1.0.0-incubating released* > > and Thanks for the great works. > > > > Test about CarbonData 1.0.0-incubating found that this version is better > in > > availability, reliability and performance than previous > > ones. Especially the performance of loading data improved significantly. > > > > As well as, new features such as supporting update/delete functionality, > > integration with Spark 2.x and removing kettle for loading data solution, > > etc. are really amazing. In order to improve the dictionary module > > performance, I will keep to work for continuous improvement and > > optimization about "Double Array Trie". > > > > I am from MEITUAN which is (one of) the biggest O2O internet company in > > China, Query scenario we are facing is very complex and diverse, and > > CarbonData is just match some of that. so we are in the process of > making a > > thorough research and plan to deploy CarbonData on our production > > environment. > > > > > > On Mon, Jan 30, 2017 at 12:01 PM, Jacky Li <[hidden email]> wrote: > > > > > Hi All, > > > > > > The Apache CarbonData PMC team is happy to annouce the release of > Apache > > > CarbonData version 1.0.0-incubating. > > > > > > Apache CarbonData(incubating) is an indexed columnar data format for > fast > > > analytics on big data platform, e.g.Apache Hadoop, Apache Spark, etc. > > > > > > The release notes is available at: > > > https://cwiki.apache.org/confluence/display/CARBONDATA/ > > > Apache+CarbonData+1.0.0-incubating > > > > > > The release artifacts are available at: > > > https://www.apache.org/dyn/closer.lua/incubator/ > > > carbondata/1.0.0-incubating > > > > > > You can follow this document to use these artifacts: > > > https://github.com/apache/incubator-carbondata/blob/ > > > master/docs/quick-start-guide.md > > > > > > You can find the latest CarbonData document and learn more at: > > > http://carbondata.incubator.apache.org > > > > > > Thanks > > > The Apache CarbonData team > > > > > > ---- > > > > > > DISCLAIMER > > > > > > Apache CarbonData is an effort undergoing incubation at the Apache > > > > > > Software Foundation (ASF), sponsored by the Apache Incubator PMC. > > > > > > > > > > > > Incubation is required of all newly accepted projects until a further > > > > > > review indicates that the infrastructure, communications, and decision > > > > > > making process have stabilized in a manner consistent with other > > > > > > successful ASF projects. > > > > > > > > > > > > While incubation status is not necessarily a reflection of the > > > > > > completeness or stability of the code, it does indicate that the > > > > > > project has yet to be fully endorsed by the ASF. > > > > > > > > > > > > > > > > > > -- > Regards > Liang > |
Administrator
|
Hi relatall
Looks that AWS Athena doesn't support CarbonData format currently. Maybe you can try your ad hoc queries on CarbonData(HDFS)HDSF+Spark directly. Regards Liang 2017-02-06 19:06 GMT-05:00 <[hidden email]>: > Hi, > I have data stored in S3 and use AWS Athena to do ad-hoc queries. How can > I leverage CarbonData for my business, please? > > On Sun, Feb 5, 2017 at 5:27 PM, Liang Chen <[hidden email]> > wrote: > > > Hi xiaoqiao > > > > Very happy to see that you will keep contributing on CarbonData, "Double > > Array Trie" is really a good feature to improve dictionary part. > > > > Yes, CarbonData's goal is for solving complex and diversity scenarios. > > Please let us(community) know if you deploy CarbonData on scenario system > > meets any issues and requires any helps. > > > > Regards > > Liang > > > > > > 2017-02-06 0:39 GMT+08:00 Xiaoqiao He <[hidden email]>: > > > > > Firstly, configuration to *Apache CarbonData 1.0.0-incubating > released* > > > and Thanks for the great works. > > > > > > Test about CarbonData 1.0.0-incubating found that this version is > better > > in > > > availability, reliability and performance than previous > > > ones. Especially the performance of loading data improved > significantly. > > > > > > As well as, new features such as supporting update/delete > functionality, > > > integration with Spark 2.x and removing kettle for loading data > solution, > > > etc. are really amazing. In order to improve the dictionary module > > > performance, I will keep to work for continuous improvement and > > > optimization about "Double Array Trie". > > > > > > I am from MEITUAN which is (one of) the biggest O2O internet company in > > > China, Query scenario we are facing is very complex and diverse, and > > > CarbonData is just match some of that. so we are in the process of > > making a > > > thorough research and plan to deploy CarbonData on our production > > > environment. > > > > > > > > > On Mon, Jan 30, 2017 at 12:01 PM, Jacky Li <[hidden email]> wrote: > > > > > > > Hi All, > > > > > > > > The Apache CarbonData PMC team is happy to annouce the release of > > Apache > > > > CarbonData version 1.0.0-incubating. > > > > > > > > Apache CarbonData(incubating) is an indexed columnar data format for > > fast > > > > analytics on big data platform, e.g.Apache Hadoop, Apache Spark, etc. > > > > > > > > The release notes is available at: > > > > https://cwiki.apache.org/confluence/display/CARBONDATA/ > > > > Apache+CarbonData+1.0.0-incubating > > > > > > > > The release artifacts are available at: > > > > https://www.apache.org/dyn/closer.lua/incubator/ > > > > carbondata/1.0.0-incubating > > > > > > > > You can follow this document to use these artifacts: > > > > https://github.com/apache/incubator-carbondata/blob/ > > > > master/docs/quick-start-guide.md > > > > > > > > You can find the latest CarbonData document and learn more at: > > > > http://carbondata.incubator.apache.org > > > > > > > > Thanks > > > > The Apache CarbonData team > > > > > > > > ---- > > > > > > > > DISCLAIMER > > > > > > > > Apache CarbonData is an effort undergoing incubation at the Apache > > > > > > > > Software Foundation (ASF), sponsored by the Apache Incubator PMC. > > > > > > > > > > > > > > > > Incubation is required of all newly accepted projects until a further > > > > > > > > review indicates that the infrastructure, communications, and > decision > > > > > > > > making process have stabilized in a manner consistent with other > > > > > > > > successful ASF projects. > > > > > > > > > > > > > > > > While incubation status is not necessarily a reflection of the > > > > > > > > completeness or stability of the code, it does indicate that the > > > > > > > > project has yet to be fully endorsed by the ASF. > > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > Regards > > Liang > > > -- Regards Liang |
Free forum by Nabble | Edit this page |