[jira] [Updated] (CARBONDATA-3283) Apache CarbonData should provides python interface to manage and analysis data based on Apache Spark. Apache CarbonData should support DDL, DML, DataMap feature in Python.

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Updated] (CARBONDATA-3283) Apache CarbonData should provides python interface to manage and analysis data based on Apache Spark. Apache CarbonData should support DDL, DML, DataMap feature in Python.

Akash R Nilugal (Jira)

     [ https://issues.apache.org/jira/browse/CARBONDATA-3283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bo Xu updated CARBONDATA-3283:
------------------------------
    Description:
Apache CarbonData should provides python interface to manage and analysis data based on Apache Spark. Apache CarbonData should support DDL, DML, DataMap feature in Python.

Goals:

1). PyCarbon support  read data from local/HDFS/S3 in python code by PySpark DataFrame
2). PyCarbon support  write data in python code to local/HDFS/S3 by PySpark DataFrame
3). PyCarbon support  DDL in python with sql format
4). PyCarbon support  DML in python with sql format
5). PyCarbon support  DataMap in python with sql format

  was:
Apache CarbonData should provides python interface to manage and analysis data based on Apache Spark. Apache CarbonData should support DDL, DML, DataMap feature in Python.

TODO:


> Apache CarbonData should provides python interface to manage and analysis data based on Apache Spark. Apache CarbonData should support DDL, DML, DataMap feature in Python.
> ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: CARBONDATA-3283
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-3283
>             Project: CarbonData
>          Issue Type: Sub-task
>            Reporter: Bo Xu
>            Assignee: Bo Xu
>            Priority: Major
>
> Apache CarbonData should provides python interface to manage and analysis data based on Apache Spark. Apache CarbonData should support DDL, DML, DataMap feature in Python.
> Goals:
> 1). PyCarbon support  read data from local/HDFS/S3 in python code by PySpark DataFrame
> 2). PyCarbon support  write data in python code to local/HDFS/S3 by PySpark DataFrame
> 3). PyCarbon support  DDL in python with sql format
> 4). PyCarbon support  DML in python with sql format
> 5). PyCarbon support  DataMap in python with sql format



--
This message was sent by Atlassian Jira
(v8.3.4#803005)