[jira] [Resolved] (CARBONDATA-3695) Apache CarbonData should provides python interface to support deep learning framework PyTorch to ready and write data from/to CarbonData

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Resolved] (CARBONDATA-3695) Apache CarbonData should provides python interface to support deep learning framework PyTorch to ready and write data from/to CarbonData

Akash R Nilugal (Jira)

     [ https://issues.apache.org/jira/browse/CARBONDATA-3695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ajantha Bhat resolved CARBONDATA-3695.
--------------------------------------
    Fix Version/s: 2.0.0
       Resolution: Fixed

> Apache CarbonData should provides python interface to support deep learning framework PyTorch to ready and write data from/to CarbonData
> ----------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: CARBONDATA-3695
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-3695
>             Project: CarbonData
>          Issue Type: Sub-task
>            Reporter: Bo Xu
>            Assignee: Bo Xu
>            Priority: Major
>             Fix For: 2.0.0
>
>          Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> Nowadays AI model training is getting more and more popular. Currently many AI framework uses raw data files or row format data files for model training, it could not provide projection, filtering, and fast scan capability like in columnar store. So, if CarbonData supports AI framework, it can speed up model training by increase IO throughput, and provide more flexible training set selection ability to AI developers
> AI compute engine integration:
> PyTorch integration: New python API in pycarbon to support PyTorch to read data from CarbonData files for training model



--
This message was sent by Atlassian Jira
(v8.3.4#803005)