Posted by
Jacky Li-3 on
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/DISCUSSION-Support-heterogeneous-format-segments-in-carbondata-tp84252p84896.html
IMHO
On 2019/09/11 06:46:21, chetan bhat <[hidden email]> wrote:
> Hi Ravi,
>
> 1. What data formats shall be supported for adding segments?
I think for the first phase we can target the tables that users may want to migrate to carbon, such as ORC and Parquet tables. In the future, we can also consider CSV.
> 2. Will alter table be supported after loading multiple segments, each having a different data format?
Since this feature only targets migrating legacy tables, I think we should keep it simple. So, no.
> 3. If a user wants to run a select query on certain segments only, using the set segments feature, will he/she be able to do so after this feature is implemented?
Yes, I think it should be supported.
> 4. Will index files be created for the segments created from external formats? If yes, will the merge index feature be supported?
Same as query 1, no.
>
> Regards
> Chetan
>
> On 2019/09/10 14:41:22, Ravindra Pesala <[hidden email]> wrote:
> > Hi All,
> >
> > This discussion is regarding support of other formats in carbon. Already
> > existing customers use other formats like parquet, orc etc., but if they
> > want to migrate to carbon there is no proper solution at hand. So this
> > feature allows all the old data to be added as a segment to carbondata. And
> > during query, it reads old data in its respective format and all new
> > segments will be read in carbon.
> >
> > I have created the design document and attached it to the jira. Please review
> > it.
> >
> > https://issues.apache.org/jira/browse/CARBONDATA-3516
> >
> > --
> > Thanks & Regards,
> > Ravindra
> >
>
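
To make the idea in this thread concrete, here is a rough sketch of a scan over a table whose segments are in different formats: each segment is read by the reader for its own format, and an optional filter restricts the scan to certain segments (in the spirit of question 3). This is only an illustration under my own assumptions. The names Segment, READERS, make_reader and scan, the stand-in readers, and the example paths are all hypothetical and are not CarbonData APIs or the design in the jira.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Optional, Set

# Hypothetical model for illustration only; none of these names are CarbonData APIs.
@dataclass(frozen=True)
class Segment:
    segment_id: str
    fmt: str   # format of the files in this segment, e.g. "carbon", "parquet", "orc"
    path: str  # example location of the segment data

Row = Dict[str, str]
Reader = Callable[[Segment], List[Row]]

def make_reader(fmt: str) -> Reader:
    """Build a stand-in reader; a real one would decode the files under segment.path."""
    def read(segment: Segment) -> List[Row]:
        return [{"segment": segment.segment_id, "read_as": fmt}]
    return read

# First-phase formats mentioned in the thread, plus native carbon segments.
READERS: Dict[str, Reader] = {f: make_reader(f) for f in ("carbon", "parquet", "orc")}

def scan(segments: Iterable[Segment], only: Optional[Set[str]] = None) -> List[Row]:
    """Read every segment with the reader for its own format.

    If `only` is given, segments whose id is not in it are skipped.
    """
    rows: List[Row] = []
    for seg in segments:
        if only is not None and seg.segment_id not in only:
            continue
        reader = READERS.get(seg.fmt)
        if reader is None:
            raise ValueError(f"unsupported segment format: {seg.fmt}")
        rows.extend(reader(seg))
    return rows

# Two legacy segments added as-is, then one new segment written in carbon format.
table = [
    Segment("0", "parquet", "/legacy/parquet"),
    Segment("1", "orc", "/legacy/orc"),
    Segment("2", "carbon", "/store/t1/Segment_2"),
]

if __name__ == "__main__":
    print(scan(table))              # all segments, each read in its own format
    print(scan(table, only={"1"}))  # restricted to segment 1
```

A format without a registered reader (CSV in this sketch) is rejected up front rather than silently skipped, so a possible later phase would only need to register one more reader.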