Login  Register

RE: Discussion about using multi local directorys to improve dataloading perfomance

Posted by Jihong Ma on Oct 10, 2016; 6:26pm
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/Discussion-about-using-multi-local-directorys-to-improve-dataloading-perfomance-tp1678p1731.html

Agree, help boost performance.

Jenny

-----Original Message-----
From: Jacky Li [mailto:[hidden email]]
Sent: Saturday, October 08, 2016 9:09 AM
To: [hidden email]
Subject: Re: Discussion about using multi local directorys to improve dataloading perfomance

Yes, I think it is a good feature to have. Please feel free to create JIRA issue and Pull Request.

Regards,
Jacky

> 在 2016年10月9日,上午12:04,caiqiang <[hidden email]> 写道:
>
> Hi All,
>  For each dataloading, we write the sorted temp files into only one different local directory. I think this is a bottle neck of dataloading. It is neccessary to use multi local directorys in multi disks for each dataloading to improve dataloading performance.