Apache CarbonData Dev Mailing List archive › Apache CarbonData JIRA issues

[GitHub] carbondata pull request #2693: [CARBONDATA-2915] Reformat Documentation of C...

Classic

List

Threaded

36 messages Options

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/2693

Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.1/143/

---

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

In reply to this post by qiuchenjian-2

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

In reply to this post by qiuchenjian-2

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

In reply to this post by qiuchenjian-2

Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/2693

Build Failed with Spark 2.3.1, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder2.3/8385/

---

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

In reply to this post by qiuchenjian-2

Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/2693

Build Failed with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/314/

---

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

In reply to this post by qiuchenjian-2

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

In reply to this post by qiuchenjian-2

Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/2693

Build Success with Spark 2.2.1, Please check CI http://95.216.28.178:8080/job/ApacheCarbonPRBuilder1/320/

---

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

In reply to this post by qiuchenjian-2

qiuchenjian-2

[GitHub] carbondata pull request #2693: [CARBONDATA-2915] Reformat Documentation of C...

In reply to this post by qiuchenjian-2

Github user xuchuanyin commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2693#discussion_r215976048

--- Diff: docs/configuration-parameters.md ---
@@ -70,7 +70,7 @@ This section provides the details of all the configurations required for the Car
| carbon.enable.calculate.size | true | **For Load Operation**: Setting this property calculates the size of the carbon data file (.carbondata) and carbon index file (.carbonindex) for every load and updates the table status file. **For Describe Formatted**: Setting this property calculates the total size of the carbon data files and carbon index files for the respective table and displays in describe formatted command.**NOTE:** This is useful to determine the overall size of the carbondata table and also get an idea of how the table is growing in order to take up other backup strategy decisions. |
| carbon.cutOffTimestamp | (none) | CarbonData has capability to generate the Dictionary values for the timestamp columns from the data itself without the need to store the computed dictionary values. This configuration sets the start date for calculating the timestamp. Java counts the number of milliseconds from start of "1970-01-01 00:00:00". This property is used to customize the start of position. For example "2000-01-01 00:00:00". **NOTE:** The date must be in the form ***carbon.timestamp.format***. CarbonData supports storing data for upto 68 years.For example, if the cut-off time is 1970-01-01 05:30:00, then data upto 2038-01-01 05:30:00 will be supported by CarbonData. |
| carbon.timegranularity | SECOND | The configuration is used to specify the data granularity level such as DAY, HOUR, MINUTE, or SECOND.This helps to store more than 68 years of data into CarbonData. |
-| carbon.use.local.dir | false | CarbonData during data loading, writes files to local temp directories before copying the files to HDFS.This configuration is used to specify whether CarbonData can write locally to tmp directory of the container or to the YARN application directory. |
+| carbon.use.local.dir | false | CarbonData,during data loading, writes files to local temp directories before copying the files to HDFS.This configuration is used to specify whether CarbonData can write locally to tmp directory of the container or to the YARN application directory. |
| carbon.use.multiple.temp.dir | false | When multiple disks are present in the system, YARN is generally configured with multiple disks to be used as temp directories for managing the containers.This configuration specifies whether to use multiple YARN local directories during data loading for disk IO load balancing.Enable ***carbon.use.local.dir*** for this configuration to take effect.**NOTE:** Data Loading is an IO intensive operation whose performance can be limited by the disk IO threshold, particularly during multi table concurrent data load.Configuring this parameter, balances the disk IO across multiple disks there by improving the over all load performance. |
--- End diff --

I think this can be turned to enable along with the previous configuration for the beginners to achieve better performance.

---

qiuchenjian-2

[GitHub] carbondata pull request #2693: [CARBONDATA-2915] Reformat Documentation of C...

In reply to this post by qiuchenjian-2

Github user xuchuanyin commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2693#discussion_r215974593

--- Diff: docs/configuration-parameters.md ---
@@ -16,7 +16,7 @@
-->

# Configuring CarbonData
- This guide explains the configurations that can be used to tune CarbonData to achieve better performance.Some of the properties can be set dynamically and are explained in the section Dynamic Configuration In CarbonData Using SET-RESET.Most of the properties that control the internal settings have reasonable default values.They are listed along with the properties along with explanation.
+ This guide explains the configurations that can be used to tune CarbonData to achieve better performance.Most of the properties that control the internal settings have reasonable default values.They are listed along with the properties along with explanation.
--- End diff --

Need a space before each sentence.

---

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

In reply to this post by qiuchenjian-2

Github user xuchuanyin commented on the issue:

https://github.com/apache/carbondata/pull/2693

I found many sentences are not start with a space to make a clear gap with the previous one and I think it should be optimized. Otherwise it will to too trivial and minor to raise other PRs.

---

qiuchenjian-2

[GitHub] carbondata pull request #2693: [CARBONDATA-2915] Reformat Documentation of C...

In reply to this post by qiuchenjian-2

Github user sraghunandan commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2693#discussion_r215979866

--- Diff: docs/configuration-parameters.md ---
@@ -16,7 +16,7 @@
-->

# Configuring CarbonData
- This guide explains the configurations that can be used to tune CarbonData to achieve better performance.Some of the properties can be set dynamically and are explained in the section Dynamic Configuration In CarbonData Using SET-RESET.Most of the properties that control the internal settings have reasonable default values.They are listed along with the properties along with explanation.
+ This guide explains the configurations that can be used to tune CarbonData to achieve better performance.Most of the properties that control the internal settings have reasonable default values.They are listed along with the properties along with explanation.
--- End diff --

what do you mean? i didn't get you

---

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

In reply to this post by qiuchenjian-2

Github user sraghunandan commented on the issue:

https://github.com/apache/carbondata/pull/2693

is it a mark down rule? I'm unaware of it. what is the need for a space?

---

qiuchenjian-2

[GitHub] carbondata pull request #2693: [CARBONDATA-2915] Reformat Documentation of C...

In reply to this post by qiuchenjian-2

Github user sraghunandan commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2693#discussion_r215992608

--- Diff: docs/configuration-parameters.md ---
@@ -70,7 +70,7 @@ This section provides the details of all the configurations required for the Car
| carbon.enable.calculate.size | true | **For Load Operation**: Setting this property calculates the size of the carbon data file (.carbondata) and carbon index file (.carbonindex) for every load and updates the table status file. **For Describe Formatted**: Setting this property calculates the total size of the carbon data files and carbon index files for the respective table and displays in describe formatted command.**NOTE:** This is useful to determine the overall size of the carbondata table and also get an idea of how the table is growing in order to take up other backup strategy decisions. |
| carbon.cutOffTimestamp | (none) | CarbonData has capability to generate the Dictionary values for the timestamp columns from the data itself without the need to store the computed dictionary values. This configuration sets the start date for calculating the timestamp. Java counts the number of milliseconds from start of "1970-01-01 00:00:00". This property is used to customize the start of position. For example "2000-01-01 00:00:00". **NOTE:** The date must be in the form ***carbon.timestamp.format***. CarbonData supports storing data for upto 68 years.For example, if the cut-off time is 1970-01-01 05:30:00, then data upto 2038-01-01 05:30:00 will be supported by CarbonData. |
| carbon.timegranularity | SECOND | The configuration is used to specify the data granularity level such as DAY, HOUR, MINUTE, or SECOND.This helps to store more than 68 years of data into CarbonData. |
-| carbon.use.local.dir | false | CarbonData during data loading, writes files to local temp directories before copying the files to HDFS.This configuration is used to specify whether CarbonData can write locally to tmp directory of the container or to the YARN application directory. |
+| carbon.use.local.dir | false | CarbonData,during data loading, writes files to local temp directories before copying the files to HDFS.This configuration is used to specify whether CarbonData can write locally to tmp directory of the container or to the YARN application directory. |
| carbon.use.multiple.temp.dir | false | When multiple disks are present in the system, YARN is generally configured with multiple disks to be used as temp directories for managing the containers.This configuration specifies whether to use multiple YARN local directories during data loading for disk IO load balancing.Enable ***carbon.use.local.dir*** for this configuration to take effect.**NOTE:** Data Loading is an IO intensive operation whose performance can be limited by the disk IO threshold, particularly during multi table concurrent data load.Configuring this parameter, balances the disk IO across multiple disks there by improving the over all load performance. |
--- End diff --

you mean make it true by default? i think thats not the scope of this PR

---

qiuchenjian-2

[GitHub] carbondata issue #2693: [CARBONDATA-2915] Reformat Documentation of CarbonDa...

In reply to this post by qiuchenjian-2

Github user chenliang613 commented on the issue:

https://github.com/apache/carbondata/pull/2693

LGTM

---

qiuchenjian-2

[GitHub] carbondata pull request #2693: [CARBONDATA-2915] Reformat Documentation of C...

In reply to this post by qiuchenjian-2

Github user asfgit closed the pull request at:

https://github.com/apache/carbondata/pull/2693

---