Indhumathi27 opened a new pull request #3735: URL: https://github.com/apache/carbondata/pull/3735 ### Why is this PR needed? Fix spelling, link and ddl in SI and MV Documentation ### What changes were proposed in this PR? Fixed spelling, link and ddl in SI and MV Documentation ### Does this PR introduce any user interface change? - No ### Is any new testcase added? - No ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
CarbonDataQA1 commented on pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#issuecomment-623115343 ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
CarbonDataQA1 commented on pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#issuecomment-623305852 Build Success with Spark 2.4.5, Please check CI http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1199/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
CarbonDataQA1 commented on pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#issuecomment-623306471 Build Success with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/2918/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
chetandb commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419320498 ########## File path: docs/index/bloomfilter-index-guide.md ########## @@ -96,7 +96,7 @@ User can create BloomFilter Index using the Create Index DDL: PROPERTIES ('BLOOM_SIZE'='640000', 'BLOOM_FPP'='0.00001', 'BLOOM_COMPRESS'='true') ``` -Here, (name,id) are INDEX_COLUMNS. Carbondata will generate BloomFilter index on these columns. Queries on these columns are usually like 'COL = VAL'. +Here, (name,id) are INDEX_COLUMNS. Carbondata will generate BloomFilter index on these columns. Queries on these columns are usually like `'COL = VAL'`. Review comment: 'COL = VAL' is fine. Why need to add another ' at start and end ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
chetandb commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419321134 ########## File path: docs/index/lucene-index-guide.md ########## @@ -85,20 +85,20 @@ index_columns(String Columns) given in CREATE statement which contains informati location of index_columns. These index files will be written inside a folder named with index name inside each segment folders. Review comment: Change to "inside each segment folder" ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419322342 ########## File path: docs/index/bloomfilter-index-guide.md ########## @@ -96,7 +96,7 @@ User can create BloomFilter Index using the Create Index DDL: PROPERTIES ('BLOOM_SIZE'='640000', 'BLOOM_FPP'='0.00001', 'BLOOM_COMPRESS'='true') ``` -Here, (name,id) are INDEX_COLUMNS. Carbondata will generate BloomFilter index on these columns. Queries on these columns are usually like 'COL = VAL'. +Here, (name,id) are INDEX_COLUMNS. Carbondata will generate BloomFilter index on these columns. Queries on these columns are usually like `'COL = VAL'`. Review comment: to highlight it in the document ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419322342 ########## File path: docs/index/bloomfilter-index-guide.md ########## @@ -96,7 +96,7 @@ User can create BloomFilter Index using the Create Index DDL: PROPERTIES ('BLOOM_SIZE'='640000', 'BLOOM_FPP'='0.00001', 'BLOOM_COMPRESS'='true') ``` -Here, (name,id) are INDEX_COLUMNS. Carbondata will generate BloomFilter index on these columns. Queries on these columns are usually like 'COL = VAL'. +Here, (name,id) are INDEX_COLUMNS. Carbondata will generate BloomFilter index on these columns. Queries on these columns are usually like `'COL = VAL'`. Review comment: Added to highlight it in the document ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
chetandb commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419326273 ########## File path: docs/index/secondary-index-guide.md ########## @@ -159,7 +159,7 @@ with in WHERE clause. ``` SELECT country, sex from sales where NI(user_id = 'xxx') ``` -The above query ignores column user_id from secondary index and fetch data from main table. +The above query ignores column `user_id` from secondary index and fetch data from the main table. Review comment: Change fetch to fetches ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
chetandb commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419329554 ########## File path: docs/mv-guide.md ########## @@ -207,15 +207,15 @@ materialized view, if not, the operation is allowed, otherwise operation will be rejected by throwing exception. 3. Partition management command: `ALTER TABLE ADD/DROP PARTITION`. Note that dropping a partition - will be allowed only if partition is participating in all indexes associated with fact table. + will be allowed only if partition is participating in all materialized views associated with fact table. Review comment: "if partition" can be changed to "if the partition" ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419349795 ########## File path: docs/mv-guide.md ########## @@ -207,15 +207,15 @@ materialized view, if not, the operation is allowed, otherwise operation will be rejected by throwing exception. 3. Partition management command: `ALTER TABLE ADD/DROP PARTITION`. Note that dropping a partition - will be allowed only if partition is participating in all indexes associated with fact table. + will be allowed only if partition is participating in all materialized views associated with fact table. Review comment: changed ########## File path: docs/index/secondary-index-guide.md ########## @@ -159,7 +159,7 @@ with in WHERE clause. ``` SELECT country, sex from sales where NI(user_id = 'xxx') ``` -The above query ignores column user_id from secondary index and fetch data from main table. +The above query ignores column `user_id` from secondary index and fetch data from the main table. Review comment: changed ########## File path: docs/index/lucene-index-guide.md ########## @@ -85,20 +85,20 @@ index_columns(String Columns) given in CREATE statement which contains informati location of index_columns. These index files will be written inside a folder named with index name inside each segment folders. Review comment: changed ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
CarbonDataQA1 commented on pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#issuecomment-623444233 Build Success with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/2930/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
CarbonDataQA1 commented on pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#issuecomment-623447947 Build Success with Spark 2.4.5, Please check CI http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1212/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
akashrn5 commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419871337 ########## File path: docs/index/lucene-index-guide.md ########## @@ -83,28 +84,28 @@ It will show all Indexes created on main table. When loading data to main table, lucene index files will be generated for all the index_columns(String Columns) given in CREATE statement which contains information about the data location of index_columns. These index files will be written inside a folder named with index name -inside each segment folders. +inside each segment folder. -A system level configuration carbon.lucene.compression.mode can be added for best compression of +A system level configuration `carbon.lucene.compression.mode` can be added for best compression of lucene index files. The default value is speed, where the index writing speed will be more. If the value is compression, the index file size will be compressed. ## Querying data As a technique for query acceleration, Lucene indexes cannot be queried directly. -Queries are to be made on main table. when a query with TEXT_MATCH('name:c10') or +Queries are to be made on main table. When a query with TEXT_MATCH('name:c10') or Review comment: ```suggestion Queries are to be made on the main table. When a query with TEXT_MATCH('name:c10') or ``` ########## File path: docs/index/index-management.md ########## @@ -51,15 +51,15 @@ Currently, there are 3 Index implementations in CarbonData. There are two kinds of management semantic for Index. -1. Automatic Refresh: Create index without `WITH DEFERRED REBUILD` in the statement, which is by default. -2. Manual Refresh: Create index with `WITH DEFERRED REBUILD` in the statement +1. Automatic Refresh: Create index without `WITH DEFERRED REFRESH` in the statement, which is by default. +2. Manual Refresh: Create index with `WITH DEFERRED REFRESH` in the statement Review comment: please rewrite the 1 and 2 point for manual and automatic refresh, its not completely meaningfull. ########## File path: docs/index/index-management.md ########## @@ -78,7 +78,7 @@ We do recommend you to use this management for index. ### Manual Refresh -When user creates a index specifying manual refresh semantic, the index is created with status *disabled* and query will NOT use this index until user can issue REFRESH INDEX command to build the index. For every REFRESH INDEX command, system will trigger a full rebuild of the index. After rebuild is done, system will change index status to *enabled*, so that it can be used in query rewrite. +When user creates a index specifying manual refresh semantic, the index is created with status *disabled* and query will NOT use this index until user can issue REFRESH INDEX command to build the index. For every REFRESH INDEX command, system will trigger a full refresh of the index. After a refresh is done, system will change index status to *enabled*, so that it can be used in query rewrite. Review comment: ` After a refresh is done` to `Once the refresh operation is finished` ########## File path: docs/mv-guide.md ########## @@ -334,10 +334,10 @@ ``` Then, the above query can be rolled up from materialized view 'agg_sales', by adding hour - level time series aggregation on minute level aggregation. Users can fire explain command - to check if query is rolled up from existing materialized view. + level time series aggregation on minute level aggregation. Users can fire the EXPLAIN command Review comment: put EXPLAIN in the quotes of code ########## File path: docs/mv-guide.md ########## @@ -300,19 +300,19 @@ granularity provided during creation and stored on each segment. **NOTE**: - 1. Single select statement cannot contain time series udf(s) neither with different granularity + 1. Single select statements cannot contain time series udf(s) neither with different granularity Review comment: i think this change is not required ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419875201 ########## File path: docs/index/index-management.md ########## @@ -51,15 +51,15 @@ Currently, there are 3 Index implementations in CarbonData. There are two kinds of management semantic for Index. -1. Automatic Refresh: Create index without `WITH DEFERRED REBUILD` in the statement, which is by default. -2. Manual Refresh: Create index with `WITH DEFERRED REBUILD` in the statement +1. Automatic Refresh: Create index without `WITH DEFERRED REFRESH` in the statement, which is by default. +2. Manual Refresh: Create index with `WITH DEFERRED REFRESH` in the statement Review comment: Description is already given below ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
akashrn5 commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419876999 ########## File path: docs/index/index-management.md ########## @@ -51,15 +51,15 @@ Currently, there are 3 Index implementations in CarbonData. There are two kinds of management semantic for Index. -1. Automatic Refresh: Create index without `WITH DEFERRED REBUILD` in the statement, which is by default. -2. Manual Refresh: Create index with `WITH DEFERRED REBUILD` in the statement +1. Automatic Refresh: Create index without `WITH DEFERRED REFRESH` in the statement, which is by default. +2. Manual Refresh: Create index with `WITH DEFERRED REFRESH` in the statement Review comment: description is there, please change the existing sentence to more meaningful ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
Indhumathi27 commented on a change in pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#discussion_r419908371 ########## File path: docs/index/index-management.md ########## @@ -51,15 +51,15 @@ Currently, there are 3 Index implementations in CarbonData. There are two kinds of management semantic for Index. -1. Automatic Refresh: Create index without `WITH DEFERRED REBUILD` in the statement, which is by default. -2. Manual Refresh: Create index with `WITH DEFERRED REBUILD` in the statement +1. Automatic Refresh: Create index without `WITH DEFERRED REFRESH` in the statement, which is by default. +2. Manual Refresh: Create index with `WITH DEFERRED REFRESH` in the statement Review comment: done ########## File path: docs/index/index-management.md ########## @@ -78,7 +78,7 @@ We do recommend you to use this management for index. ### Manual Refresh -When user creates a index specifying manual refresh semantic, the index is created with status *disabled* and query will NOT use this index until user can issue REFRESH INDEX command to build the index. For every REFRESH INDEX command, system will trigger a full rebuild of the index. After rebuild is done, system will change index status to *enabled*, so that it can be used in query rewrite. +When user creates a index specifying manual refresh semantic, the index is created with status *disabled* and query will NOT use this index until user can issue REFRESH INDEX command to build the index. For every REFRESH INDEX command, system will trigger a full refresh of the index. After a refresh is done, system will change index status to *enabled*, so that it can be used in query rewrite. Review comment: done ########## File path: docs/index/lucene-index-guide.md ########## @@ -83,28 +84,28 @@ It will show all Indexes created on main table. When loading data to main table, lucene index files will be generated for all the index_columns(String Columns) given in CREATE statement which contains information about the data location of index_columns. These index files will be written inside a folder named with index name -inside each segment folders. +inside each segment folder. -A system level configuration carbon.lucene.compression.mode can be added for best compression of +A system level configuration `carbon.lucene.compression.mode` can be added for best compression of lucene index files. The default value is speed, where the index writing speed will be more. If the value is compression, the index file size will be compressed. ## Querying data As a technique for query acceleration, Lucene indexes cannot be queried directly. -Queries are to be made on main table. when a query with TEXT_MATCH('name:c10') or +Queries are to be made on main table. When a query with TEXT_MATCH('name:c10') or Review comment: done ########## File path: docs/mv-guide.md ########## @@ -300,19 +300,19 @@ granularity provided during creation and stored on each segment. **NOTE**: - 1. Single select statement cannot contain time series udf(s) neither with different granularity + 1. Single select statements cannot contain time series udf(s) neither with different granularity Review comment: done ########## File path: docs/mv-guide.md ########## @@ -334,10 +334,10 @@ ``` Then, the above query can be rolled up from materialized view 'agg_sales', by adding hour - level time series aggregation on minute level aggregation. Users can fire explain command - to check if query is rolled up from existing materialized view. + level time series aggregation on minute level aggregation. Users can fire the EXPLAIN command Review comment: done ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
CarbonDataQA1 commented on pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#issuecomment-623953590 Build Success with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/2940/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
CarbonDataQA1 commented on pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#issuecomment-623955083 Build Success with Spark 2.4.5, Please check CI http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1222/ ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
In reply to this post by GitBox
Indhumathi27 commented on pull request #3735: URL: https://github.com/apache/carbondata/pull/3735#issuecomment-623981264 @akashrn5 Please review and merge ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [hidden email] |
Free forum by Nabble | Edit this page |