[GitHub] incubator-carbondata pull request #398: [CARBONDATA-400] Error message for d...


qiuchenjian-2
GitHub user akashrn5 opened a pull request:

    https://github.com/apache/incubator-carbondata/pull/398

    [CARBONDATA-400] Error message for dataload with a column having more than 100000 characters.

    Problem: When a column value exceeds 100000 characters, the whole string appears in beeline along with the exception.
   
    Analysis: In the univocity CSV parser settings, the maximum number of characters per column is 100000. When a value exceeds that limit, a TextParsingException is thrown and
    the complete string is shown as the error in beeline during data load.
   
    Fix: A proper error message is now displayed in beeline; the complete error message and the parser settings details are written to the logs.
    Impact area: Data loading with more than 100000 characters in a single column.
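
    The fix described above amounts to separating what the client sees from what goes to the log. The following is a minimal sketch of that split, not the code from this pull request: the object name, both function names, and the bounded preview in the log message are assumptions made for illustration. Only the 100000-character limit comes from the description.

```scala
object LoadErrorMessages {
  // Per-column limit quoted in the PR description.
  val MaxCharsPerColumn = 100000

  // Short message for the client: names the column and the limit, never the value itself.
  def userMessage(columnIndex: Int, actualLength: Int): String =
    s"Data load failed: column $columnIndex has $actualLength characters, " +
      s"which exceeds the limit of $MaxCharsPerColumn characters per column."

  // Message for the log: same text plus a bounded preview of the offending value.
  // (The preview length is an illustrative choice, not something the PR specifies.)
  def logMessage(columnIndex: Int, value: String, previewChars: Int = 50): String =
    userMessage(columnIndex, value.length) +
      s" Value starts with: '${value.take(previewChars)}...'"
}
```

    Because userMessage takes only the column index and the length, the oversized value cannot reach the client by accident.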

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/akashrn5/incubator-carbondata br_message_correction

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/incubator-carbondata/pull/398.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #398
   
----
commit 8a93e5f93000d124ae2fa4962b560fea0dbfcbaf
Author: Akash R Nilugal <[hidden email]>
Date:   2016-12-05T09:46:10Z

    [CARBONDATA-400]

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [hidden email] or file a JIRA ticket
with INFRA.
---

[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
Github user ravipesala commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    Can one of the admins verify this patch?


[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user ravipesala commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    ok to test


[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user ravipesala commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
   
    Refer to this link for build results (access rights to CI server needed):
    http://136.243.101.176:8080//job/ApacheCarbonPRBuilder/28/
    Build Success, Please check CI http://136.243.101.176:8080//28



[GitHub] incubator-carbondata pull request #398: [CARBONDATA-400] Error message for d...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user gvramana commented on a diff in the pull request:

    https://github.com/apache/incubator-carbondata/pull/398#discussion_r91021993
 
    --- Diff: integration/spark-common/src/main/scala/org/apache/carbondata/spark/util/GlobalDictionaryUtil.scala ---
    @@ -836,8 +836,15 @@ object GlobalDictionaryUtil {
           }
         } catch {
           case ex: Exception =>
    -        LOGGER.error(ex, "generate global dictionary failed")
    -        throw ex
    +        ex match {
    +          case spx: SparkException =>
    +            LOGGER.error(spx, "generate global dictionary failed")
    +            throw new Exception(
    +              "Global dictionary generation failed. Please check the logs.")
    --- End diff --
   
    We cannot handle SparkException generically here, and we should avoid telling users to "check the logs". Instead, handle the specific TextParsingException where the parsing happens and compose the required error message there.
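
    One way to read this suggestion is sketched below, under stated assumptions: ParseLimitException is a hypothetical stand-in for the parser's exception (it is not the univocity class), and parseField and parseOrExplain are illustrative names that do not appear in CarbonData. The catch sits at the parse step and matches only the parser's exception, so unrelated failures keep their original type and message.

```scala
// Hypothetical stand-in for the CSV parser's exception; not the real univocity class.
final class ParseLimitException(val columnIndex: Int, msg: String)
  extends RuntimeException(msg)

object NarrowCatch {
  val Limit = 100000

  // Simulated parse step that fails the way a length-limited parser would,
  // embedding the raw value in the exception message.
  def parseField(columnIndex: Int, raw: String): String = {
    require(raw != null, "raw value must not be null")
    if (raw.length > Limit)
      throw new ParseLimitException(columnIndex, s"limit exceeded by: $raw")
    raw
  }

  // Catches only the parser's exception, at the parse site, and replaces its message.
  // Anything else (here, IllegalArgumentException from require) propagates unchanged.
  def parseOrExplain(columnIndex: Int, raw: String): String =
    try parseField(columnIndex, raw)
    catch {
      case e: ParseLimitException =>
        throw new IllegalStateException(
          s"Column ${e.columnIndex} exceeds the limit of $Limit characters.")
    }
}
```

    The original exception is deliberately not attached as the cause in this sketch, since its message carries the oversized value; real code would log it before rethrowing.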


[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    Build Success with Spark 1.5.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/63/



[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    Build Success with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/667/



[GitHub] incubator-carbondata pull request #398: [CARBONDATA-400] Error message for d...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user akashrn5 closed the pull request at:

    https://github.com/apache/incubator-carbondata/pull/398


[GitHub] incubator-carbondata pull request #398: [CARBONDATA-400] Error message for d...

qiuchenjian-2
In reply to this post by qiuchenjian-2
GitHub user akashrn5 reopened a pull request:

    https://github.com/apache/incubator-carbondata/pull/398

----
commit dbe4c5ea048ed943b9b841acf697316be82232cc
Author: Akash R Nilugal <[hidden email]>
Date:   2016-12-05T09:46:10Z

    [CARBONDATA-400]

----


[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    Can one of the admins verify this patch?


[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    Can one of the admins verify this patch?


[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    Build Failed with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/683/



[GitHub] incubator-carbondata pull request #398: [CARBONDATA-400] Error message for d...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user akashrn5 closed the pull request at:

    https://github.com/apache/incubator-carbondata/pull/398


[GitHub] incubator-carbondata pull request #398: [CARBONDATA-400] Error message for d...

qiuchenjian-2
In reply to this post by qiuchenjian-2
GitHub user akashrn5 reopened a pull request:

    https://github.com/apache/incubator-carbondata/pull/398

----
commit a303d30b07d80aec5e2d3c4526145c96a1e28b9f
Author: Akash R Nilugal <[hidden email]>
Date:   2016-12-05T09:46:10Z

    [CARBONDATA-400]

----


[GitHub] incubator-carbondata pull request #398: [CARBONDATA-400] Error message for d...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user akashrn5 closed the pull request at:

    https://github.com/apache/incubator-carbondata/pull/398


[GitHub] incubator-carbondata pull request #398: [CARBONDATA-400] Error message for d...

qiuchenjian-2
In reply to this post by qiuchenjian-2
GitHub user akashrn5 reopened a pull request:

    https://github.com/apache/incubator-carbondata/pull/398

----

----


[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    Build Success with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/703/



[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    Build Success with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/1216/



[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    Build Success with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/1442/



[GitHub] incubator-carbondata issue #398: [CARBONDATA-400] Error message for dataload...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/incubator-carbondata/pull/398
 
    Build Success with Spark 1.6.2, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/1443/


