[GitHub] [carbondata] ajantha-bhat opened a new pull request #3673: [CARBONDATA-3744] Fix select query failure issue when warehouse directory is default (not configured) in cluster

classic Classic list List threaded Threaded
4 messages Options
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] ajantha-bhat opened a new pull request #3673: [CARBONDATA-3744] Fix select query failure issue when warehouse directory is default (not configured) in cluster

GitBox
ajantha-bhat opened a new pull request #3673: [CARBONDATA-3744] Fix select query failure issue when warehouse directory is default (not configured) in cluster
URL: https://github.com/apache/carbondata/pull/3673
 
 
    ### Why is this PR needed?
   select query fails when warehouse directory is default (not configured) with below callstak.
   
   ```
   0: jdbc:hive2://localhost:10000> create table ab(age int) stored as carbondata;
   ---------+
   Result
   ---------+
   ---------+
   No rows selected (0.093 seconds)
   0: jdbc:hive2://localhost:10000> select count from ab;
   Error: org.apache.spark.sql.catalyst.analysis.NoSuchTableException: Table or view 'ab' not found in database 'tpch'; (state=,code=0)
   
   caused by
   java.io.FileNotFoundException: File hdfs://localhost:54311/home/root1/tools/spark-2.3.4-bin-hadoop2.7/spark-warehouse/tpch.db/ab/Metadata does not exist.
   ```
   
    ### What changes were proposed in this PR?
   When the spark.sql.warehouse.dir is not configured, default local file system SPARK_HOME is used. But the describe table shows with HDFS prefix in cluster.
   
   Reason is we are removing the local filesystem scheme , so when table path is read we add HDFS prefix in cluster. instead if we keep the scheme issue will not come.    
   
   
    ### Does this PR introduce any user interface change?
    - No
   
    ### Is any new testcase added?
    - No. Happens only in cluster with HDFS or OBS.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3673: [CARBONDATA-3744] Fix select query failure issue when warehouse directory is default (not configured) in cluster

GitBox
CarbonDataQA1 commented on issue #3673: [CARBONDATA-3744] Fix select query failure issue when warehouse directory is default (not configured) in cluster
URL: https://github.com/apache/carbondata/pull/3673#issuecomment-600504030
 
 
   Build Failed  with Spark 2.4.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.4/795/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3673: [CARBONDATA-3744] Fix select query failure issue when warehouse directory is default (not configured) in cluster

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3673: [CARBONDATA-3744] Fix select query failure issue when warehouse directory is default (not configured) in cluster
URL: https://github.com/apache/carbondata/pull/3673#issuecomment-600504125
 
 
   Build Failed  with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/2500/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] ajantha-bhat closed pull request #3673: [CARBONDATA-3744] Fix select query failure issue when warehouse directory is default (not configured) in cluster

GitBox
In reply to this post by GitBox
ajantha-bhat closed pull request #3673: [CARBONDATA-3744] Fix select query failure issue when warehouse directory is default (not configured) in cluster
URL: https://github.com/apache/carbondata/pull/3673
 
 
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services