[GitHub] [carbondata] niuge01 opened a new pull request #3542: [CARBONDATA-3640] Insert from stage command support partition table

classic Classic list List threaded Threaded
41 messages Options
123
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] niuge01 commented on a change in pull request #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
niuge01 commented on a change in pull request #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#discussion_r361839173
 
 

 ##########
 File path: integration/spark2/src/main/scala/org/apache/spark/sql/execution/command/management/CarbonLoadDataCommand.scala
 ##########
 @@ -898,12 +896,32 @@ case class CarbonLoadDataCommand(
       sortScope: SortScopeOptions.SortScope,
       isDataFrame: Boolean): (LogicalPlan, Int, Option[RDD[InternalRow]]) = {
     // Converts the data as per the loading steps before give it to writer or sorter
-    val updatedRdd = convertData(
+    val convertedRdd = convertData(
       rdd,
       sparkSession,
       loadModel,
       isDataFrame,
       partitionValues)
+    val updatedRdd = if (isDataFrame) {
+      val columnCount = loadModel.getCsvHeaderColumns.length
+      convertedRdd.map{ row =>
 
 Review comment:
   done

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] niuge01 commented on a change in pull request #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
niuge01 commented on a change in pull request #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#discussion_r361839187
 
 

 ##########
 File path: integration/spark2/src/main/scala/org/apache/spark/sql/execution/command/management/CarbonInsertFromStageCommand.scala
 ##########
 @@ -375,5 +466,37 @@ case class CarbonInsertFromStageCommand(
     }
   }
 
+  /**
+   * create DataFrame basing on specified splits
+   */
+  private def createInputDataFrameOfInternalRow(
+      sparkSession: SparkSession,
+      carbonTable: CarbonTable,
+      splits: Seq[InputSplit]
+    ): DataFrame = {
+    val columns = carbonTable
+      .getCreateOrderColumn
+      .asScala
+      .map(_.getColName)
+      .toArray
+    val schema = SparkTypeConverter.createSparkSchema(carbonTable, columns)
+    val rdd: RDD[Row] = new CarbonScanRDD[InternalRow](
+      sparkSession,
+      columnProjection = new CarbonProjection(columns),
+      null,
+      carbonTable.getAbsoluteTableIdentifier,
+      carbonTable.getTableInfo.serialize,
+      carbonTable.getTableInfo,
+      new CarbonInputMetrics,
+      null,
+      null,
+      classOf[SparkRowReadSupportImpl],
+      splits.asJava)
 
 Review comment:
   done

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] niuge01 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
niuge01 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569490908
 
 
   please test this

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569493086
 
 
   Build Failed with Spark 2.2.1, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.2/1337/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569493087
 
 
   Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/1327/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569505754
 
 
   Build Failed  with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/1350/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] niuge01 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
niuge01 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569508231
 
 
   please test this

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] jackylk commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
jackylk commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569514548
 
 
   retest this please

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569520595
 
 
   Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/1331/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569521508
 
 
   Build Failed with Spark 2.2.1, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.2/1341/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569521526
 
 
   Build Failed  with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/1354/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] jackylk commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
jackylk commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569561110
 
 
   retest this please

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] niuge01 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
niuge01 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569561408
 
 
   retest this please

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] niuge01 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
niuge01 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569563650
 
 
   retest this please

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569565211
 
 
   Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/1339/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569568822
 
 
   Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/1337/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569570759
 
 
   Build Success with Spark 2.1.0, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.1/1341/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569575953
 
 
   Build Success with Spark 2.2.1, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.2/1351/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
CarbonDataQA1 commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569576691
 
 
   Build Success with Spark 2.3.4, Please check CI http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/1363/
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
Reply | Threaded
Open this post in threaded view
|

[GitHub] [carbondata] jackylk commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table

GitBox
In reply to this post by GitBox
jackylk commented on issue #3542: [CARBONDATA-3640] Insert from stage command support partition table
URL: https://github.com/apache/carbondata/pull/3542#issuecomment-569576979
 
 
   LGTM

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[hidden email]


With regards,
Apache Git Services
123