[GitHub] carbondata pull request #1713: [CARBONDATA-1899] Optimize CarbonData concurr...

classic Classic list List threaded Threaded
127 messages Options
1234567
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata pull request #1713: [CARBONDATA-1899] Optimize CarbonData concurr...

qiuchenjian-2
Github user jackylk commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/1713#discussion_r164734978
 
    --- Diff: examples/spark2/src/main/scala/org/apache/carbondata/benchmark/ConcurrencyQueryBenchmark.scala ---
    @@ -15,47 +15,63 @@
      * limitations under the License.
      */
     
    -package org.apache.carbondata.examples
    +package org.apache.carbondata.benchmark
     
     import java.io.File
    +import java.text.SimpleDateFormat
     import java.util
    +import java.util.Date
     import java.util.concurrent.{Callable, Executors, Future, TimeUnit}
     
     import scala.util.Random
     
     import org.apache.spark.sql.{DataFrame, Row, SaveMode, SparkSession}
     import org.apache.spark.sql.types._
     
    -import org.apache.carbondata.core.constants.CarbonCommonConstants
    +import org.apache.carbondata.core.constants.{CarbonCommonConstants, CarbonVersionConstants}
     import org.apache.carbondata.core.util.{CarbonProperties, CarbonUtil}
     
     // scalastyle:off println
    -object ConcurrencyTest {
    -
    -  var totalNum = 100 * 1000 * 1000
    -  var ThreadNum = 16
    -  var TaskNum = 100
    -  var ResultIsEmpty = true
    -  val cardinalityId = 10000 * 10000
    +object ConcurrencyQueryBenchmark {
    --- End diff --
   
    Change to ConcurrentQueryBenchmark


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata pull request #1713: [CARBONDATA-1899] Optimize CarbonData concurr...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user jackylk commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/1713#discussion_r164735139
 
    --- Diff: examples/spark2/src/main/scala/org/apache/carbondata/benchmark/ConcurrencyQueryBenchmark.scala ---
    @@ -15,47 +15,63 @@
      * limitations under the License.
      */
     
    -package org.apache.carbondata.examples
    +package org.apache.carbondata.benchmark
     
     import java.io.File
    +import java.text.SimpleDateFormat
     import java.util
    +import java.util.Date
     import java.util.concurrent.{Callable, Executors, Future, TimeUnit}
     
     import scala.util.Random
     
     import org.apache.spark.sql.{DataFrame, Row, SaveMode, SparkSession}
     import org.apache.spark.sql.types._
     
    -import org.apache.carbondata.core.constants.CarbonCommonConstants
    +import org.apache.carbondata.core.constants.{CarbonCommonConstants, CarbonVersionConstants}
     import org.apache.carbondata.core.util.{CarbonProperties, CarbonUtil}
     
     // scalastyle:off println
    -object ConcurrencyTest {
    -
    -  var totalNum = 100 * 1000 * 1000
    -  var ThreadNum = 16
    -  var TaskNum = 100
    -  var ResultIsEmpty = true
    -  val cardinalityId = 10000 * 10000
    +object ConcurrencyQueryBenchmark {
    +
    +  // generate number of data
    +  var totalNum = 1 * 1000 * 1000
    --- End diff --
   
    For all these variable, why not use val


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata pull request #1713: [CARBONDATA-1899] Optimize CarbonData concurr...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user jackylk commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/1713#discussion_r164735414
 
    --- Diff: examples/spark2/src/main/scala/org/apache/carbondata/benchmark/ConcurrencyQueryBenchmark.scala ---
    @@ -15,47 +15,63 @@
      * limitations under the License.
      */
     
    -package org.apache.carbondata.examples
    +package org.apache.carbondata.benchmark
     
     import java.io.File
    +import java.text.SimpleDateFormat
     import java.util
    +import java.util.Date
     import java.util.concurrent.{Callable, Executors, Future, TimeUnit}
     
     import scala.util.Random
     
     import org.apache.spark.sql.{DataFrame, Row, SaveMode, SparkSession}
     import org.apache.spark.sql.types._
     
    -import org.apache.carbondata.core.constants.CarbonCommonConstants
    +import org.apache.carbondata.core.constants.{CarbonCommonConstants, CarbonVersionConstants}
     import org.apache.carbondata.core.util.{CarbonProperties, CarbonUtil}
     
     // scalastyle:off println
    -object ConcurrencyTest {
    -
    -  var totalNum = 100 * 1000 * 1000
    -  var ThreadNum = 16
    -  var TaskNum = 100
    -  var ResultIsEmpty = true
    -  val cardinalityId = 10000 * 10000
    +object ConcurrencyQueryBenchmark {
    --- End diff --
   
    Can you explain in comment what will print out in this benchmark


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user ravipesala commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3221/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata pull request #1713: [CARBONDATA-1899] Optimize CarbonData concurr...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user xubo245 commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/1713#discussion_r164929882
 
    --- Diff: examples/spark2/src/main/scala/org/apache/carbondata/benchmark/ConcurrencyQueryBenchmark.scala ---
    @@ -15,47 +15,63 @@
      * limitations under the License.
      */
     
    -package org.apache.carbondata.examples
    +package org.apache.carbondata.benchmark
     
     import java.io.File
    +import java.text.SimpleDateFormat
     import java.util
    +import java.util.Date
     import java.util.concurrent.{Callable, Executors, Future, TimeUnit}
     
     import scala.util.Random
     
     import org.apache.spark.sql.{DataFrame, Row, SaveMode, SparkSession}
     import org.apache.spark.sql.types._
     
    -import org.apache.carbondata.core.constants.CarbonCommonConstants
    +import org.apache.carbondata.core.constants.{CarbonCommonConstants, CarbonVersionConstants}
     import org.apache.carbondata.core.util.{CarbonProperties, CarbonUtil}
     
     // scalastyle:off println
    -object ConcurrencyTest {
    -
    -  var totalNum = 100 * 1000 * 1000
    -  var ThreadNum = 16
    -  var TaskNum = 100
    -  var ResultIsEmpty = true
    -  val cardinalityId = 10000 * 10000
    +object ConcurrencyQueryBenchmark {
    +
    +  // generate number of data
    +  var totalNum = 1 * 1000 * 1000
    --- End diff --
   
    Because user can change the variable in shell script, like:
   
    xubo # cat query3.sh
   
    #!/bin/bash
    for j in 100000000
    do
    for k in {1..32}
    do
    for l in true
    do
    for i in {1..8}
    do
    echo  'num,executors,resultIsEmpty,times'
    echo $j
    echo $k
    echo $l
    echo $i
   
    spark-submit \
       --class org.apache.carbondata.examples.ConcurrencyTest \
       --master  yarn \
       --deploy-mode client \
       --driver-memory 16g \
       --executor-cores $k \
       --executor-memory 24g \
       --num-executors 3  \
       concurrencyTest.jar \
       $j 16 100 $l '/tmp/carbondata' false false false
    done
    done
    done
    done



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata pull request #1713: [CARBONDATA-1899] Optimize CarbonData concurr...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user xubo245 commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/1713#discussion_r164933914
 
    --- Diff: examples/spark2/src/main/scala/org/apache/carbondata/benchmark/ConcurrencyQueryBenchmark.scala ---
    @@ -15,47 +15,63 @@
      * limitations under the License.
      */
     
    -package org.apache.carbondata.examples
    +package org.apache.carbondata.benchmark
     
     import java.io.File
    +import java.text.SimpleDateFormat
     import java.util
    +import java.util.Date
     import java.util.concurrent.{Callable, Executors, Future, TimeUnit}
     
     import scala.util.Random
     
     import org.apache.spark.sql.{DataFrame, Row, SaveMode, SparkSession}
     import org.apache.spark.sql.types._
     
    -import org.apache.carbondata.core.constants.CarbonCommonConstants
    +import org.apache.carbondata.core.constants.{CarbonCommonConstants, CarbonVersionConstants}
     import org.apache.carbondata.core.util.{CarbonProperties, CarbonUtil}
     
     // scalastyle:off println
    -object ConcurrencyTest {
    -
    -  var totalNum = 100 * 1000 * 1000
    -  var ThreadNum = 16
    -  var TaskNum = 100
    -  var ResultIsEmpty = true
    -  val cardinalityId = 10000 * 10000
    +object ConcurrencyQueryBenchmark {
    +
    +  // generate number of data
    +  var totalNum = 1 * 1000 * 1000
    --- End diff --
   
    now the class is 'org.apache.carbondata.benchmark.ConcurrentQueryBenchmark'


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata pull request #1713: [CARBONDATA-1899] Optimize CarbonData concurr...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user xubo245 commented on a diff in the pull request:

    https://github.com/apache/carbondata/pull/1713#discussion_r164938810
 
    --- Diff: examples/spark2/src/main/scala/org/apache/carbondata/benchmark/ConcurrencyQueryBenchmark.scala ---
    @@ -15,47 +15,63 @@
      * limitations under the License.
      */
     
    -package org.apache.carbondata.examples
    +package org.apache.carbondata.benchmark
     
     import java.io.File
    +import java.text.SimpleDateFormat
     import java.util
    +import java.util.Date
     import java.util.concurrent.{Callable, Executors, Future, TimeUnit}
     
     import scala.util.Random
     
     import org.apache.spark.sql.{DataFrame, Row, SaveMode, SparkSession}
     import org.apache.spark.sql.types._
     
    -import org.apache.carbondata.core.constants.CarbonCommonConstants
    +import org.apache.carbondata.core.constants.{CarbonCommonConstants, CarbonVersionConstants}
     import org.apache.carbondata.core.util.{CarbonProperties, CarbonUtil}
     
     // scalastyle:off println
    -object ConcurrencyTest {
    -
    -  var totalNum = 100 * 1000 * 1000
    -  var ThreadNum = 16
    -  var TaskNum = 100
    -  var ResultIsEmpty = true
    -  val cardinalityId = 10000 * 10000
    +object ConcurrencyQueryBenchmark {
    --- End diff --
   
    ok, I add some annotation for ConcurrentQueryBenchmark


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    Build Failed  with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3303/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2066/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user xubo245 commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    retest this please


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    Build Failed  with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3359/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2122/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user xubo245 commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    retest this please


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2148/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3385/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user xubo245 commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    @jackylk @sraghunandan @ravipesala Please review it.


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user xubo245 commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    retest this please


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/2393/



---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user xubo245 commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    retest this please


---
Reply | Threaded
Open this post in threaded view
|

[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

qiuchenjian-2
In reply to this post by qiuchenjian-2
Github user CarbonDataQA commented on the issue:

    https://github.com/apache/carbondata/pull/1713
 
    Build Failed  with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3634/



---
1234567