Apache CarbonData Dev Mailing List archive › Apache CarbonData JIRA issues

[jira] [Commented] (CARBONDATA-212) Use SQLContext to read CarbonData file

Classic

List

Threaded

1 message

Akash R Nilugal (Jira)

[jira] [Commented] (CARBONDATA-212) Use SQLContext to read CarbonData file

[ https://issues.apache.org/jira/browse/CARBONDATA-212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15466903#comment-15466903 ]

ASF GitHub Bot commented on CARBONDATA-212:
-------------------------------------------

Github user QiangCai commented on a diff in the pull request:

https://github.com/apache/incubator-carbondata/pull/126#discussion_r77598344

--- Diff: examples/src/main/scala/org/apache/carbondata/examples/DatasourceExample.scala ---
@@ -0,0 +1,55 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements. See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License. You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.carbondata.examples
+
+import org.apache.spark.sql.{SaveMode, SQLContext}
+
+import org.apache.carbondata.examples.util.InitForExamples
+
+object DatasourceExample {
+
+ def main(args: Array[String]) {
+ // use CarbonContext to write CarbonData files
+ val cc = InitForExamples.createCarbonContext("DatasourceExample")
+ import cc.implicits._
+ val sc = cc.sparkContext
+ // create a dataframe, it can be from parquet or hive table
+ val df = sc.parallelize(1 to 1000)
+ .map(x => ("a", "b", x))
+ .toDF("c1", "c2", "c3")
+
+ // save dataframe to CarbonData files
+ df.write
+ .format("carbondata")
+ .option("tableName", "carbon1")
+ .mode(SaveMode.Overwrite)
+ .save()
+
+ // use SQLContext to read CarbonData files
+ val sqlContext = new SQLContext(sc)
+ sqlContext.sql(
+ """
+ | CREATE TEMPORARY TABLE source
+ | (c1 string, c2 string, c3 long)
+ | USING org.apache.spark.sql.CarbonSource
+ | OPTIONS (path './examples/target/store',
--- End diff --

please change path to './target/store'

> Use SQLContext to read CarbonData file
> --------------------------------------
>
> Key: CARBONDATA-212
> URL: https://issues.apache.org/jira/browse/CARBONDATA-212
> Project: CarbonData
> Issue Type: New Feature
> Reporter: Jacky Li
> Assignee: Jacky Li
> Fix For: 0.2.0-incubating
>
>
> User should be enable to use Spark's SQLContext to read CarbonData files. SQLContext will use datasource API to read corresponding CarbonData files.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)