Posted by
Akash R Nilugal (Jira) on
Nov 10, 2016; 3:26pm
URL: http://apache-carbondata-dev-mailing-list-archive.168.s1.nabble.com/jira-Created-CARBONDATA-404-Data-loading-from-DataFrame-to-carbon-table-is-FAILED-tp2846.html
Babulal created CARBONDATA-404:
----------------------------------
Summary: Data loading from DataFrame to carbon table is FAILED
Key: CARBONDATA-404
URL:
https://issues.apache.org/jira/browse/CARBONDATA-404 Project: CarbonData
Issue Type: Bug
Components: data-load
Affects Versions: 0.1.0-incubating
Reporter: Babulal
Data loading FAILED when Loading data from DataFrame with tempCSV option =true (Default option ) in 3 Node cluster .
Steps
val customSchema = StructType(Array( StructField("imei", StringType, true), StructField("deviceInformationId", IntegerType, true), StructField("mac", StringType, true), StructField("productdate", TimestampType , true), StructField("updatetime", TimestampType, true), StructField("gamePointId", DoubleType, true), StructField("contractNumber", DoubleType, true) ));
val df = cc.read.format("com.databricks.spark.csv").option("header", "false").schema(customSchema).load("/opt/data/xyz/100_default_date_11_header.csv");
Start data loading
scala> df.write.format("carbondata").option("tableName","mycarbon2").save();
INFO 10-11 23:24:35,970 - main Query [
CREATE TABLE IF NOT EXISTS DEFAULT.MYCARBON2
(IMEI STRING, DEVICEINFORMATIONID INT, MAC STRING, PRODUCTDATE TIMESTAMP, UPDATETIME TIMESTAMP, GAMEPOINTID DOUBLE, CONTRACTNUMBER DOUBLE)
STORED BY 'ORG.APACHE.CARBONDATA.FORMAT'
]
INFO 10-11 23:24:35,977 - Parsing command:
CREATE TABLE IF NOT EXISTS default.mycarbon2
(imei STRING, deviceInformationId INT, mac STRING, productdate TIMESTAMP, updatetime TIMESTAMP, gamePointId DOUBLE, contractNumber DOUBLE)
STORED BY 'org.apache.carbondata.format'
INFO 10-11 23:24:35,978 - Parse Completed
INFO 10-11 23:24:36,227 - main Query [
LOAD DATA INPATH './TEMPCSV'
INTO TABLE DEFAULT.MYCARBON2
OPTIONS ('FILEHEADER' = 'IMEI,DEVICEINFORMATIONID,MAC,PRODUCTDATE,UPDATETIME,GAMEPOINTID,CONTRACTNUMBER')
]
INFO 10-11 23:24:36,233 - Successfully able to get the table metadata file lock
AUDIT 10-11 23:24:36,234 - [BLR1000007781][root][Thread-1]Dataload failed for default.mycarbon2. The input file does not exist: ./tempCSV
INFO 10-11 23:24:36,234 - main Successfully deleted the lock file /tmp/default/mycarbon2/meta.lock
INFO 10-11 23:24:36,234 - Table MetaData Unlocked Successfully after data load
org.apache.carbondata.processing.etl.DataLoadingException: The input file does not exist: ./tempCSV
at org.apache.spark.util.FileUtils$$anonfun$getPaths$1.apply$mcVI$sp(FileUtils.scala:66)
CSV DATA
1AA1,1,Mikaa1,2015-01-01 11:00:00,2015-01-01 13:00:00,198,260
1AA2,3,Mikaa2,2015-01-02 12:00:00,2015-01-01 14:00:00,278,230
1AA3,1,Mikaa1,2015-01-03 13:00:00,2015-01-01 15:00:00,2556,1
1AA4,10,Mikaa2,2015-01-04 14:00:00,2015-01-01 16:00:00,640,254
1AA5,10,Mikaa,2015-01-05 15:00:00,2015-01-01 17:00:00,980,256
1AA6,10,Mikaa,2015-01-06 16:00:00,2015-01-01 18:00:00,1,2378
1AA7,10,Mikaa,2015-01-07 17:00:00,2015-01-01 19:00:00,96,234
1AA8,9,max,2015-01-08 18:00:00,2015-01-01 20:00:00,89,236
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)