DataX: Tencent Cloud COS object storage -> StarRocks cluster



This post describes how to use DataX to read ORC files from COS and write them into StarRocks.
 
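Requirement: synchronize 84 TB of data on Tencent Cloud COS into one large StarRocks table. A typical partition holds 2 to 3 billion rows, about 600 GB.
Tool: DataX
Plugins: hdfsreader, starrockswriter
Object storage (COS): non-converged bucket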
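
As a rough sizing aid, the figures above imply how many partition-sized jobs the migration needs. The sketch below assumes 1 TB = 1024 GB and that every partition is about 600 GB; both are simplifying assumptions, and the function name is only illustrative.

```python
import math

# Figures from the requirement; the unit conversion (1 TB = 1024 GB) is an assumption.
TOTAL_TB = 84
PARTITION_GB = 600


def estimate_partitions(total_tb: float, partition_gb: float) -> int:
    """Return the number of partition-sized chunks needed, rounded up."""
    total_gb = total_tb * 1024
    return math.ceil(total_gb / partition_gb)


if __name__ == "__main__":
    print(estimate_partitions(TOTAL_TB, PARTITION_GB))
```

Under these assumptions the result is 144 partitions, which is why the job JSON later in this post is generated per partition rather than written by hand.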
 
DataX

The DataX version I use here is DataX (DATAX-OPENSOURCE-3.0).
    [svccnetlhs@HOST datax]<231211 17:17:11>$ tree bin/ conf/
    bin/
    ├── datax.py
    ├── dxprof.py
    └── perftrace.py
    conf/
    ├── core.json
    └── logback.xml
    0 directories, 5 files
    [svccnetlhs@HOST datax]<231211 17:18:52>$ /bin/python3
    python3     python3.6   python3.6m  
    [svccnetlhs@HOST datax]<231211 17:18:52>$ /bin/python3 bin/datax.py
    DataX (DATAX-OPENSOURCE-3.0), From Alibaba !
    Copyright (C) 2010-2017, Alibaba Group. All Rights Reserved.
    Usage: datax.py [options] job-url-or-path
    Options:
      -h, --help            show this help message and exit
      Product Env Options:
        Normal user use these options to set jvm parameters, job runtime mode
        etc. Make sure these options can be used in Product Env.
        -j <jvm parameters>, --jvm=<jvm parameters>
                            Set jvm parameters if necessary.
        --jobid=<job unique id>
                            Set job unique id when running by Distribute/Local
                            Mode.
        -m <job runtime mode>, --mode=<job runtime mode>
                            Set job runtime mode such as: standalone, local,
                            distribute. Default mode is standalone.
        -p <parameter used in job config>, --params=<parameter used in job config>
                            Set job parameter, eg: the source tableName you want
                            to set it by command, then you can use like this:
                            -p"-DtableName=your-table-name", if you have mutiple
                            parameters: -p"-DtableName=your-table-name
                            -DcolumnName=your-column-name".Note: you should config
                            in you job tableName with ${tableName}.
        -r <parameter used in view job config[reader] template>, --reader=<parameter used in view job config[reader] template>
                            View job config[reader] template, eg:
                            mysqlreader,streamreader
        -w <parameter used in view job config[writer] template>, --writer=<parameter used in view job config[writer] template>
                            View job config[writer] template, eg:
                            mysqlwriter,streamwriter
      Develop/Debug Options:
        Developer use these options to trace more details of DataX.
        -d, --debug         Set to remote debug mode.
        --loglevel=<log level>
                            Set log level such as: debug, info, all etc.
    [svccnetlhs@HOST datax]<231211 17:19:06>$
 
DataX (HdfsReader) plugin
    [svccnetlhs@HOST datax]<231211 17:23:29>$ ls
    bin  conf  job  lib  log  log_perf  plugin  script  tmp
    [svccnetlhs@HOST datax]<231211 17:23:29>$
    [svccnetlhs@HOST datax]<231211 17:23:30>$ cd plugin/
    [svccnetlhs@HOST plugin]<231211 17:23:32>$ ls
    reader  writer
    [svccnetlhs@HOST plugin]<231211 17:23:32>$ cd reader/
    [svccnetlhs@HOST reader]<231211 17:23:36>$ ls
    cassandrareader   datahubreader  ftpreader  hbase094xreader  hbase11xsqlreader  hdfsreader        loghubreader   mysqlreader         odpsreader      oraclereader  otsreader        postgresqlreader  sqlserverreader  streamreader    tsdbreader
    clickhousereader  drdsreader     gdbreader  hbase11xreader   hbase20xsqlreader  kingbaseesreader  mongodbreader  oceanbasev10reader  opentsdbreader  ossreader     otsstreamreader  rdbmsreader       starrocksreader  tdenginereader  txtfilereader
    [svccnetlhs@HOST reader]<231211 17:23:37>$ cd hdfsreader/
    [svccnetlhs@HOST hdfsreader]<231211 17:23:39>$ ls
    hdfsreader-0.0.1-SNAPSHOT.jar  libs  plugin_job_template.json  plugin.json
    [svccnetlhs@HOST hdfsreader]<231211 17:23:40>$
    [svccnetlhs@HOST hdfsreader]<231211 17:23:42>$ pwd
    /home/svccnetlhs/chengken/starrocks/datax/plugin/reader/hdfsreader
    [svccnetlhs@HOST hdfsreader]<231211 17:23:43>$
    [svccnetlhs@HOST hdfsreader]<231211 17:23:44>$ cd libs/
    [svccnetlhs@HOST libs]<231211 17:23:54>$ ls
    activation-1.1.jar                     commons-beanutils-1.9.2.jar       curator-recipes-2.7.1.jar               hadoop-mapreduce-client-core-2.7.1.jar                  httpclient-4.1.2.jar           jetty-util-6.1.26.jar       parquet-hadoop-bundle-1.6.0rc3.jar
    aircompressor-0.3.jar                  commons-beanutils-core-1.8.0.jar  datanucleus-api-jdo-3.2.6.jar           hadoop-yarn-api-2.7.1.jar                               httpcore-4.1.2.jar             jline-2.12.jar              pentaho-aggdesigner-algorithm-5.1.5-jhyde.jar
    annotations-2.0.3.jar                  commons-cli-1.2.jar               datanucleus-core-3.2.10.jar             hadoop-yarn-common-2.7.1.jar                            jackson-core-asl-1.9.13.jar    jpam-1.1.jar                plugin-unstructured-storage-util-0.0.1-SNAPSHOT.jar
    ant-1.9.1.jar                          commons-codec-1.4.jar             datanucleus-rdbms-3.2.9.jar             hadoop-yarn-server-applicationhistoryservice-2.6.0.jar  jackson-jaxrs-1.9.13.jar       jsch-0.1.42.jar             protobuf-java-2.5.0.jar
    ant-launcher-1.9.1.jar                 commons-collections-3.2.1.jar     datax-common-0.0.1-SNAPSHOT.jar         hadoop-yarn-server-common-2.6.0.jar                     jackson-mapper-asl-1.9.13.jar  jsp-api-2.1.jar             servlet-api-2.5.jar
    antlr-2.7.7.jar                        commons-compiler-2.7.6.jar        derby-10.11.1.1.jar                     hadoop-yarn-server-resourcemanager-2.6.0.jar            jackson-xc-1.9.13.jar          jsr305-3.0.0.jar            slf4j-api-1.7.10.jar
    antlr-runtime-3.4.jar                  commons-compress-1.4.1.jar        eigenbase-properties-1.1.4.jar          hadoop-yarn-server-web-proxy-2.6.0.jar                  janino-2.7.6.jar               jta-1.1.jar                 slf4j-log4j12-1.7.10.jar
    aopalliance-1.0.jar                    commons-configuration-1.6.jar     fastjson2-2.0.23.jar                    hamcrest-core-1.3.jar                                   javacsv-2.0.jar                leveldbjni-all-1.8.jar      snappy-java-1.0.4.1.jar
    apache-curator-2.6.0.pom               commons-daemon-1.0.13.jar         geronimo-annotation_1.0_spec-1.1.1.jar  hive-ant-1.1.1.jar                                      javax.inject-1.jar             libfb303-0.9.2.jar          ST4-4.0.4.jar
    apacheds-i18n-2.0.0-M15.jar            commons-dbcp-1.4.jar              geronimo-jaspic_1.0_spec-1.0.jar        hive-cli-1.1.1.jar                                      java-xmlbuilder-0.4.jar        libthrift-0.9.2.jar         stax-api-1.0.1.jar
    apacheds-kerberos-codec-2.0.0-M15.jar  commons-digester-1.8.jar          geronimo-jta_1.1_spec-1.1.1.jar         hive-common-1.1.1.jar                                   jaxb-api-2.2.2.jar             log4j-1.2.17.jar            stax-api-1.0-2.jar
    apache-log4j-extras-1.2.17.jar         commons-httpclient-3.1.jar        groovy-all-2.1.6.jar                    hive-exec-1.1.1.jar                                     jaxb-impl-2.2.3-1.jar          log4j-api-2.17.1.jar        stringtemplate-3.2.1.jar
    api-asn1-api-1.0.0-M20.jar             commons-io-2.4.jar                gson-2.2.4.jar                          hive-hcatalog-core-1.1.1.jar                            jdo-api-3.0.1.jar              log4j-core-2.17.1.jar       velocity-1.5.jar
    api-util-1.0.0-M20.jar                 commons-lang-2.6.jar              guava-11.0.2.jar                        hive-metastore-1.1.1.jar                                jersey-client-1.9.jar          logback-classic-1.0.13.jar  xercesImpl-2.9.1.jar
    asm-3.1.jar                            commons-lang3-3.3.2.jar           guice-3.0.jar                           hive-serde-1.1.1.jar                                    jersey-core-1.9.jar            logback-core-1.0.13.jar     xml-apis-1.3.04.jar
    asm-commons-3.1.jar                    commons-logging-1.1.3.jar         guice-servlet-3.0.jar                   hive-service-1.1.1.jar                                  jersey-guice-1.9.jar           lzo-core-1.0.5.jar          xmlenc-0.52.jar
    asm-tree-3.1.jar                       commons-math3-3.1.1.jar           hadoop-aliyun-2.7.2.jar                 hive-shims-0.20S-1.1.1.jar                              jersey-json-1.9.jar            mail-1.4.1.jar              xz-1.0.jar
    avro-1.7.4.jar                         commons-net-3.1.jar               hadoop-annotations-2.7.1.jar            hive-shims-0.23-1.1.1.jar                               jersey-server-1.9.jar          netty-3.6.2.Final.jar       zookeeper-3.4.6.jar
    bonecp-0.8.0.RELEASE.jar               commons-pool-1.5.4.jar            hadoop-auth-2.7.1.jar                   hive-shims-1.1.1.jar                                    jets3t-0.9.0.jar               netty-all-4.0.23.Final.jar
    calcite-avatica-1.0.0-incubating.jar   cos_api-bundle-5.6.137.2.jar      hadoop-common-2.7.1.jar                 hive-shims-common-1.1.1.jar                             jettison-1.1.jar               opencsv-2.3.jar
    calcite-core-1.0.0-incubating.jar      curator-client-2.7.1.jar          hadoop-cos-3.1.0-8.3.2.jar              hive-shims-scheduler-1.1.1.jar                          jetty-6.1.26.jar               oro-2.0.8.jar
    calcite-linq4j-1.0.0-incubating.jar    curator-framework-2.6.0.jar       hadoop-hdfs-2.7.1.jar                   htrace-core-3.1.0-incubating.jar                        jetty-all-7.6.0.v20120127.jar  paranamer-2.3.jar
    [svccnetlhs@HOST libs]<231211 17:23:55>$
 
DataX (StarRocksWriter) plugin
    [svccnetlhs@HOST datax]<231211 17:25:07>$ ls
    bin  conf  job  lib  log  log_perf  plugin  script  tmp
    [svccnetlhs@HOST datax]<231211 17:25:08>$ cd plugin/
    [svccnetlhs@HOST plugin]<231211 17:25:11>$ ls
    reader  writer
    [svccnetlhs@HOST plugin]<231211 17:25:11>$ cd writer/
    [svccnetlhs@HOST writer]<231211 17:25:13>$ ls
    adbpgwriter      clickhousewriter  doriswriter          ftpwriter        hbase11xsqlwriter  hdfswriter          kuduwriter     mysqlwriter         ocswriter     oscarwriter  postgresqlwriter  sqlserverwriter  tdenginewriter
    adswriter        databendwriter    drdswriter           gdbwriter        hbase11xwriter     hologresjdbcwriter  loghubwriter   neo4jwriter         odpswriter    osswriter    rdbmswriter       starrockswriter  tsdbwriter
    cassandrawriter  datahubwriter     elasticsearchwriter  hbase094xwriter  hbase20xsqlwriter  kingbaseeswriter    mongodbwriter  oceanbasev10writer  oraclewriter  otswriter    selectdbwriter    streamwriter     txtfilewriter
    [svccnetlhs@HOST writer]<231211 17:25:13>$ cd starrockswriter/
    [svccnetlhs@HOST starrockswriter]<231211 17:25:15>$ ls
    libs  plugin_job_template.json  plugin.json  starrockswriter-1.1.0.jar
    [svccnetlhs@HOST starrockswriter]<231211 17:25:16>$ ls libs/
    commons-codec-1.9.jar        commons-io-2.4.jar       commons-logging-1.1.1.jar  datax-common-0.0.1-SNAPSHOT.jar  fastjson2-2.0.23.jar  hamcrest-core-1.3.jar  httpcore-4.4.6.jar          logback-core-1.0.13.jar          plugin-rdbms-util-0.0.1-SNAPSHOT.jar
    commons-collections-3.0.jar  commons-lang3-3.3.2.jar  commons-math3-3.1.1.jar    druid-1.0.15.jar                 guava-r05.jar         httpclient-4.5.3.jar   logback-classic-1.0.13.jar  mysql-connector-java-5.1.46.jar  slf4j-api-1.7.10.jar
    [svccnetlhs@HOST starrockswriter]<231211 17:25:21>$
Note: both DataX plugins can be downloaded from the attachments at the top of this post.
 
DataX JSON

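As is well known, DataX moves data in three steps: data extraction, data transformation, and data loading.
DataX design philosophy: [figure]

DataX framework design: [figure]

DataX workflow: [figure]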
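
The three steps named above can be illustrated with a toy pipeline. This is only a conceptual sketch built from plain Python generators, not DataX's actual implementation; every function name here is hypothetical.

```python
from typing import Callable, Iterable, Iterator, List

Record = List[str]


def extract(rows: Iterable[Record]) -> Iterator[Record]:
    """Extraction: yield source records one at a time (the reader's role)."""
    for row in rows:
        yield row


def transform(records: Iterable[Record],
              fn: Callable[[Record], Record]) -> Iterator[Record]:
    """Transformation: apply fn to each record as it streams through."""
    for record in records:
        yield fn(record)


def load(records: Iterable[Record], sink: List[Record]) -> int:
    """Loading: append every record to the sink (the writer's role)."""
    count = 0
    for record in records:
        sink.append(record)
        count += 1
    return count


def run_pipeline(rows: Iterable[Record],
                 fn: Callable[[Record], Record]) -> List[Record]:
    """Chain the three steps and return what reached the sink."""
    sink: List[Record] = []
    load(transform(extract(rows), fn), sink)
    return sink


if __name__ == "__main__":
    # Prepend a constant partition value to each record.
    print(run_pipeline([["id-1", "imei-1"]], lambda r: ["2023-11-14"] + r))
```

In the real job, the reader and writer plugins configured in the JSON play the extract and load roles.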

Connection JSON:

Template 1:

Notes on this template:
1. The COS files contain no ts field, so the reader column for ts uses "value" to supply a constant. The value 2023-11-14 identifies the partition that the current path points to, and the calling script passes it in dynamically.
2. All other reader columns are read by position through "index".
3. The writer "column" list holds the column names that StarRocks receives.
4. Under "speed", channel is set to 3 and byte is -1, meaning no rate limit.
5. Values in angle brackets are placeholders. The configuration sits under a top-level "job" key, as in Template 2.

    {
        "job": {
            "content": [
                {
                    "reader": {
                        "name": "hdfsreader",
                        "parameter": {
                            "column": [
                                {
                                    "name": "ts",
                                    "type": "string",
                                    "value": "2023-11-14"
                                },
                                {
                                    "index": 0,
                                    "name": "local_id",
                                    "type": "string"
                                },
                                {
                                    "index": 1,
                                    "name": "encrypted_imei",
                                    "type": "string"
                                },
                                {
                                    "index": 2,
                                    "name": "encrypted_idfa",
                                    "type": "string"
                                },
                                {
                                    "index": 3,
                                    "name": "encrypted_mac",
                                    "type": "string"
                                },
                                {
                                    "index": 4,
                                    "name": "encrypted_android_id",
                                    "type": "string"
                                }
                            ],
                            "defaultFS": "cosn://<bucket-name>/",
                            "encoding": "UTF-8",
                            "fieldDelimiter": ",",
                            "fileType": "orc",
                            "hadoopConfig": {
                                "fs.cosn.impl": "org.apache.hadoop.fs.CosFileSystem",
                                "fs.cosn.tmp.dir": "<any local temp directory>",
                                "fs.cosn.userinfo.region": "ap-guangzhou",
                                "fs.cosn.userinfo.secretId": "",
                                "fs.cosn.userinfo.secretKey": ""
                            },
                            "path": "/sam/sam_dwd_user_action_cos_d/20231114/part-00011*"
                        }
                    },
                    "writer": {
                        "name": "starrockswriter",
                        "parameter": {
                            "column": [
                                "ts",
                                "local_id",
                                "encrypted_imei",
                                "encrypted_idfa",
                                "encrypted_mac",
                                "encrypted_android_id"
                            ],
                            "database": "<StarRocks database>",
                            "jdbcUrl": "jdbc:mysql://StarRocksFE_IP:9030/",
                            "loadProps": {
                                "max_filter_ratio": 1
                            },
                            "loadUrl": [
                                "StarRocksFE_IP:8030",
                                "StarRocksFE_IP:8030",
                                "StarRocksFE_IP:8030"
                            ],
                            "password": "<StarRocks password>",
                            "postSql": [],
                            "preSql": [],
                            "table": "<StarRocks table>",
                            "username": "<StarRocks user>"
                        }
                    }
                }
            ],
            "setting": {
                "speed": {
                    "byte": -1,
                    "channel": 3
                }
            }
        }
    }
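
Because the ts value and the path both change with the partition, Template 1 lends itself to being generated by a script. The following is a minimal sketch of such a generator, assuming the partition date arrives as a YYYYMMDD string. The function name `build_job`, its arguments, and the idea of passing `hadoop_config` and `writer_params` in as dictionaries are illustrative choices, not the author's actual script.

```python
import json

# Column names from Template 1; list order matches the ORC field positions.
ORC_COLUMNS = [
    "local_id",
    "encrypted_imei",
    "encrypted_idfa",
    "encrypted_mac",
    "encrypted_android_id",
]


def build_job(ds, bucket, hadoop_config, writer_params, part_glob="part-*"):
    """Build a DataX job dict for one partition date (ds = 'YYYYMMDD')."""
    ts = "{}-{}-{}".format(ds[0:4], ds[4:6], ds[6:8])

    # The constant ts column comes first, then positional ORC columns.
    reader_columns = [{"name": "ts", "type": "string", "value": ts}]
    for index, name in enumerate(ORC_COLUMNS):
        reader_columns.append({"index": index, "name": name, "type": "string"})

    reader = {
        "name": "hdfsreader",
        "parameter": {
            "column": reader_columns,
            "defaultFS": "cosn://{}/".format(bucket),
            "encoding": "UTF-8",
            "fieldDelimiter": ",",
            "fileType": "orc",
            "hadoopConfig": hadoop_config,  # credentials, region, tmp dir
            "path": "/sam/sam_dwd_user_action_cos_d/{}/{}".format(ds, part_glob),
        },
    }

    # Writer columns mirror the reader columns, in the same order.
    writer_parameter = dict(writer_params)
    writer_parameter["column"] = ["ts"] + ORC_COLUMNS
    writer = {"name": "starrockswriter", "parameter": writer_parameter}

    return {
        "job": {
            "setting": {"speed": {"byte": -1, "channel": 3}},
            "content": [{"reader": reader, "writer": writer}],
        }
    }


if __name__ == "__main__":
    job = build_job("20231114", "example-bucket", {}, {"table": "example_table"})
    print(json.dumps(job, indent=4))
```

The dictionary is wrapped in a top-level "job" key, as Template 2 does, and can be written to a file with `json.dump` before it is handed to datax.py.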
Template 2:

    {
        "job": {
            "setting": {
                "speed": {
                    "channel": 3
                },
                "errorLimit": {}
            },
            "content": [{
                "reader": {
                    "name": "hdfsreader",
                    "parameter": {
                        "path": "/sam/sam_dwd_user_action_cos_d/20231114/part-*",
                        "defaultFS": "cosn://*********/",
                        "column": [
                            {"name":"ts","type":"string","value":"2023-11-14"},
                            {"name":"import_ds_","type":"string","index":0},
                            {"name":"unique_action_id","type":"string","index":1},
                            {"name":"action_time","type":"string","index":2},
                            {"name":"report_time","type":"string","index":3},
                            {"name":"action_type","type":"string","index":4},
                            {"name":"ka_id","type":"string","index":5},
                            {"name":"action_session_id","type":"string","index":6},
                            {"name":"uuid","type":"string","index":7},
                            {"name":"wx_app_id","type":"string","index":8},
                            {"name":"wx_open_id","type":"string","index":9},
                            {"name":"wx_union_id","type":"string","index":10},
                            {"name":"external_user_id","type":"string","index":11},
                            {"name":"merber_id","type":"string","index":12},
                            {"name":"local_id","type":"string","index":13},
                            {"name":"encrypted_imei","type":"string","index":14},
                            {"name":"encrypted_idfa","type":"string","index":15},
                            {"name":"encrypted_mac","type":"string","index":16},
                            {"name":"encrypted_android_id","type":"string","index":17},
                            {"name":"encrypted_qq","type":"string","index":18},
                            {"name":"encrypted_phone","type":"string","index":19},
                            {"name":"encrypting_algorithm","type":"string","index":20},
                            {"name":"chan_id","type":"string","index":21},
                            {"name":"chan_refer_app_id","type":"string","index":22},
                            {"name":"chan_shop_id","type":"string","index":23},
                            {"name":"chan_shop_name","type":"string","index":24},
                            {"name":"client_type","type":"string","index":25},
                            {"name":"client_name","type":"string","index":26},
                            {"name":"client_version","type":"string","index":27},
                            {"name":"sdk_version","type":"string","index":28},
                            {"name":"device_model","type":"string","index":29},
                            {"name":"ip","type":"string","index":30},
                            {"name":"user_agent","type":"string","index":31},
                            {"name":"page_path","type":"string","index":32},
                            {"name":"page_name","type":"string","index":33},
                            {"name":"referrer","type":"string","index":34},
                            {"name":"address","type":"string","index":35},
                            {"name":"city","type":"string","index":36},
                            {"name":"province","type":"string","index":37},
                            {"name":"country","type":"string","index":38},
                            {"name":"latitude","type":"string","index":39},
                            {"name":"longitude","type":"string","index":40},
                            {"name":"json_properties","type":"string","index":41},
                            {"name":"fdate","type":"string","index":42},
                            {"name":"tag_id","type":"string","index":43},
                            {"name":"tag_name","type":"string","index":44},
                            {"name":"chan_custom_id","type":"string","index":45},
                            {"name":"etl_load_time","type":"string","index":46},
                            {"name":"event_name","type":"string","index":47}
                        ],
                        "fileType": "orc",
                        "encoding": "UTF-8",
                        "hadoopConfig": {
                            "fs.cosn.impl": "org.apache.hadoop.fs.CosFileSystem",
                            "fs.cosn.userinfo.region": "ap-guangzhou",
                            "fs.cosn.tmp.dir": "/u/chengken/starrocks/sam/data",
                            "fs.cosn.userinfo.secretId": "***************",
                            "fs.cosn.userinfo.secretKey": "**************",
                            "fs.cosn.read.ahead.block.size": 1048576,
                            "fs.cosn.read.ahead.queue.size": 2
                        },
                        "fieldDelimiter": ","
                    }
                },
                "writer": {
                    "name": "starrockswriter",
                    "parameter": {
                        "maxBatchRows":"5000000",
                        "maxBatchSize":"5368709120",
                        "username": "cndlopsns",
                        "password": "lizhenghua1.",
                        "database": "ods",
                        "table": "buckets_tmp_20231205__sams_cos",
                        "column": [
                            "ts",
                            "import_ds_",
                            "unique_action_id",
                            "action_time",
                            "report_time",
                            "action_type",
                            "ka_id",
                            "action_session_id",
                            "uuid",
                            "wx_app_id",
                            "wx_open_id",
                            "wx_union_id",
                            "external_user_id",
                            "merber_id",
                            "local_id",
                            "encrypted_imei",
                            "encrypted_idfa",
                            "encrypted_mac",
                            "encrypted_android_id",
                            "encrypted_qq",
                            "encrypted_phone",
                            "encrypting_algorithm",
                            "chan_id",
                            "chan_refer_app_id",
                            "chan_shop_id",
                            "chan_shop_name",
                            "client_type",
                            "client_name",
                            "client_version",
                            "sdk_version",
                            "device_model",
                            "ip",
                            "user_agent",
                            "page_path",
                            "page_name",
                            "referrer",
                            "address",
                            "city",
                            "province",
                            "country",
                            "latitude",
                            "longitude",
                            "json_properties",
                            "fdate",
                            "tag_id",
                            "tag_name",
                            "chan_custom_id",
                            "etl_load_time",
                            "event_name"
                        ],
                        "preSql": [],
                        "postSql": [],
                        "jdbcUrl": "jdbc:mysql://***********:9030/ods?useCursorFetch=true&tinyInt1isBit=false&query_timeout=36000&useUnicode=true&characterEncoding=utf8",
                        "loadUrl": ["****1:8030","****2:8030","****3:8030"],
                        "loadProps": {"max_filter_ratio":1}
                    }
                }
            }]
        }
    }
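
With 49 columns on each side, a reader/writer mismatch is easy to introduce and hard to spot by eye. The sketch below compares the two column lists of a job such as Template 2 before it is submitted. It relies on the convention used in this post of giving every reader column a "name" that equals the corresponding writer column; `check_columns` is a hypothetical helper, not part of DataX.

```python
def check_columns(job):
    """Return a list of human-readable problems; an empty list means aligned."""
    problems = []
    for i, item in enumerate(job["job"]["content"]):
        reader_cols = [c.get("name") for c in item["reader"]["parameter"]["column"]]
        writer_cols = list(item["writer"]["parameter"]["column"])

        if len(reader_cols) != len(writer_cols):
            problems.append(
                "content[{}]: reader has {} columns, writer has {}".format(
                    i, len(reader_cols), len(writer_cols)
                )
            )

        # Compare position by position so that ordering mistakes are reported.
        for pos, (r, w) in enumerate(zip(reader_cols, writer_cols)):
            if r != w:
                problems.append(
                    "content[{}] position {}: reader '{}' != writer '{}'".format(
                        i, pos, r, w
                    )
                )
    return problems


if __name__ == "__main__":
    import json
    import sys

    # Usage: python check_columns.py path/to/job.json
    with open(sys.argv[1]) as fh:
        for problem in check_columns(json.load(fh)):
            print(problem)
```

Running it on each generated job file before launching DataX is a cheap guard against loading values into the wrong StarRocks columns.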
 
Launch

Start the job. It reads the upstream data files and then writes them to StarRocks asynchronously.
    /usr/bin/python2.7 /home/hadoop/datax/bin/datax.py --jvm="-Xms1G -Xmx4G" /home/hadoop/datax/cos-starrocks1.json
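
When many partitions have to be loaded, the launch command is usually assembled by a script. Here is a minimal sketch that builds the argument list for the command shown above, using only the `--jvm` and `--params` options listed in the datax.py help output. The helper name `build_datax_command` and its defaults are assumptions for illustration.

```python
import shlex


def build_datax_command(python_bin, datax_py, job_json,
                        xms="1G", xmx="4G", params=None):
    """Return the argv list for one DataX run.

    params is an optional dict rendered as -Dkey=value pairs for --params;
    the job JSON must then reference them as ${key}, per the help text.
    """
    cmd = [python_bin, datax_py, "--jvm=-Xms{} -Xmx{}".format(xms, xmx)]
    if params:
        pairs = " ".join("-D{}={}".format(k, v) for k, v in params.items())
        cmd.append("--params=" + pairs)
    cmd.append(job_json)
    return cmd


def as_shell_string(cmd):
    """Quote each argument so the command can be pasted into a shell."""
    return " ".join(shlex.quote(arg) for arg in cmd)


if __name__ == "__main__":
    cmd = build_datax_command(
        "/usr/bin/python2.7",
        "/home/hadoop/datax/bin/datax.py",
        "/home/hadoop/datax/cos-starrocks1.json",
    )
    print(as_shell_string(cmd))
```

The returned list can be passed directly to `subprocess.run(cmd, check=True)`, which avoids shell quoting problems with the space inside the `--jvm` value.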
    Run: /usr/bin/python2.7 /u/chengken/datax/bin/datax.py --jvm="-Xms16g -Xmx16g" /home/svccnetlhs/chengken/starrocks/json/20231114_sam_dwd_user_action_cos__1702288181.json
    Output_____________
    DataX (DATAX-OPENSOURCE-3.0), From Alibaba !
    Copyright (C) 2010-2017, Alibaba Group. All Rights Reserved.
    2023-12-11 17:49:54.918 [main] INFO  MessageSource - JVM TimeZone: GMT+08:00, Locale: zh_CN
    2023-12-11 17:49:54.920 [main] INFO  MessageSource - use Locale: zh_CN timeZone: sun.util.calendar.ZoneInfo[id="GMT+08:00",offset=28800000,dstSavings=0,useDaylight=false,transitions=0,lastRule=null]
    2023-12-11 17:49:54.945 [main] INFO  VMInfo - VMInfo# operatingSystem class => sun.management.OperatingSystemImpl
    2023-12-11 17:49:54.950 [main] INFO  Engine - the machine info  =>
        osInfo:    Linux amd64 4.18.0-348.7.1.el8_5.x86_64
        jvmInfo:    Oracle Corporation 1.8 25.112-b15
        cpu num:    16
        totalPhysicalMemory:    -0.00G
        freePhysicalMemory:    -0.00G
        maxFileDescriptorCount:    -1
        currentOpenFileDescriptorCount:    -1
        GC Names    [PS MarkSweep, PS Scavenge]
        MEMORY_NAME                    | allocation_size                | init_size                     
        PS Eden Space                  | 4,096.00MB                     | 4,096.00MB                     
        Code Cache                     | 240.00MB                       | 2.44MB                        
        Compressed Class Space         | 1,024.00MB                     | 0.00MB                        
        PS Survivor Space              | 682.50MB                       | 682.50MB                       
        PS Old Gen                     | 10,923.00MB                    | 10,923.00MB                    
        Metaspace                      | -0.00MB                        | 0.00MB                        
    2023-12-11 17:49:54.966 [main] INFO  Engine -
    {
        "setting":{
            "speed":{
                "channel":3
            },
            "errorLimit":{
                
            }
        },
        "content":[
            {
                "reader":{
                    "name":"hdfsreader",
                    "parameter":{
                        "path":"/sam/sam_dwd_user_action_cos_d/20231114/part-*",
                        "defaultFS":"cosn://*********/",
                        "column":[
                            {
                                "name":"ts",
                                "type":"string",
                                "value":"2023-11-14"
                            },
                            {
                                "name":"import_ds_",
                                "type":"string",
                                "index":0
                            },
                            {
                                "name":"unique_action_id",
                                "type":"string",
                                "index":1
                            },
                            {
                                "name":"action_time",
                                "type":"string",
                                "index":2
                            },
                            {
                                "name":"report_time",
                                "type":"string",
                                "index":3
                            },
                            {
                                "name":"action_type",
                                "type":"string",
                                "index":4
                            },
                            {
                                "name":"ka_id",
                                "type":"string",
                                "index":5
                            },
                            {
                                "name":"action_session_id",
                                "type":"string",
                                "index":6
                            },
                            {
                                "name":"uuid",
                                "type":"string",
                                "index":7
                            },
                            {
                                "name":"wx_app_id",
                                "type":"string",
                                "index":8
                            },
                            {
                                "name":"wx_open_id",
                                "type":"string",
                                "index":9
                            },
                            {
                                "name":"wx_union_id",
                                "type":"string",
                                "index":10
                            },
                            {
                                "name":"external_user_id",
                                "type":"string",
                                "index":11
                            },
                            {
                                "name":"merber_id",
                                "type":"string",
                                "index":12
                            },
                            {
                                "name":"local_id",
  114.                             "type":"string",
  115.                             "index":13
  116.                         },
  117.                         {
  118.                             "name":"encrypted_imei",
  119.                             "type":"string",
  120.                             "index":14
  121.                         },
  122.                         {
  123.                             "name":"encrypted_idfa",
  124.                             "type":"string",
  125.                             "index":15
  126.                         },
  127.                         {
  128.                             "name":"encrypted_mac",
  129.                             "type":"string",
  130.                             "index":16
  131.                         },
  132.                         {
  133.                             "name":"encrypted_android_id",
  134.                             "type":"string",
  135.                             "index":17
  136.                         },
  137.                         {
  138.                             "name":"encrypted_qq",
  139.                             "type":"string",
  140.                             "index":18
  141.                         },
  142.                         {
  143.                             "name":"encrypted_phone",
  144.                             "type":"string",
  145.                             "index":19
  146.                         },
  147.                         {
  148.                             "name":"encrypting_algorithm",
  149.                             "type":"string",
  150.                             "index":20
  151.                         },
  152.                         {
  153.                             "name":"chan_id",
  154.                             "type":"string",
  155.                             "index":21
  156.                         },
  157.                         {
  158.                             "name":"chan_refer_app_id",
  159.                             "type":"string",
  160.                             "index":22
  161.                         },
  162.                         {
  163.                             "name":"chan_shop_id",
  164.                             "type":"string",
  165.                             "index":23
  166.                         },
  167.                         {
  168.                             "name":"chan_shop_name",
  169.                             "type":"string",
  170.                             "index":24
  171.                         },
  172.                         {
  173.                             "name":"client_type",
  174.                             "type":"string",
  175.                             "index":25
  176.                         },
  177.                         {
  178.                             "name":"client_name",
  179.                             "type":"string",
  180.                             "index":26
  181.                         },
  182.                         {
  183.                             "name":"client_version",
  184.                             "type":"string",
  185.                             "index":27
  186.                         },
  187.                         {
  188.                             "name":"sdk_version",
  189.                             "type":"string",
  190.                             "index":28
  191.                         },
  192.                         {
  193.                             "name":"device_model",
  194.                             "type":"string",
  195.                             "index":29
  196.                         },
  197.                         {
  198.                             "name":"ip",
  199.                             "type":"string",
  200.                             "index":30
  201.                         },
  202.                         {
  203.                             "name":"user_agent",
  204.                             "type":"string",
  205.                             "index":31
  206.                         },
  207.                         {
  208.                             "name":"page_path",
  209.                             "type":"string",
  210.                             "index":32
  211.                         },
  212.                         {
  213.                             "name":"page_name",
  214.                             "type":"string",
  215.                             "index":33
  216.                         },
  217.                         {
  218.                             "name":"referrer",
  219.                             "type":"string",
  220.                             "index":34
  221.                         },
  222.                         {
  223.                             "name":"address",
  224.                             "type":"string",
  225.                             "index":35
  226.                         },
  227.                         {
  228.                             "name":"city",
  229.                             "type":"string",
  230.                             "index":36
  231.                         },
  232.                         {
  233.                             "name":"province",
  234.                             "type":"string",
  235.                             "index":37
  236.                         },
  237.                         {
  238.                             "name":"country",
  239.                             "type":"string",
  240.                             "index":38
  241.                         },
  242.                         {
  243.                             "name":"latitude",
  244.                             "type":"string",
  245.                             "index":39
  246.                         },
  247.                         {
  248.                             "name":"longitude",
  249.                             "type":"string",
  250.                             "index":40
  251.                         },
  252.                         {
  253.                             "name":"json_properties",
  254.                             "type":"string",
  255.                             "index":41
  256.                         },
  257.                         {
  258.                             "name":"fdate",
  259.                             "type":"string",
  260.                             "index":42
  261.                         },
  262.                         {
  263.                             "name":"tag_id",
  264.                             "type":"string",
  265.                             "index":43
  266.                         },
  267.                         {
  268.                             "name":"tag_name",
  269.                             "type":"string",
  270.                             "index":44
  271.                         },
  272.                         {
  273.                             "name":"chan_custom_id",
  274.                             "type":"string",
  275.                             "index":45
  276.                         },
  277.                         {
  278.                             "name":"etl_load_time",
  279.                             "type":"string",
  280.                             "index":46
  281.                         },
  282.                         {
  283.                             "name":"event_name",
  284.                             "type":"string",
  285.                             "index":47
  286.                         }
  287.                     ],
  288.                     "fileType":"orc",
  289.                     "encoding":"UTF-8",
  290.                     "hadoopConfig":{
  291.                         "fs.cosn.impl":"org.apache.hadoop.fs.CosFileSystem",
  292.                         "fs.cosn.userinfo.region":"ap-guangzhou",
  293.                         "fs.cosn.tmp.dir":"/u/chengken/starrocks/sam/data",
  294.                         "fs.cosn.userinfo.secretId":"**********",
  295.                         "fs.cosn.userinfo.secretKey":"****************",
  296.                         "fs.cosn.read.ahead.block.size":1048576,
  297.                         "fs.cosn.read.ahead.queue.size":2
  298.                     },
  299.                     "fieldDelimiter":","
  300.                 }
  301.             },
  302.             "writer":{
  303.                 "name":"starrockswriter",
  304.                 "parameter":{
  305.                     "maxBatchRows":"5000000",
  306.                     "maxBatchSize":"5368709120",
  307.                     "username":"cndlopsns",
  308.                     "password":"************",
  309.                     "database":"ods",
  310.                     "table":"buckets_tmp_20231205__sams_cos",
  311.                     "column":[
  312.                         "ts",
  313.                         "import_ds_",
  314.                         "unique_action_id",
  315.                         "action_time",
  316.                         "report_time",
  317.                         "action_type",
  318.                         "ka_id",
  319.                         "action_session_id",
  320.                         "uuid",
  321.                         "wx_app_id",
  322.                         "wx_open_id",
  323.                         "wx_union_id",
  324.                         "external_user_id",
  325.                         "merber_id",
  326.                         "local_id",
  327.                         "encrypted_imei",
  328.                         "encrypted_idfa",
  329.                         "encrypted_mac",
  330.                         "encrypted_android_id",
  331.                         "encrypted_qq",
  332.                         "encrypted_phone",
  333.                         "encrypting_algorithm",
  334.                         "chan_id",
  335.                         "chan_refer_app_id",
  336.                         "chan_shop_id",
  337.                         "chan_shop_name",
  338.                         "client_type",
  339.                         "client_name",
  340.                         "client_version",
  341.                         "sdk_version",
  342.                         "device_model",
  343.                         "ip",
  344.                         "user_agent",
  345.                         "page_path",
  346.                         "page_name",
  347.                         "referrer",
  348.                         "address",
  349.                         "city",
  350.                         "province",
  351.                         "country",
  352.                         "latitude",
  353.                         "longitude",
  354.                         "json_properties",
  355.                         "fdate",
  356.                         "tag_id",
  357.                         "tag_name",
  358.                         "chan_custom_id",
  359.                         "etl_load_time",
  360.                         "event_name"
  361.                     ],
  362.                     "preSql":[
  363.                         
  364.                     ],
  365.                     "postSql":[
  366.                         
  367.                     ],
  368.                     "jdbcUrl":"jdbc:mysql://192.168.1.121:9030/ods?useCursorFetch=true&tinyInt1isBit=false&query_timeout=36000&useUnicode=true&characterEncoding=utf8",
  369.                     "loadUrl":[
  370.                         "192.168.1.121:8030",
  371.                         "192.168.1.122:8030",
  372.                         "192.168.1.123:8030"
  373.                     ],
  374.                     "loadProps":{
  375.                         "max_filter_ratio":1
  376.                     }
  377.                 }
  378.             }
  379.         }
  380.     ]
  381. }
  382. 2023-12-11 17:49:54.983 [main] INFO  PerfTrace - PerfTrace traceId=job_-1, isEnable=true
  383. 2023-12-11 17:49:54.983 [main] INFO  JobContainer - DataX jobContainer starts job.
  384. 2023-12-11 17:49:54.984 [main] INFO  JobContainer - Set jobId = 0
  385. 2023-12-11 17:49:54.994 [job-0] INFO  HdfsReader$Job - init() begin...
  386. 2023-12-11 17:49:55.250 [job-0] INFO  HdfsReader$Job - hadoopConfig details:{"classLoader":{"URLs":["file:/u/chengken/datax/plugin/reader/hdfsreader/hdfsreader-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jaxb-impl-2.2.3-1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-hcatalog-core-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-auth-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/calcite-linq4j-1.0.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-metastore-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jta-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/activation-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/aircompressor-0.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/aopalliance-1.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-configuration-1.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-web-proxy-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/guice-servlet-3.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-cli-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/calcite-avatica-1.0.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xz-1.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/parquet-hadoop-bundle-1.6.0rc3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jetty-all-7.6.0.v20120127.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-httpclient-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/stax-api-1.0-2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/apacheds-i18n-2.0.0-M15.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/log4j-api-2.17.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/mail-1.4.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jaxb-api-2.2.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-annotations-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/geronimo-jta_1.1_spec-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/libthrift-0.9.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/oro-2.0.8.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-hdfs-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-jaxrs-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xercesImpl-2.9.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-logging-1.1.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-serde-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-common-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/curator-recipes-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/opencsv-2.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-json-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-service-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/ant-1.9.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/avro-1.7.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-0.20S-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/log4j-1.2.17.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/httpclient-4.1.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-cli-1.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xml-apis-1.3.04.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-digester-1.8.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/servlet-api-2.5.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/lzo-core-1.0.5.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/log4j-core-2.17.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/javacsv-2.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/protobuf-java-2.5.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-resourcemanager-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-common-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-lang3-3.3.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xmlenc-0.52.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-xc-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-ant-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/pentaho-aggdesigner-algorithm-5.1.5-jhyde.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/apache-log4j-extras-1.2.17.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jetty-util-6.1.26.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-0.23-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/guava-11.0.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/logback-core-1.0.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-scheduler-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-aliyun-2.7.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-dbcp-1.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-compress-1.4.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-common-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-io-2.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/httpcore-4.1.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-mapreduce-client-core-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-core-asl-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-core-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/janino-2.7.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jdo-api-3.0.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jsp-api-2.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/annotations-2.0.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/velocity-1.5.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-codec-1.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-applicationhistoryservice-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/netty-3.6.2.Final.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/curator-framework-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/asm-tree-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jetty-6.1.26.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/calcite-core-1.0.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/plugin-unstructured-storage-util-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datax-common-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-beanutils-core-1.8.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/groovy-all-2.1.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-api-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-server-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datanucleus-api-jdo-3.2.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/guice-3.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datanucleus-rdbms-3.2.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/java-xmlbuilder-0.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-common-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-client-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-guice-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/libfb303-0.9.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/paranamer-2.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/antlr-runtime-3.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/zookeeper-3.4.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/geronimo-annotation_1.0_spec-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/ST4-4.0.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jettison-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datanucleus-core-3.2.10.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/api-asn1-api-1.0.0-M20.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/geronimo-jaspic_1.0_spec-1.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/apacheds-kerberos-codec-2.0.0-M15.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hamcrest-core-1.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/asm-commons-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/ant-launcher-1.9.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-exec-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-collections-3.2.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/asm-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/stringtemplate-3.2.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-common-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jline-2.12.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/api-util-1.0.0-M20.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/stax-api-1.0.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-net-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/bonecp-0.8.0.RELEASE.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/gson-2.2.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/slf4j-log4j12-1.7.10.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jets3t-0.9.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-lang-2.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/eigenbase-properties-1.1.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/antlr-2.7.7.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jsch-0.1.42.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/leveldbjni-all-1.8.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-beanutils-1.9.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/logback-classic-1.0.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jpam-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/snappy-java-1.0.4.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-mapper-asl-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-math3-3.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-pool-1.5.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/netty-all-4.0.23.Final.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-compiler-2.7.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-daemon-1.0.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/javax.inject-1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/htrace-core-3.1.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jsr305-3.0.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/slf4j-api-1.7.10.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/fastjson2-2.0.23.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/derby-10.11.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/curator-client-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/cos_api-bundle-5.6.137.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-cos-3.1.0-8.3.2.jar"],"parent":{"URLs":["file:/u/chengken/datax/lib/commons-lang3-3.3.2.jar","file:/u/chengken/datax/lib/logback-core-1.0.13.jar","file:/u/chengken/datax/lib/commons-io-2.4.jar","file:/u/chengken/datax/lib/datax-common-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/lib/hamcrest-core-1.3.jar","file:/u/chengken/datax/lib/datax-transformer-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/lib/logback-classic-1.0.13.jar","file:/u/chengken/datax/lib/commons-math3-3.1.1.jar","file:/u/chengken/datax/lib/slf4j-api-1.7.10.jar","file:/u/chengken/datax/lib/fastjson2-2.0.23.jar","file:/u/chengken/datax/lib/commons-logging-1.1.1.jar","file:/u/chengken/datax/lib/httpcore-4.4.13.jar","file:/u/chengken/datax/lib/commons-configuration-1.10.jar","file:/u/chengken/datax/lib/fluent-hc-4.5.jar","file:/u/chengken/datax/lib/commons-cli-1.2.jar","file:/u/chengken/datax/lib/janino-2.5.16.jar","file:/u/chengken/datax/lib/groovy-all-2.1.9.jar","file:/u/chengken/datax/lib/commons-codec-1.11.jar","file:/u/chengken/datax/lib/commons-collections-3.2.1.jar","file:/u/chengken/datax/lib/commons-lang-2.6.jar","file:/u/chengken/datax/lib/httpclient-4.5.13.jar","file:/u/chengken/datax/lib/commons-beanutils-1.9.2.jar","file:/u/chengken/datax/lib/datax-core-0.0.1-SNAPSHOT.jar","file:/home/svccnetlhs/chengken/starrocks/"],"parent":{"URLs":["file:/opt/software/jdk1.8.0_112/jre/lib/ext/jaccess.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/sunjce_provider.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/sunpkcs11.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/localedata.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/nashorn.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/sunec.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/dnsns.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/zipfs.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/cldrdata.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/jfxrt.jar"]}}},"finalParameters":[]}
  387. 2023-12-11 17:49:55.250 [job-0] INFO  HdfsReader$Job - init() ok and end...
  388. 2023-12-11 17:49:55.279 [job-0] INFO  JobContainer - jobContainer starts to do prepare ...
  389. 2023-12-11 17:49:55.279 [job-0] INFO  JobContainer - DataX Reader.Job [hdfsreader] do prepare work .
  390. 2023-12-11 17:49:55.279 [job-0] INFO  HdfsReader$Job - prepare(), start to getAllFiles...
  391. 2023-12-11 17:49:55.279 [job-0] INFO  HdfsReader$Job - get HDFS all files in path = [/sam/sam_dwd_user_action_cos_d/20231114/part-*]
  392. Dec 11, 2023 5:49:55 PM org.apache.hadoop.util.NativeCodeLoader <clinit>
  393. WARNING: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
  394. 2023-12-11 17:49:55.527 [job-0] INFO  RangerCredentialsClient - begin to init ranger client, impl []
  395. 2023-12-11 17:49:55.776 [job-0] INFO  CosNativeFileSystemStore - hadoop cos retry times: 200, cos client retry times: 5
  396. log4j:WARN No appenders could be found for logger (com.qcloud.cos.thirdparty.org.apache.http.client.protocol.RequestAddCookies).
  397. log4j:WARN Please initialize the log4j system properly.
  398. log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
  399. 2023-12-11 17:49:56.400 [job-0] INFO  CosFileSystem - The cos bucket is the normal bucket.
  400. 2023-12-11 17:49:56.418 [job-0] INFO  BufferPool - Initialize the buffer pool.
  401. 2023-12-11 17:49:56.419 [job-0] INFO  BufferPool - fs.cosn.upload.buffer.size is set to -1, so the 'mapped_disk' buffer will be used by default.
  402. 2023-12-11 17:49:56.419 [job-0] INFO  BufferPool - The type of the upload buffer pool is [MAPPED_DISK]. Buffer size:[-1]
  403. 2023-12-11 17:49:56.419 [job-0] INFO  BufferPool - tmp dir list
  404. 2023-12-11 17:49:56.796 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00000-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  405. 2023-12-11 17:49:56.953 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00000-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc] is a file of type [orc]; adding it to the source files list
  406. 2023-12-11 17:49:57.014 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00001-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  407. 2023-12-11 17:49:57.192 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00001-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc] is a file of type [orc]; adding it to the source files list
  408. 2023-12-11 17:49:57.244 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  409. 2023-12-11 17:49:57.392 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc] is a file of type [orc]; adding it to the source files list
  410. 2023-12-11 17:49:57.443 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00003-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  411. 2023-12-11 17:49:57.586 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00003-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc] is a file of type [orc]; adding it to the source files list
  412. 2023-12-11 17:49:57.645 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  413. 2023-12-11 17:49:57.782 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc] is a file of type [orc]; adding it to the source files list
  414. 2023-12-11 17:49:57.837 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00005-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  415. 2023-12-11 17:49:58.007 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00005-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc] is a file of type [orc]; adding it to the source files list
  416. 2023-12-11 17:49:58.066 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00006-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  417. 2023-12-11 17:49:58.233 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00006-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc] is a file of type [orc]; adding it to the source files list
  418. 2023-12-11 17:49:58.292 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00007-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  419. 2023-12-11 17:49:58.444 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00007-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc] is a file of type [orc]; adding it to the source files list
  420. 2023-12-11 17:49:58.510 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  421. 2023-12-11 17:49:58.694 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc] is a file of type [orc]; adding it to the source files list
  422. 2023-12-11 17:49:58.749 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00009-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  423. 2023-12-11 17:49:58.975 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00009-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc] is a file of type [orc]; adding it to the source files list
  424. 2023-12-11 17:49:59.030 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00010-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  425. 2023-12-11 17:49:59.181 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00010-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]是[orc]类型的文件, 将该文件加入source files列表
  426. 2023-12-11 17:49:59.232 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00011-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  427. 2023-12-11 17:49:59.393 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00011-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]是[orc]类型的文件, 将该文件加入source files列表
  428. 2023-12-11 17:49:59.449 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00012-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  429. 2023-12-11 17:49:59.602 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00012-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]是[orc]类型的文件, 将该文件加入source files列表
  430. 2023-12-11 17:49:59.656 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00013-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  431. 2023-12-11 17:49:59.836 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00013-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]是[orc]类型的文件, 将该文件加入source files列表
  432. 2023-12-11 17:49:59.888 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00014-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  433. 2023-12-11 17:50:00.038 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00014-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]是[orc]类型的文件, 将该文件加入source files列表
  434. 2023-12-11 17:50:00.106 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00015-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  435. 2023-12-11 17:50:00.325 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00015-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]是[orc]类型的文件, 将该文件加入source files列表
  436. 2023-12-11 17:50:00.382 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00016-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  437. 2023-12-11 17:50:00.551 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00016-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]是[orc]类型的文件, 将该文件加入source files列表
  438. 2023-12-11 17:50:00.606 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00017-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  439. 2023-12-11 17:50:00.769 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00017-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]是[orc]类型的文件, 将该文件加入source files列表
  440. 2023-12-11 17:50:00.826 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00018-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  441. 2023-12-11 17:50:00.969 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00018-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]是[orc]类型的文件, 将该文件加入source files列表
  442. 2023-12-11 17:50:01.021 [job-0] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00019-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  443. 2023-12-11 17:50:01.165 [job-0] INFO  HdfsReader$Job - [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00019-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]是[orc]类型的文件, 将该文件加入source files列表
  444. 2023-12-11 17:50:01.165 [job-0] INFO  HdfsReader$Job - 您即将读取的文件数为: [20], 列表为: [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00018-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00001-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00003-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00017-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00009-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00011-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00015-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00016-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00010-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00013-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00007-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00012-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00014-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00006-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00000-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00019-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc,cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00005-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]
  445. 2023-12-11 17:50:01.166 [job-0] INFO  JobContainer - DataX Writer.Job [starrockswriter] do prepare work .
  446. 2023-12-11 17:50:01.168 [job-0] INFO  JobContainer - jobContainer starts to do split ...
  447. 2023-12-11 17:50:01.168 [job-0] INFO  JobContainer - Job set Channel-Number to 3 channels.
  448. 2023-12-11 17:50:01.168 [job-0] INFO  HdfsReader$Job - split() begin...
  449. 2023-12-11 17:50:01.173 [job-0] INFO  JobContainer - DataX Reader.Job [hdfsreader] splits to [20] tasks.
  450. 2023-12-11 17:50:01.174 [job-0] INFO  JobContainer - DataX Writer.Job [starrockswriter] splits to [20] tasks.
  451. 2023-12-11 17:50:01.200 [job-0] INFO  JobContainer - jobContainer starts to do schedule ...
  452. 2023-12-11 17:50:01.213 [job-0] INFO  JobContainer - Scheduler starts [1] taskGroups.
  453. 2023-12-11 17:50:01.215 [job-0] INFO  JobContainer - Running by standalone Mode.
  454. 2023-12-11 17:50:01.227 [taskGroup-0] INFO  TaskGroupContainer - taskGroupId=[0] start [3] channels for [20] tasks.
  455. 2023-12-11 17:50:01.231 [taskGroup-0] INFO  Channel - Channel set byte_speed_limit to 209715200.
  456. 2023-12-11 17:50:01.232 [taskGroup-0] INFO  Channel - Channel set record_speed_limit to -1, No tps activated.
  457. 2023-12-11 17:50:01.240 [taskGroup-0] INFO  TaskGroupContainer - taskGroup[0] taskId[17] attemptCount[1] is started
  458. 2023-12-11 17:50:01.243 [taskGroup-0] INFO  TaskGroupContainer - taskGroup[0] taskId[0] attemptCount[1] is started
  459. 2023-12-11 17:50:01.244 [taskGroup-0] INFO  TaskGroupContainer - taskGroup[0] taskId[8] attemptCount[1] is started
  460. 2023-12-11 17:50:01.288 [0-0-17-writer] INFO  HostUtils - IP 10.233.76.104 HOSTNAME pose-app-52211-pdc
  461. 2023-12-11 17:50:01.322 [0-0-17-reader] INFO  HdfsReader$Job - hadoopConfig details:{"classLoader":{"URLs":["file:/u/chengken/datax/plugin/reader/hdfsreader/hdfsreader-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jaxb-impl-2.2.3-1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-hcatalog-core-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-auth-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/calcite-linq4j-1.0.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-metastore-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jta-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/activation-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/aircompressor-0.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/aopalliance-1.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-configuration-1.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-web-proxy-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/guice-servlet-3.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-cli-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/calcite-avatica-1.0.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xz-1.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/parquet-hadoop-bundle-1.6.0rc3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jetty-all-7.6.0.v20120127.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-httpclient-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/stax-api-1.0-2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/apacheds-i18n-2.0.0-M15.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/log4j-api-2.17.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/mail-1.4.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/j
axb-api-2.2.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-annotations-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/geronimo-jta_1.1_spec-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/libthrift-0.9.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/oro-2.0.8.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-hdfs-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-jaxrs-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xercesImpl-2.9.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-logging-1.1.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-serde-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-common-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/curator-recipes-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/opencsv-2.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-json-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-service-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/ant-1.9.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/avro-1.7.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-0.20S-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/log4j-1.2.17.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/httpclient-4.1.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-cli-1.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xml-apis-1.3.04.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-digester-1.8.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/servlet-api-2.5.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/lzo-core-1.0.5.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/log4j-core-2.17.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/javacsv-2
.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/protobuf-java-2.5.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-resourcemanager-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-common-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-lang3-3.3.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xmlenc-0.52.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-xc-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-ant-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/pentaho-aggdesigner-algorithm-5.1.5-jhyde.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/apache-log4j-extras-1.2.17.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jetty-util-6.1.26.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-0.23-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/guava-11.0.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/logback-core-1.0.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-scheduler-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-aliyun-2.7.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-dbcp-1.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-compress-1.4.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-common-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-io-2.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/httpcore-4.1.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-mapreduce-client-core-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-core-asl-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-core-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfs
reader/libs/janino-2.7.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jdo-api-3.0.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jsp-api-2.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/annotations-2.0.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/velocity-1.5.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-codec-1.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-applicationhistoryservice-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/netty-3.6.2.Final.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/curator-framework-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/asm-tree-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jetty-6.1.26.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/calcite-core-1.0.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/plugin-unstructured-storage-util-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datax-common-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-beanutils-core-1.8.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/groovy-all-2.1.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-api-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-server-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datanucleus-api-jdo-3.2.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/guice-3.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datanucleus-rdbms-3.2.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/java-xmlbuilder-0.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-common-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-client-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-guice-1.9.jar","file:/u/chengken/datax/plugin/re
ader/hdfsreader/libs/libfb303-0.9.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/paranamer-2.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/antlr-runtime-3.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/zookeeper-3.4.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/geronimo-annotation_1.0_spec-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/ST4-4.0.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jettison-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datanucleus-core-3.2.10.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/api-asn1-api-1.0.0-M20.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/geronimo-jaspic_1.0_spec-1.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/apacheds-kerberos-codec-2.0.0-M15.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hamcrest-core-1.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/asm-commons-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/ant-launcher-1.9.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-exec-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-collections-3.2.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/asm-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/stringtemplate-3.2.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-common-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jline-2.12.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/api-util-1.0.0-M20.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/stax-api-1.0.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-net-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/bonecp-0.8.0.RELEASE.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/gson-2.2.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/slf4j-log4j12-1.7.10.jar","file:/u/che
ngken/datax/plugin/reader/hdfsreader/libs/jets3t-0.9.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-lang-2.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/eigenbase-properties-1.1.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/antlr-2.7.7.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jsch-0.1.42.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/leveldbjni-all-1.8.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-beanutils-1.9.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/logback-classic-1.0.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jpam-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/snappy-java-1.0.4.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-mapper-asl-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-math3-3.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-pool-1.5.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/netty-all-4.0.23.Final.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-compiler-2.7.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-daemon-1.0.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/javax.inject-1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/htrace-core-3.1.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jsr305-3.0.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/slf4j-api-1.7.10.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/fastjson2-2.0.23.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/derby-10.11.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/curator-client-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/cos_api-bundle-5.6.137.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-cos-3.1.0-8.3.2.jar"],"parent":{"URLs":["file:/u/chengken/datax/lib/commo
ns-lang3-3.3.2.jar","file:/u/chengken/datax/lib/logback-core-1.0.13.jar","file:/u/chengken/datax/lib/commons-io-2.4.jar","file:/u/chengken/datax/lib/datax-common-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/lib/hamcrest-core-1.3.jar","file:/u/chengken/datax/lib/datax-transformer-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/lib/logback-classic-1.0.13.jar","file:/u/chengken/datax/lib/commons-math3-3.1.1.jar","file:/u/chengken/datax/lib/slf4j-api-1.7.10.jar","file:/u/chengken/datax/lib/fastjson2-2.0.23.jar","file:/u/chengken/datax/lib/commons-logging-1.1.1.jar","file:/u/chengken/datax/lib/httpcore-4.4.13.jar","file:/u/chengken/datax/lib/commons-configuration-1.10.jar","file:/u/chengken/datax/lib/fluent-hc-4.5.jar","file:/u/chengken/datax/lib/commons-cli-1.2.jar","file:/u/chengken/datax/lib/janino-2.5.16.jar","file:/u/chengken/datax/lib/groovy-all-2.1.9.jar","file:/u/chengken/datax/lib/commons-codec-1.11.jar","file:/u/chengken/datax/lib/commons-collections-3.2.1.jar","file:/u/chengken/datax/lib/commons-lang-2.6.jar","file:/u/chengken/datax/lib/httpclient-4.5.13.jar","file:/u/chengken/datax/lib/commons-beanutils-1.9.2.jar","file:/u/chengken/datax/lib/datax-core-0.0.1-SNAPSHOT.jar","file:/home/svccnetlhs/chengken/starrocks/"],"parent":{"URLs":["file:/opt/software/jdk1.8.0_112/jre/lib/ext/jaccess.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/sunjce_provider.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/sunpkcs11.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/localedata.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/nashorn.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/sunec.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/dnsns.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/zipfs.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/cldrdata.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/jfxrt.jar"]}}},"finalParameters":["mapreduce.job.end-notification.max.retry.interval","mapreduce.job.end-notification.max.attempts"]}
  462. 2023-12-11 17:50:01.324 [0-0-17-reader] INFO  Reader$Task - read start
  463. 2023-12-11 17:50:01.324 [0-0-17-reader] INFO  Reader$Task - reading file : [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]
  464. 2023-12-11 17:50:01.324 [0-0-17-reader] INFO  HdfsReader$Job - Start Read orcfile [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc].
  465. 2023-12-11 17:50:01.326 [0-0-0-reader] INFO  HdfsReader$Job - hadoopConfig details:{"classLoader":{"URLs":["file:/u/chengken/datax/plugin/reader/hdfsreader/hdfsreader-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jaxb-impl-2.2.3-1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-hcatalog-core-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-auth-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/calcite-linq4j-1.0.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-metastore-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jta-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/activation-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/aircompressor-0.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/aopalliance-1.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-configuration-1.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-web-proxy-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/guice-servlet-3.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-cli-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/calcite-avatica-1.0.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xz-1.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/parquet-hadoop-bundle-1.6.0rc3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jetty-all-7.6.0.v20120127.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-httpclient-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/stax-api-1.0-2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/apacheds-i18n-2.0.0-M15.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/log4j-api-2.17.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/mail-1.4.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/ja
xb-api-2.2.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-annotations-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/geronimo-jta_1.1_spec-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/libthrift-0.9.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/oro-2.0.8.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-hdfs-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-jaxrs-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xercesImpl-2.9.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-logging-1.1.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-serde-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-common-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/curator-recipes-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/opencsv-2.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-json-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-service-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/ant-1.9.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/avro-1.7.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-0.20S-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/log4j-1.2.17.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/httpclient-4.1.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-cli-1.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xml-apis-1.3.04.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-digester-1.8.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/servlet-api-2.5.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/lzo-core-1.0.5.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/log4j-core-2.17.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/javacsv-2.
0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/protobuf-java-2.5.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-resourcemanager-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-common-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-lang3-3.3.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/xmlenc-0.52.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-xc-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-ant-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/pentaho-aggdesigner-algorithm-5.1.5-jhyde.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/apache-log4j-extras-1.2.17.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jetty-util-6.1.26.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-0.23-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/guava-11.0.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/logback-core-1.0.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-scheduler-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-aliyun-2.7.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-dbcp-1.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-compress-1.4.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-common-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-shims-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-io-2.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/httpcore-4.1.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-mapreduce-client-core-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-core-asl-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-core-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsr
eader/libs/janino-2.7.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jdo-api-3.0.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jsp-api-2.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/annotations-2.0.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/velocity-1.5.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-codec-1.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-server-applicationhistoryservice-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/netty-3.6.2.Final.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/curator-framework-2.6.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/asm-tree-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jetty-6.1.26.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/calcite-core-1.0.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/plugin-unstructured-storage-util-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datax-common-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-beanutils-core-1.8.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/groovy-all-2.1.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-yarn-api-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-server-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datanucleus-api-jdo-3.2.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/guice-3.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datanucleus-rdbms-3.2.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/java-xmlbuilder-0.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-common-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-client-1.9.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jersey-guice-1.9.jar","file:/u/chengken/datax/plugin/rea
der/hdfsreader/libs/libfb303-0.9.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/paranamer-2.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/antlr-runtime-3.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/zookeeper-3.4.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/geronimo-annotation_1.0_spec-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/ST4-4.0.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jettison-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/datanucleus-core-3.2.10.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/api-asn1-api-1.0.0-M20.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/geronimo-jaspic_1.0_spec-1.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/apacheds-kerberos-codec-2.0.0-M15.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hamcrest-core-1.3.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/asm-commons-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/ant-launcher-1.9.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-exec-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-collections-3.2.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/asm-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/stringtemplate-3.2.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hive-common-1.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jline-2.12.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/api-util-1.0.0-M20.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/stax-api-1.0.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-net-3.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/bonecp-0.8.0.RELEASE.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/gson-2.2.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/slf4j-log4j12-1.7.10.jar","file:/u/chen
gken/datax/plugin/reader/hdfsreader/libs/jets3t-0.9.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-lang-2.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/eigenbase-properties-1.1.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/antlr-2.7.7.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jsch-0.1.42.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/leveldbjni-all-1.8.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-beanutils-1.9.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/logback-classic-1.0.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jpam-1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/snappy-java-1.0.4.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jackson-mapper-asl-1.9.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-math3-3.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-pool-1.5.4.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/netty-all-4.0.23.Final.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-compiler-2.7.6.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/commons-daemon-1.0.13.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/javax.inject-1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/htrace-core-3.1.0-incubating.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/jsr305-3.0.0.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/slf4j-api-1.7.10.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/fastjson2-2.0.23.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/derby-10.11.1.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/curator-client-2.7.1.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/cos_api-bundle-5.6.137.2.jar","file:/u/chengken/datax/plugin/reader/hdfsreader/libs/hadoop-cos-3.1.0-8.3.2.jar"],"parent":{"URLs":["file:/u/chengken/datax/lib/common
s-lang3-3.3.2.jar","file:/u/chengken/datax/lib/logback-core-1.0.13.jar","file:/u/chengken/datax/lib/commons-io-2.4.jar","file:/u/chengken/datax/lib/datax-common-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/lib/hamcrest-core-1.3.jar","file:/u/chengken/datax/lib/datax-transformer-0.0.1-SNAPSHOT.jar","file:/u/chengken/datax/lib/logback-classic-1.0.13.jar","file:/u/chengken/datax/lib/commons-math3-3.1.1.jar","file:/u/chengken/datax/lib/slf4j-api-1.7.10.jar","file:/u/chengken/datax/lib/fastjson2-2.0.23.jar","file:/u/chengken/datax/lib/commons-logging-1.1.1.jar","file:/u/chengken/datax/lib/httpcore-4.4.13.jar","file:/u/chengken/datax/lib/commons-configuration-1.10.jar","file:/u/chengken/datax/lib/fluent-hc-4.5.jar","file:/u/chengken/datax/lib/commons-cli-1.2.jar","file:/u/chengken/datax/lib/janino-2.5.16.jar","file:/u/chengken/datax/lib/groovy-all-2.1.9.jar","file:/u/chengken/datax/lib/commons-codec-1.11.jar","file:/u/chengken/datax/lib/commons-collections-3.2.1.jar","file:/u/chengken/datax/lib/commons-lang-2.6.jar","file:/u/chengken/datax/lib/httpclient-4.5.13.jar","file:/u/chengken/datax/lib/commons-beanutils-1.9.2.jar","file:/u/chengken/datax/lib/datax-core-0.0.1-SNAPSHOT.jar","file:/home/svccnetlhs/chengken/starrocks/"],"parent":{"URLs":["file:/opt/software/jdk1.8.0_112/jre/lib/ext/jaccess.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/sunjce_provider.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/sunpkcs11.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/localedata.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/nashorn.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/sunec.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/dnsns.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/zipfs.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/cldrdata.jar","file:/opt/software/jdk1.8.0_112/jre/lib/ext/jfxrt.jar"]}}},"finalParameters":["mapreduce.job.end-notification.max.retry.interval","mapreduce.job.end-notification.max.attempts"]}
  466. 2023-12-11 17:50:01.326 [0-0-8-reader] INFO  HdfsReader$Job - hadoopConfig details: (identical to the classLoader dump logged in the previous entry; omitted)
  467. 2023-12-11 17:50:01.327 [0-0-0-reader] INFO  Reader$Task - read start
  468. 2023-12-11 17:50:01.327 [0-0-0-reader] INFO  Reader$Task - reading file : [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]
  469. 2023-12-11 17:50:01.328 [0-0-0-reader] INFO  HdfsReader$Job - Start Read orcfile [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc].
  470. 2023-12-11 17:50:01.328 [0-0-8-reader] INFO  Reader$Task - read start
  471. 2023-12-11 17:50:01.328 [0-0-8-reader] INFO  Reader$Task - reading file : [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc]
  472. 2023-12-11 17:50:01.328 [0-0-8-reader] INFO  HdfsReader$Job - Start Read orcfile [cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc].
  473. Dec 11, 2023 5:50:01 PM org.apache.hadoop.hive.ql.log.PerfLogger PerfLogBegin
  474. INFO: <PERFLOG method=OrcGetSplits from=org.apache.hadoop.hive.ql.io.orc.ReaderImpl>
  475. Dec 11, 2023 5:50:01 PM org.apache.hadoop.hive.ql.log.PerfLogger PerfLogBegin
  476. INFO: <PERFLOG method=OrcGetSplits from=org.apache.hadoop.hive.ql.io.orc.ReaderImpl>
  477. Dec 11, 2023 5:50:01 PM org.apache.hadoop.hive.ql.log.PerfLogger PerfLogBegin
  478. INFO: <PERFLOG method=OrcGetSplits from=org.apache.hadoop.hive.ql.io.orc.ReaderImpl>
  479. Dec 11, 2023 5:50:01 PM org.apache.hadoop.conf.Configuration warnOnceIfDeprecated
  480. INFO: mapred.input.dir is deprecated. Instead, use mapreduce.input.fileinputformat.inputdir
  481. 2023-12-11 17:50:01.646 [ORC_GET_SPLITS #1] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  482. 2023-12-11 17:50:01.776 [ORC_GET_SPLITS #1] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  483. 2023-12-11 17:50:01.778 [ORC_GET_SPLITS #1] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  484. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.io.orc.OrcInputFormat generateSplitsInfo
  485. INFO: FooterCacheHitRatio: 0/1
  486. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.log.PerfLogger PerfLogEnd
  487. INFO: </PERFLOG method=OrcGetSplits start=1702288201425 end=1702288202185 duration=760 from=org.apache.hadoop.hive.ql.io.orc.ReaderImpl>
  488. 2023-12-11 17:50:02.288 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  489. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.io.orc.OrcInputFormat generateSplitsInfo
  490. INFO: FooterCacheHitRatio: 0/1
  491. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.log.PerfLogger PerfLogEnd
  492. INFO: </PERFLOG method=OrcGetSplits start=1702288201425 end=1702288202296 duration=871 from=org.apache.hadoop.hive.ql.io.orc.ReaderImpl>
  493. 2023-12-11 17:50:02.369 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  494. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.io.orc.OrcInputFormat generateSplitsInfo
  495. INFO: FooterCacheHitRatio: 0/1
  496. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.log.PerfLogger PerfLogEnd
  497. INFO: </PERFLOG method=OrcGetSplits start=1702288201425 end=1702288202433 duration=1008 from=org.apache.hadoop.hive.ql.io.orc.ReaderImpl>
  498. 2023-12-11 17:50:02.492 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  499. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  500. INFO: min key = null, max key = {originalTxn: 0, bucket: -1, row: 302079}
  501. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  502. INFO: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 3, length: 9223372036854775807}
  503. 2023-12-11 17:50:02.647 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  504. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  505. INFO: min key = null, max key = {originalTxn: 0, bucket: -1, row: 302079}
  506. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  507. INFO: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 3, length: 9223372036854775807}
  508. 2023-12-11 17:50:02.784 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  509. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  510. INFO: min key = null, max key = {originalTxn: 0, bucket: -1, row: 302079}
  511. Dec 11, 2023 5:50:02 PM org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  512. INFO: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 3, length: 9223372036854775807}
  513. 2023-12-11 17:50:02.867 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  514. 2023-12-11 17:50:11.235 [job-0] INFO  StandAloneJobContainerCommunicator - Total 0 records, 0 bytes | Speed 0B/s, 0 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.000s |  All Task WaitReaderTime 0.000s | Percentage 0.00%
  515. 2023-12-11 17:50:21.236 [job-0] INFO  StandAloneJobContainerCommunicator - Total 297312 records, 558177719 bytes | Speed 53.23MB/s, 29731 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.023s |  All Task WaitReaderTime 25.333s | Percentage 0.00%
  516. 2023-12-11 17:50:30.385 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  517. Dec 11, 2023 5:50:30 PM org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  518. INFO: min key = {originalTxn: 0, bucket: -1, row: 302079}, max key = {originalTxn: 0, bucket: -1, row: 599039}
  519. Dec 11, 2023 5:50:30 PM org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  520. INFO: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 156410867, length: 9223372036854775807}
  521. 2023-12-11 17:50:30.731 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  522. 2023-12-11 17:50:30.907 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  523. 2023-12-11 17:50:31.114 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  524. Dec 11, 2023 5:50:31 PM org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  525. INFO: min key = {originalTxn: 0, bucket: -1, row: 302079}, max key = {originalTxn: 0, bucket: -1, row: 599039}
  526. Dec 11, 2023 5:50:31 PM org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  527. INFO: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 156011050, length: 9223372036854775807}
  528. 2023-12-11 17:50:31.237 [job-0] INFO  StandAloneJobContainerCommunicator - Total 598944 records, 1125524739 bytes | Speed 54.11MB/s, 30163 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.040s |  All Task WaitReaderTime 43.878s | Percentage 0.00%
  529. 2023-12-11 17:50:31.280 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  530. Dec 11, 2023 5:50:31 PM org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  531. INFO: min key = {originalTxn: 0, bucket: -1, row: 302079}, max key = {originalTxn: 0, bucket: -1, row: 604159}
  532. Dec 11, 2023 5:50:31 PM org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  533. INFO: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 156411995, length: 9223372036854775807}
  534. 2023-12-11 17:50:31.457 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  535. 2023-12-11 17:50:41.238 [job-0] INFO  StandAloneJobContainerCommunicator - Total 906144 records, 1702990600 bytes | Speed 55.07MB/s, 30720 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.056s |  All Task WaitReaderTime 62.147s | Percentage 0.00%
  536. 2023-12-11 17:50:51.241 [job-0] INFO  StandAloneJobContainerCommunicator - Total 1203104 records, 2261515086 bytes | Speed 53.27MB/s, 29696 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.070s |  All Task WaitReaderTime 92.372s | Percentage 0.00%
  537. 2023-12-11 17:50:56.098 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  538. Dec 11, 2023 5:50:56 PM org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  539. INFO: min key = {originalTxn: 0, bucket: -1, row: 599039}, max key = {originalTxn: 0, bucket: -1, row: 803839}
  540. Dec 11, 2023 5:50:56 PM org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  541. INFO: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 311140141, length: 9223372036854775807}
  542. 2023-12-11 17:50:56.469 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  543. 2023-12-11 17:50:57.564 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  544. Dec 11, 2023 5:50:57 PM org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  545. INFO: min key = {originalTxn: 0, bucket: -1, row: 604159}, max key = {originalTxn: 0, bucket: -1, row: 803839}
  546. Dec 11, 2023 5:50:57 PM org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  547. INFO: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 312563739, length: 9223372036854775807}
  548. 2023-12-11 17:50:57.971 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  549. 2023-12-11 17:50:58.447 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  550. 十二月 11, 2023 5:50:58 下午 org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  551. 信息: min key = {originalTxn: 0, bucket: -1, row: 599039}, max key = {originalTxn: 0, bucket: -1, row: 803839}
  552. 十二月 11, 2023 5:50:58 下午 org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  553. 信息: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 310368886, length: 9223372036854775807}
  554. 2023-12-11 17:50:58.782 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  555. 2023-12-11 17:51:01.242 [job-0] INFO  StandAloneJobContainerCommunicator - Total 1703232 records, 3203376993 bytes | Speed 89.82MB/s, 50012 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.093s |  All Task WaitReaderTime 127.642s | Percentage 0.00%
  556. 2023-12-11 17:51:11.242 [job-0] INFO  StandAloneJobContainerCommunicator - Total 1829984 records, 3440997771 bytes | Speed 22.66MB/s, 12675 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.101s |  All Task WaitReaderTime 138.648s | Percentage 0.00%
  557. 2023-12-11 17:51:15.910 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  558. 十二月 11, 2023 5:51:16 下午 org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  559. 信息: min key = {originalTxn: 0, bucket: -1, row: 803839}, max key = {originalTxn: 0, bucket: -1, row: 1100799}
  560. 十二月 11, 2023 5:51:16 下午 org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  561. 信息: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 417009783, length: 9223372036854775807}
  562. 2023-12-11 17:51:16.305 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  563. 2023-12-11 17:51:17.688 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  564. 十二月 11, 2023 5:51:17 下午 org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  565. 信息: min key = {originalTxn: 0, bucket: -1, row: 803839}, max key = {originalTxn: 0, bucket: -1, row: 1100799}
  566. 十二月 11, 2023 5:51:17 下午 org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  567. 信息: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 415835058, length: 9223372036854775807}
  568. 2023-12-11 17:51:18.017 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  569. 2023-12-11 17:51:18.695 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  570. 十二月 11, 2023 5:51:18 下午 org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  571. 信息: min key = {originalTxn: 0, bucket: -1, row: 803839}, max key = {originalTxn: 0, bucket: -1, row: 1105919}
  572. 十二月 11, 2023 5:51:18 下午 org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  573. 信息: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 415904892, length: 9223372036854775807}
  574. 2023-12-11 17:51:19.066 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  575. 2023-12-11 17:51:21.243 [job-0] INFO  StandAloneJobContainerCommunicator - Total 2342208 records, 4403928517 bytes | Speed 91.83MB/s, 51222 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.126s |  All Task WaitReaderTime 178.482s | Percentage 0.00%
  576. 2023-12-11 17:51:31.245 [job-0] INFO  StandAloneJobContainerCommunicator - Total 2466496 records, 4638515258 bytes | Speed 22.37MB/s, 12428 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.132s |  All Task WaitReaderTime 190.077s | Percentage 0.00%
  577. 2023-12-11 17:51:41.246 [job-0] INFO  StandAloneJobContainerCommunicator - Total 2967264 records, 5579213090 bytes | Speed 89.71MB/s, 50076 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.156s |  All Task WaitReaderTime 229.517s | Percentage 0.00%
  578. 2023-12-11 17:51:41.359 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  579. 十二月 11, 2023 5:51:41 下午 org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  580. 信息: min key = {originalTxn: 0, bucket: -1, row: 1100799}, max key = {originalTxn: 0, bucket: -1, row: 1300479}
  581. 十二月 11, 2023 5:51:41 下午 org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  582. 信息: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 571347891, length: 9223372036854775807}
  583. 2023-12-11 17:51:41.747 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  584. 2023-12-11 17:51:41.998 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  585. 十二月 11, 2023 5:51:42 下午 org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  586. 信息: min key = {originalTxn: 0, bucket: -1, row: 1100799}, max key = {originalTxn: 0, bucket: -1, row: 1300479}
  587. 十二月 11, 2023 5:51:42 下午 org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  588. 信息: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 570444727, length: 9223372036854775807}
  589. 2023-12-11 17:51:42.390 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  590. 2023-12-11 17:51:44.650 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  591. 十二月 11, 2023 5:51:44 下午 org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  592. 信息: min key = {originalTxn: 0, bucket: -1, row: 1105919}, max key = {originalTxn: 0, bucket: -1, row: 1305599}
  593. 十二月 11, 2023 5:51:44 下午 org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  594. 信息: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 572342605, length: 9223372036854775807}
  595. 2023-12-11 17:51:44.978 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  596. 2023-12-11 17:51:51.248 [job-0] INFO  StandAloneJobContainerCommunicator - Total 3307424 records, 6219717845 bytes | Speed 61.08MB/s, 34016 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.172s |  All Task WaitReaderTime 246.926s | Percentage 0.00%
  597. 2023-12-11 17:52:01.250 [job-0] INFO  StandAloneJobContainerCommunicator - Total 3614624 records, 6797953036 bytes | Speed 55.14MB/s, 30720 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.187s |  All Task WaitReaderTime 279.975s | Percentage 0.00%
  598. 2023-12-11 17:52:02.446 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  599. 十二月 11, 2023 5:52:02 下午 org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  600. 信息: min key = {originalTxn: 0, bucket: -1, row: 1300479}, max key = {originalTxn: 0, bucket: -1, row: 1597439}
  601. 十二月 11, 2023 5:52:02 下午 org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  602. 信息: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 674573070, length: 9223372036854775807}
  603. 2023-12-11 17:52:02.859 [0-0-8-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00008-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  604. 2023-12-11 17:52:03.369 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  605. 十二月 11, 2023 5:52:03 下午 org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  606. 信息: min key = {originalTxn: 0, bucket: -1, row: 1300479}, max key = {originalTxn: 0, bucket: -1, row: 1597439}
  607. 十二月 11, 2023 5:52:03 下午 org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  608. 信息: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 673856234, length: 9223372036854775807}
  609. 2023-12-11 17:52:03.751 [0-0-0-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00002-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  610. 2023-12-11 17:52:04.567 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  611. 十二月 11, 2023 5:52:04 下午 org.apache.hadoop.hive.ql.io.orc.OrcRawRecordMerger <init>
  612. 信息: min key = {originalTxn: 0, bucket: -1, row: 1305599}, max key = {originalTxn: 0, bucket: -1, row: 1602559}
  613. 十二月 11, 2023 5:52:04 下午 org.apache.hadoop.hive.ql.io.orc.ReaderImpl rowsOptions
  614. 信息: Reading ORC rows from cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc with {include: null, offset: 675641495, length: 9223372036854775807}
  615. 2023-12-11 17:52:04.881 [0-0-17-reader] INFO  CosNFileSystem - Opening 'cosn://bucketsName/sam/sam_dwd_user_action_cos_d/20231114/part-00004-04661bb7-8d42-4e91-ae8e-47f996111f67-c000.snappy.orc' for reading
  616. 2023-12-11 17:52:11.251 [job-0] INFO  StandAloneJobContainerCommunicator - Total 3906464 records, 7347149221 bytes | Speed 52.38MB/s, 29184 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.202s |  All Task WaitReaderTime 297.463s | Percentage 0.00%
  617. 2023-12-11 17:52:21.252 [job-0] INFO  StandAloneJobContainerCommunicator - Total 4198304 records, 7896134390 bytes | Speed 52.36MB/s, 29184 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.215s |  All Task WaitReaderTime 331.832s | Percentage 0.00%
  618. ...
The data loaded into StarRocks successfully:
  1. [Mon Dec 11 09:56:57 2023]:['default_cluster:cndlopsns']>[192.168.1.121]:[ods] [ADHOC用户集群] > show partitions from ods.buckets_tmp_20231205__sams_cos;
  2. +-------------+---------------+----------------+---------------------+--------------------+--------+--------------+----------------------------------------------------------------------------+-----------------+---------+----------------+---------------+---------------------+--------------------------+----------+------------+
  3. | PartitionId | PartitionName | VisibleVersion | VisibleVersionTime  | VisibleVersionHash | State  | PartitionKey | Range                                                                      | DistributionKey | Buckets | ReplicationNum | StorageMedium | CooldownTime        | LastConsistencyCheckTime | DataSize | IsInMemory |
  4. +-------------+---------------+----------------+---------------------+--------------------+--------+--------------+----------------------------------------------------------------------------+-----------------+---------+----------------+---------------+---------------------+--------------------------+----------+------------+
  5. | 513180658   | p20231114     | 1              | 2023-12-11 09:49:41 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-11-14]; ..types: [DATE]; keys: [2023-11-15]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | 16.3GB   | false      |
  6. | 504724038   | p20231115     | 1              | 2023-12-06 13:47:20 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-11-15]; ..types: [DATE]; keys: [2023-11-16]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  7. | 504722860   | p20231116     | 1              | 2023-12-06 13:47:20 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-11-16]; ..types: [DATE]; keys: [2023-11-17]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  8. | 504721682   | p20231206     | 1              | 2023-12-06 13:47:20 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-12-06]; ..types: [DATE]; keys: [2023-12-07]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  9. | 504722271   | p20231207     | 1              | 2023-12-06 13:47:20 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-12-07]; ..types: [DATE]; keys: [2023-12-08]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  10. | 504720504   | p20231208     | 1              | 2023-12-06 13:47:20 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-12-08]; ..types: [DATE]; keys: [2023-12-09]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  11. | 504721093   | p20231209     | 1              | 2023-12-06 13:47:20 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-12-09]; ..types: [DATE]; keys: [2023-12-10]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  12. | 505174943   | p20231210     | 1              | 2023-12-07 00:14:47 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-12-10]; ..types: [DATE]; keys: [2023-12-11]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  13. | 506968779   | p20231211     | 1              | 2023-12-08 00:04:57 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-12-11]; ..types: [DATE]; keys: [2023-12-12]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  14. | 508936021   | p20231212     | 1              | 2023-12-09 00:11:09 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-12-12]; ..types: [DATE]; keys: [2023-12-13]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  15. | 510079030   | p20231213     | 1              | 2023-12-10 00:05:38 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-12-13]; ..types: [DATE]; keys: [2023-12-14]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  16. | 510795860   | p20231214     | 1              | 2023-12-11 00:07:15 | 0                  | NORMAL | ts           | [types: [DATE]; keys: [2023-12-14]; ..types: [DATE]; keys: [2023-12-15]; ) | uuid            | 196     | 2              | HDD           | 9999-12-31 15:59:59 | NULL                     | .000     | false      |
  17. +-------------+---------------+----------------+---------------------+--------------------+--------+--------------+----------------------------------------------------------------------------+-----------------+---------+----------------+---------------+---------------------+--------------------------+----------+------------+
  18. 12 rows in set (0.002 sec)
  19. [Mon Dec 11 09:56:57 2023]:['default_cluster:cndlopsns']>[192.168.1.121]:[ods] [ADHOC用户集群] >
Queries work as expected:
  1. [Mon Dec 11 09:58:20 2023]:['default_cluster:cndlopsns']>[192.168.1.121]:[ods] [ADHOC用户集群] > select * from ods.buckets_tmp_20231205__sams_cos where ts='2023-11-14' limit 2;
  2. +------------+------------------+------------+--------------------------------------+---------------+---------------+----------------------+----------+---------------------------------------------------+-----------+------------+-------------+------------------+-------------------+----------+--------------------------------------+----------------+---------------+----------------------+--------------+-----------------+----------------------+---------+-------------------+--------------------------+--------------------------------------------------------------------------------------------------------------------+--------------+-------------+----------------+-------------+--------------+------+------------+--------------+-----------+----------+---------+------+----------+---------+----------+-----------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+----------+----------------+---------------+-----------+--------------+------------+
  3. | ts         | uuid             | import_ds_ | unique_action_id                     | action_time   | report_time   | action_type          | ka_id    | action_session_id                                 | wx_app_id | wx_open_id | wx_union_id | external_user_id | merber_id         | local_id | encrypted_imei                       | encrypted_idfa | encrypted_mac | encrypted_android_id | encrypted_qq | encrypted_phone | encrypting_algorithm | chan_id | chan_refer_app_id | chan_shop_id             | chan_shop_name                                                                                                     | client_type  | client_name | client_version | sdk_version | device_model | ip   | user_agent | page_path    | page_name | referrer | address | city | province | country | latitude | longitude | json_properties                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | fdate    | chan_custom_id | etl_load_time | tag_id    | tag_name     | event_name |
  4. +------------+------------------+------------+--------------------------------------+---------------+---------------+----------------------+----------+---------------------------------------------------+-----------+------------+-------------+------------------+-------------------+----------+--------------------------------------+----------------+---------------+----------------------+--------------+-----------------+----------------------+---------+-------------------+--------------------------+--------------------------------------------------------------------------------------------------------------------+--------------+-------------+----------------+-------------+--------------+------+------------+--------------+-----------+----------+---------+------+----------+---------+----------+-----------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+----------+----------------+---------------+-----------+--------------+------------+
  5. | 2023-11-14 | 8325 | 2023111412 | ed55491f-da41-4976-8601-49c4405d994f | 1699936160765 | 1699936132584 | element              | 10001042 | 1699936154050a1b33cce-f249-4112-8db2-288f1f0f6703 | NULL      | NULL       | NULL        | 274012627        | 10742100529761108 | NULL     | 9090 | NULL           | NULL          | NULL                 | NULL         | NULL            | NULL                 | NULL    | NULL              | 9991,6758,4834,6119,9996 | 苏州木渎DC                                     | sams-app-sdk | NULL        | NULL           | NULL        | NULL         | NULL | NULL       | HomeFragment | 首页      | NULL     | NULL    | NULL | NULL     | NULL    | NULL     | NULL      | *****                                                                                                                                                 | 20231114 | default        | NULL          | advantage | 普通会员     | NULL       |
  6. | 2023-11-14 | 8614 | 2023111412 | f30f3672-1bad-478c-ab19-16b736b850ef | 1699936161032 | 1699936133093 | expose_sku_component | 10001042 | 1699936154050a1b33cce-f249-4112-8db2-288f1f0f6703 | NULL      | NULL       | NULL        | 274012627        | 10742100529761108 | NULL     | 9090 | NULL           | NULL          | NULL                 | NULL         | NULL            | NULL                 | NULL    | NULL              | 9991,6758,4834,6119,9996 | 苏州木渎DC                                     | sams-app-sdk | NULL        | NULL           | NULL        | NULL         | NULL | NULL       | HomeFragment | 首页      | NULL     | NULL    | NULL | NULL     | NULL    | NULL     | NULL      | *****                           | 20231114 | default        | NULL          | advantage | 普通会员     | NULL       |
  7. +------------+------------------+------------+--------------------------------------+---------------+---------------+----------------------+----------+---------------------------------------------------+-----------+------------+-------------+------------------+-------------------+----------+--------------------------------------+----------------+---------------+----------------------+--------------+-----------------+----------------------+---------+-------------------+--------------------------+--------------------------------------------------------------------------------------------------------------------+--------------+-------------+----------------+-------------+--------------+------+------------+--------------+-----------+----------+---------+------+----------+---------+----------+-----------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+----------+----------------+---------------+-----------+--------------+------------+
  8. 2 rows in set (1.660 sec)
The throughput is not too slow; this is about what DataX can deliver:
  1. 2023-12-11 17:52:11.251 [job-0] INFO  StandAloneJobContainerCommunicator - Total 3906464 records, 7347149221 bytes | Speed 52.38MB/s, 29184 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.202s |  All Task WaitReaderTime 297.463s | Percentage 0.00%
  2. 2023-12-11 17:52:21.252 [job-0] INFO  StandAloneJobContainerCommunicator - Total 4198304 records, 7896134390 bytes | Speed 52.36MB/s, 29184 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.215s |  All Task WaitReaderTime 331.832s | Percentage 0.00%
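To put these figures in context, the two progress lines above can be turned into a rough time-per-partition estimate. The sketch below parses DataX's "Total N records" progress lines with a regular expression and extrapolates to 2 billion rows, the lower bound per partition given in the requirements. The helper names are illustrative, and the estimate assumes the rate stays constant for the whole run, which was not measured here.

```python
import re
from datetime import datetime

# Two progress lines copied from the job log above (cut off after the byte count).
LOG_LINES = [
    "2023-12-11 17:52:11.251 [job-0] INFO  StandAloneJobContainerCommunicator - Total 3906464 records, 7347149221 bytes",
    "2023-12-11 17:52:21.252 [job-0] INFO  StandAloneJobContainerCommunicator - Total 4198304 records, 7896134390 bytes",
]

# Captures the timestamp, the cumulative record count and the cumulative byte count.
PROGRESS = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d{3}) .*Total (\d+) records, (\d+) bytes"
)


def parse_progress(line):
    """Return (timestamp, records, bytes) for a progress line, or None if it does not match."""
    match = PROGRESS.search(line)
    if match is None:
        return None
    stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S.%f")
    return stamp, int(match.group(2)), int(match.group(3))


def records_per_second(first, last):
    """Average record rate between two parsed progress samples."""
    elapsed = (last[0] - first[0]).total_seconds()
    return (last[1] - first[1]) / elapsed


def hours_for(rows, rate):
    """Hours needed to move `rows` records at a constant `rate` (records/s)."""
    return rows / rate / 3600.0


samples = [parse_progress(line) for line in LOG_LINES]
rate = records_per_second(samples[0], samples[-1])
eta_hours = hours_for(2_000_000_000, rate)  # assumed partition size: 2 billion rows
print(f"{rate:.0f} records/s, about {eta_hours:.1f} hours for 2 billion rows")
```

At roughly 29,000 records/s, a single job of this shape needs on the order of 19 hours for a 2 billion row partition, so any real backfill would have to run several partitions in parallel or raise the channel count.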
 
Notes

Someone asked earlier why the column entries in my JSON use "index":
    "column": [
        {
            "name": "ts",
            "type": "string",
            "value": "2023-11-14"
        },
        {
            "name": "import_ds_",
            "type": "string",
            "index": 0
        },
        {
            "name": "unique_action_id",
            "type": "string",
            "index": 1
        },
        {
            "name": "action_time",
            "type": "string",
            "index": 2
        },
        ....
    ]
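With dozens of columns, writing each "index" by hand is error-prone. Here is a minimal sketch, assuming you already have the ORC field names in file order, that builds a column list of the same shape as the one above. Constant columns such as ts come first with a "value", and every ORC field gets its position as "index". The function name build_columns is hypothetical, and typing every field as "string" is an assumption carried over from the example.

```python
import json


def build_columns(orc_fields, constants):
    """Build an hdfsreader-style column list.

    orc_fields: field names in the order they appear in the ORC file.
    constants:  mapping of column name -> fixed value (not read from the file).
    """
    # Constant columns carry "value" instead of "index".
    columns = [
        {"name": name, "type": "string", "value": value}
        for name, value in constants.items()
    ]
    # File-backed columns carry their zero-based position in the ORC struct.
    columns += [
        {"name": name, "type": "string", "index": position}
        for position, name in enumerate(orc_fields)
    ]
    return columns


orc_fields = ["import_ds_", "unique_action_id", "action_time"]
columns = build_columns(orc_fields, {"ts": "2023-11-14"})
print(json.dumps({"column": columns}, indent=4))
```

Generating the list this way keeps the index values tied to the field order, so adding or removing a field cannot leave a stale index behind.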
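For this column section I first tried the original column1, column2, column3... style, but it did not work, because I have a ts field whose value has to be synthesized. However, if I write the columns like this:
    {
        "name": "ts",
        "type": "string",
        "value": "2023-11-14"
    },
    {
        "name": "import_ds_",
        "type": "string"
    },
    ....
then DataX throws "由于您配置了type, 则至少需要配置 index 或 value" (since type is configured, at least index or value must be configured as well), which is a real headache: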
  1. [svccnetlhs@HOST log]<231211 18:10:01>$ /usr/bin/python2.7 /u/chengken/datax/bin/datax.py --jvm="-Xms1G -Xmx4G" /home/svccnetlhs/chengken/starrocks/json/20231114_sam_dwd_user_action_cos__1702288181.json
  2. DataX (DATAX-OPENSOURCE-3.0), From Alibaba !
  3. Copyright (C) 2010-2017, Alibaba Group. All Rights Reserved.
  4. 2023-12-11 18:10:30.213 [main] INFO  MessageSource - JVM TimeZone: GMT+08:00, Locale: zh_CN
  5. 2023-12-11 18:10:30.215 [main] INFO  MessageSource - use Locale: zh_CN timeZone: sun.util.calendar.ZoneInfo[id="GMT+08:00",offset=28800000,dstSavings=0,useDaylight=false,transitions=0,lastRule=null]
  6. 2023-12-11 18:10:30.226 [main] INFO  VMInfo - VMInfo# operatingSystem class => sun.management.OperatingSystemImpl
  7. 2023-12-11 18:10:30.231 [main] INFO  Engine - the machine info  =>
  8.     osInfo:    Linux amd64 4.18.0-348.7.1.el8_5.x86_64
  9.     jvmInfo:    Oracle Corporation 1.8 25.112-b15
  10.     cpu num:    16
  11.     totalPhysicalMemory:    -0.00G
  12.     freePhysicalMemory:    -0.00G
  13.     maxFileDescriptorCount:    -1
  14.     currentOpenFileDescriptorCount:    -1
  15.     GC Names    [PS MarkSweep, PS Scavenge]
  16.     MEMORY_NAME                    | allocation_size                | init_size                     
  17.     PS Eden Space                  | 1,280.00MB                     | 256.00MB                       
    Code Cache                     | 240.00MB                       | 2.44MB
    Compressed Class Space         | 1,024.00MB                     | 0.00MB
    PS Survivor Space              | 42.50MB                        | 42.50MB
    PS Old Gen                     | 2,731.00MB                     | 683.00MB
    Metaspace                      | -0.00MB                        | 0.00MB
2023-12-11 18:10:30.258 [main] INFO  PerfTrace - PerfTrace traceId=job_-1, isEnable=true
2023-12-11 18:10:30.258 [main] INFO  JobContainer - DataX jobContainer starts job.
2023-12-11 18:10:30.259 [main] INFO  JobContainer - Set jobId = 0
2023-12-11 18:10:30.269 [job-0] INFO  HdfsReader$Job - init() begin...
2023-12-11 18:10:30.274 [job-0] ERROR JobContainer - Exception when job run
com.alibaba.datax.common.exception.DataXException: Code:[HdfsReader-06], Description:[没有 Index].  - 由于您配置了type, 则至少需要配置 index 或 value
    at com.alibaba.datax.common.exception.DataXException.asDataXException(DataXException.java:30) ~[datax-common-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.plugin.reader.hdfsreader.HdfsReader$Job.validateColumns(HdfsReader.java:150) ~[hdfsreader-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.plugin.reader.hdfsreader.HdfsReader$Job.validate(HdfsReader.java:111) ~[hdfsreader-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.plugin.reader.hdfsreader.HdfsReader$Job.init(HdfsReader.java:50) ~[hdfsreader-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.core.job.JobContainer.initJobReader(JobContainer.java:673) ~[datax-core-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.core.job.JobContainer.init(JobContainer.java:303) ~[datax-core-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.core.job.JobContainer.start(JobContainer.java:113) ~[datax-core-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.core.Engine.start(Engine.java:86) [datax-core-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.core.Engine.entry(Engine.java:168) [datax-core-0.0.1-SNAPSHOT.jar:na]
    at com.alibaba.datax.core.Engine.main(Engine.java:201) [datax-core-0.0.1-SNAPSHOT.jar:na]
2023-12-11 18:10:30.277 [job-0] INFO  StandAloneJobContainerCommunicator - Total 0 records, 0 bytes | Speed 0B/s, 0 records/s | Error 0 records, 0 bytes |  All Task WaitWriterTime 0.000s |  All Task WaitReaderTime 0.000s | Percentage 0.00%
2023-12-11 18:10:30.279 [job-0] ERROR Engine -
经DataX智能分析,该任务最可能的错误原因是:
com.alibaba.datax.common.exception.DataXException: Code:[HdfsReader-06], Description:[没有 Index].  - 由于您配置了type, 则至少需要配置 index 或 value
    at com.alibaba.datax.common.exception.DataXException.asDataXException(DataXException.java:30)
    at com.alibaba.datax.plugin.reader.hdfsreader.HdfsReader$Job.validateColumns(HdfsReader.java:150)
    at com.alibaba.datax.plugin.reader.hdfsreader.HdfsReader$Job.validate(HdfsReader.java:111)
    at com.alibaba.datax.plugin.reader.hdfsreader.HdfsReader$Job.init(HdfsReader.java:50)
    at com.alibaba.datax.core.job.JobContainer.initJobReader(JobContainer.java:673)
    at com.alibaba.datax.core.job.JobContainer.init(JobContainer.java:303)
    at com.alibaba.datax.core.job.JobContainer.start(JobContainer.java:113)
    at com.alibaba.datax.core.Engine.start(Engine.java:86)
    at com.alibaba.datax.core.Engine.entry(Engine.java:168)
    at com.alibaba.datax.core.Engine.main(Engine.java:201)
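The HdfsReader-06 message in the log reads roughly "No index: since you configured type, you must configure at least index or value". Below is a minimal pre-flight check that could catch this before launching a job. It assumes the job file follows the usual DataX layout (job.content[0].reader.parameter.column); the function name and the sample job are hypothetical, and the check only mirrors the rule stated in the error message, not the plugin's full validation.

```python
def find_columns_missing_index(job):
    """Return positions of reader columns that set "type" but neither "index" nor "value"."""
    columns = job["job"]["content"][0]["reader"]["parameter"]["column"]
    bad = []
    for pos, col in enumerate(columns):
        # Plain string entries (for example "*") carry no type, so the rule does not apply.
        if not isinstance(col, dict):
            continue
        if "type" in col and "index" not in col and "value" not in col:
            bad.append(pos)
    return bad


# Hypothetical job fragment: the second column reproduces the mistake from the log.
sample_job = {
    "job": {
        "content": [
            {
                "reader": {
                    "name": "hdfsreader",
                    "parameter": {
                        "column": [
                            {"index": 0, "type": "long"},
                            {"type": "string"},
                            {"type": "string", "value": "constant"},
                        ]
                    },
                }
            }
        ]
    }
}

print(find_columns_missing_index(sample_job))
```

For a real job file, load it with json.load and pass the resulting dict to the function; an empty list means every typed column also has an index or a value.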
So how do I get the exact index of each field?
I used the orc-tools-1.8.0-uber.jar package to dump the field list from the ORC file first.
Download: https://repo1.maven.org/maven2/org/apache/orc/orc-tools/
  java -jar orc-tools-1.8.0-uber.jar meta <ORC file>
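The ORC file's metadata is parsed successfully. In the output, the Type: struct<...> line lists the columns in file order, and a column's position in that list is the index to use in the hdfsreader column config (hdfsreader counts index from 0).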
log4j:WARN No appenders could be found for logger (org.apache.hadoop.util.Shell).
log4j:WARN Please initialize the log4j system properly.
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
Processing data file part-01291-c9231ae0-6186-4b4a-83dc-c95521cf2b8d-c000 [length: 290734159]
Structure for part-01291-c9231ae0-6186-4b4a-83dc-c95521cf2b8d-c000
File Version: 0.12 with ORC_14 by ORC Java 1.6.14
Rows: 849920
Compression: SNAPPY
Compression size: 131072
Calendar: Julian/Gregorian
Type: struct<import_ds_:int,unique_action_id:string,action_time:bigint,report_time:bigint,action_type:string,ka_id:bigint,action_session_id:string,uuid:string,wx_app_id:string,wx_open_id:string,wx_union_id:string,external_user_id:string,merber_id:string,local_id:string,encrypted_imei:string,encrypted_idfa:string,encrypted_mac:string,encrypted_android_id:string,encrypted_qq:string,encrypted_phone:string,encrypting_algorithm:string,chan_id:string,chan_refer_app_id:string,chan_shop_id:string,chan_shop_name:string,client_type:string,client_name:string,client_version:string,sdk_version:string,device_model:string,ip:string,user_agent:string,page_path:string,page_name:string,referrer:string,address:string,city:string,province:string,country:string,latitude:string,longitude:string,json_properties:string,fdate:string,tag_id:string,tag_name:string,chan_custom_id:string,etl_load_time:string>
Stripe Statistics:
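With 47 columns, writing the index entries by hand is error-prone, so the Type line can be converted mechanically. The following is a rough sketch, not part of the original workflow: the function name is hypothetical, the type mapping is an assumption based on hdfsreader's documented column types (long, double, string, boolean, date), and splitting on commas only works for a flat struct of primitive types like the one shown here.

```python
import json
import re

# Assumed mapping from ORC primitive types to hdfsreader column types;
# extend it for other types your files contain.
ORC_TO_DATAX = {
    "tinyint": "long", "smallint": "long", "int": "long", "bigint": "long",
    "float": "double", "double": "double",
    "string": "string",
    "boolean": "boolean",
    "date": "date", "timestamp": "date",
}


def struct_to_columns(type_line):
    """Turn 'Type: struct<a:int,b:string>' into [(name, {"index": i, "type": t}), ...]."""
    match = re.search(r"struct<(.*)>", type_line)
    if match is None:
        raise ValueError("no struct<...> found in: " + type_line)
    columns = []
    # Splitting on "," is only safe for flat structs of primitive types.
    for index, field in enumerate(match.group(1).split(",")):
        name, orc_type = field.split(":", 1)
        # An unmapped type raises KeyError rather than being guessed.
        columns.append((name, {"index": index, "type": ORC_TO_DATAX[orc_type]}))
    return columns


# Shortened version of the Type line printed by orc-tools.
type_line = "Type: struct<import_ds_:int,unique_action_id:string,action_time:bigint>"
columns = struct_to_columns(type_line)

# Print only the entries, ready to paste into the reader's "column" array.
print(json.dumps([entry for _, entry in columns], indent=2))
```

The field names are kept alongside each entry so the generated list can be checked against the StarRocks table's column order before it goes into the job file.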
 
 
End.