
Genomic Data Processing 28: Running avocado

Published: 2021-03-06 18:06:41 | Category: Big Data | Source: compiled from the web

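Note that when running avocado from the command line, the reference (fa) and reads (fq) inputs are HDFS paths, whereas the properties file is a local path: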

hadoop@Master:~/xubo/data/testTools/se$ avocado-submit /xubo/avocado/hs1.fq /xubo/avocado/hs38DH.fa /xubo/avocado/test20160527 /home/hadoop/cloud/avocado/basic.properties
Using SPARK_SUBMIT=/home/hadoop/cloud/spark-1.5.2//bin/spark-submit
Loading reads in from /xubo/avocado/hs1.fq
[Stage 8:>                                                          (0 + 2) / 4]SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder".
SLF4J: Defaulting to no-operation (NOP) logger implementation
SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details.
hadoop@Master:~/xubo/data/testTools/se$ hadoop fs -ls /xubo/avocado/test20160527
Found 7 items
-rw-r--r--   3 hadoop supergroup          0 2016-05-27 22:32 /xubo/avocado/test20160527/_SUCCESS
-rw-r--r--   3 hadoop supergroup      13367 2016-05-27 22:32 /xubo/avocado/test20160527/_common_metadata
-rw-r--r--   3 hadoop supergroup      13367 2016-05-27 22:32 /xubo/avocado/test20160527/_metadata
-rw-r--r--   3 hadoop supergroup      13367 2016-05-27 22:31 /xubo/avocado/test20160527/part-r-00000.gz.parquet
-rw-r--r--   3 hadoop supergroup      13367 2016-05-27 22:31 /xubo/avocado/test20160527/part-r-00001.gz.parquet
-rw-r--r--   3 hadoop supergroup      13367 2016-05-27 22:32 /xubo/avocado/test20160527/part-r-00002.gz.parquet
-rw-r--r--   3 hadoop supergroup      13367 2016-05-27 22:31 /xubo/avocado/test20160527/part-r-00003.gz.parquet
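
The path convention above can be checked before a job is submitted. What follows is a minimal sketch and not part of avocado itself: the function names `hdfs_exists`, `run_avocado`, and `avocado_succeeded` and the `AVOCADO_SUBMIT` override are hypothetical, introduced here for illustration. It assumes `hadoop` and `avocado-submit` are on the PATH, as in the session above.

```bash
#!/usr/bin/env bash
# Hypothetical helpers (not shipped with avocado).

# Returns 0 if the given path exists on HDFS.
hdfs_exists() {
  hadoop fs -test -e "$1"
}

# Checks the path conventions, then calls avocado-submit with the four
# positional arguments READS REFERENCE VARIANTS CONFIG.
run_avocado() {
  local reads="$1" reference="$2" variants="$3" config="$4"

  # READS and REFERENCE must be HDFS paths.
  hdfs_exists "$reads"     || { echo "missing on HDFS: $reads" >&2; return 1; }
  hdfs_exists "$reference" || { echo "missing on HDFS: $reference" >&2; return 1; }

  # CONFIG must be a file on the local filesystem.
  [ -f "$config" ] || { echo "missing local config: $config" >&2; return 1; }

  # Assumption: AVOCADO_SUBMIT is our own override (e.g. `echo` for a dry run).
  "${AVOCADO_SUBMIT:-avocado-submit}" "$reads" "$reference" "$variants" "$config"
}

# Returns 0 if the output directory contains the _SUCCESS marker
# seen in the listing above.
avocado_succeeded() {
  hdfs_exists "$1/_SUCCESS"
}
```

With these definitions, `run_avocado /xubo/avocado/hs1.fq /xubo/avocado/hs38DH.fa /xubo/avocado/test20160527 /home/hadoop/cloud/avocado/basic.properties` followed by `avocado_succeeded /xubo/avocado/test20160527` would mirror the session shown above.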

For the full list of arguments, see the usage output of avocado-submit:

hadoop@Master:~/xubo/data/testTools/se$ avocado-submit 
Using SPARK_SUBMIT=/home/hadoop/cloud/spark-1.5.2//bin/spark-submit
Argument "READS" is required
 READS                                                           : ADAM read-oriented data
 REFERENCE                                                       : ADAM or FASTA reference genome data
 VARIANTS                                                        : ADAM variant output
 CONFIG                                                          : avocado configuration file
 -debug                                                          : If set, prints a higher level of debug output.
 -fragment_length N                                              : Sets maximum fragment length. Default value is 10,000. Values greater than 1e9 should be avoided.
 -h (-help,--help,-?)                                            : Print help
 -parquet_block_size N                                           : Parquet block size (default = 128mb)
 -parquet_compression_codec [UNCOMPRESSED | SNAPPY | GZIP | LZO] : Parquet compression codec
 -parquet_disable_dictionary                                     : Disable dictionary encoding
 -parquet_logging_level VAL                                      : Parquet logging level (default = severe)
 -parquet_page_size N                                            : Parquet page size (default = 1mb)
 -print_metrics                                                  : Print metrics to the log on completion
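
The optional flags in the usage output can be combined with the four positional arguments. The sketch below only prints a command line for review; it does not run anything. The function name `build_avocado_cmd` is hypothetical, and placing the options before the positional arguments is an assumption that the usage text does not confirm.

```bash
#!/usr/bin/env bash
# Hypothetical helper: prints an avocado-submit command line that uses two of
# the optional flags listed in the usage output.
# Assumption: options may precede the positional arguments.
build_avocado_cmd() {
  local codec="$1" reads="$2" reference="$3" variants="$4" config="$5"

  # Accept only the codecs named in the usage text.
  case "$codec" in
    UNCOMPRESSED|SNAPPY|GZIP|LZO) ;;
    *) echo "unsupported codec: $codec" >&2; return 1 ;;
  esac

  # Options first, then READS REFERENCE VARIANTS CONFIG.
  printf '%s ' avocado-submit \
    -parquet_compression_codec "$codec" \
    -print_metrics
  printf '%s %s %s %s\n' "$reads" "$reference" "$variants" "$config"
}
```

For example, `build_avocado_cmd SNAPPY /xubo/avocado/hs1.fq /xubo/avocado/hs38DH.fa /xubo/avocado/test20160527 /home/hadoop/cloud/avocado/basic.properties` prints the command for inspection before it is run by hand. Paths containing spaces are not quoted in the printed line.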

References:
[1] https://github.com/bigdatagenomics/avocado/issues/152
[2] https://github.com/bigdatagenomics/avocado/

