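Prepare the genome file, i.e. hg19.fa, then run bowtie2-build (part of bowtie2).
# hg19.fa: genome FASTA file
# hg19: index prefix
# bowtie2-build: builds the index files for the reference genome
$ bowtie2-build hg19/hg19.fa hg19/hg19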
Settings:
Output files: 'hg19/hg19.*.bt2'
Line rate: 6 (line is 64 bytes)
Lines per side: 1 (side is 64 bytes)
Offset rate: 4 (one in 16)
FTable chars: 10
Strings: unpacked
Max bucket size: default
Max bucket size, sqrt multiplier: default
Max bucket size, len divisor: 4
Difference-cover sample period: 1024
Endianness: little
Actual local endianness: little
Sanity checking: disabled
Assertions: disabled
Random seed: 0
Sizeofs: void*:8, int:4, long:8, size_t:8
Input files DNA, FASTA:
hg19/hg19.fa
Building a SMALL index
Reading reference sizes
Time reading reference sizes: 00:00:17
Calculating joined length
Writing header
Reserving space for joined string
Joining reference sequences
Time to join reference sequences: 00:00:12
bmax according to bmaxDivN setting: 715331782
Using parameters --bmax 536498837 --dcv 1024
Doing ahead-of-time memory usage test
Passed! Constructing with these parameters: --bmax 536498837 --dcv 1024
Constructing suffix-array element generator
Building DifferenceCoverSample
Building sPrime
Building sPrimeOrder
V-Sorting samples
V-Sorting samples time: 00:01:22
Allocating rank array
Ranking v-sort output
Ranking v-sort output time: 00:00:24
Invoking Larsson-Sadakane on ranks
Invoking Larsson-Sadakane on ranks time: 00:00:43
Sanity-checking and returning
Building samples
Reserving space for 12 sample suffixes
Generating random suffixes
QSorting 12 sample offsets, eliminating duplicates
QSorting sample offsets, eliminating duplicates time: 00:00:00
Multikey QSorting 12 samples
(Using difference cover)
Multikey QSorting samples time: 00:00:00
Calculating bucket sizes
Splitting and merging
Splitting and merging time: 00:00:00
Avg bucket size: 2.86133e+09 (target: 536498836)
Converting suffix-array elements to index image
Allocating ftab, absorbFtab
Entering Ebwt loop
Getting block 1 of 1
No samples; assembling all-inclusive block
Sorting block of length 2861327131 for bucket 1
(Using difference cover)
Sorting block time: 00:44:25
Returning block of 2861327132 for bucket 1
Exited Ebwt loop
fchr[A]: 0
fchr[C]: 844862932
fchr[G]: 1429875684
fchr[T]: 2015233940
fchr[$]: 2861327131
Exiting Ebwt::buildToDisk()
Returning from initFromVector
Wrote 957974502 bytes to primary EBWT file: hg19/hg19.1.bt2
Wrote 715331788 bytes to secondary EBWT file: hg19/hg19.2.bt2
Re-opening _in1 and _in2 as input streams
Returning from Ebwt constructor
Headers:
len: 2861327131
bwtLen: 2861327132
sz: 715331783
bwtSz: 715331783
lineRate: 6
offRate: 4
offMask: 0xfffffff0
ftabChars: 10
eftabLen: 20
eftabSz: 80
ftabLen: 1048577
ftabSz: 4194308
offsLen: 178832946
offsSz: 715331784
lineSz: 64
sideSz: 64
sideBwtSz: 48
sideBwtLen: 192
numSides: 14902746
numLines: 14902746
ebwtTotLen: 953775744
ebwtTotSz: 953775744
color: 0
reverse: 0
Total time for call to driver() for forward index: 00:56:27
Reading reference sizes
Time reading reference sizes: 00:00:13
Calculating joined length
Writing header
Reserving space for joined string
Joining reference sequences
Time to join reference sequences: 00:00:11
Time to reverse reference sequence: 00:00:02
bmax according to bmaxDivN setting: 715331782
Using parameters --bmax 536498837 --dcv 1024
Doing ahead-of-time memory usage test
Passed! Constructing with these parameters: --bmax 536498837 --dcv 1024
Constructing suffix-array element generator
Building DifferenceCoverSample
Building sPrime
Building sPrimeOrder
V-Sorting samples
V-Sorting samples time: 00:01:29
Allocating rank array
Ranking v-sort output
Ranking v-sort output time: 00:00:28
Invoking Larsson-Sadakane on ranks
Invoking Larsson-Sadakane on ranks time: 00:00:42
Sanity-checking and returning
Building samples
Reserving space for 12 sample suffixes
Generating random suffixes
QSorting 12 sample offsets, eliminating duplicates
QSorting sample offsets, eliminating duplicates time: 00:00:00
Multikey QSorting 12 samples
(Using difference cover)
Multikey QSorting samples time: 00:00:00
Calculating bucket sizes
Splitting and merging
Splitting and merging time: 00:00:00
Avg bucket size: 2.86133e+09 (target: 536498836)
Converting suffix-array elements to index image
Allocating ftab, absorbFtab
Entering Ebwt loop
Getting block 1 of 1
No samples; assembling all-inclusive block
Sorting block of length 2861327131 for bucket 1
(Using difference cover)
Sorting block time: 00:44:02
Returning block of 2861327132 for bucket 1
Exited Ebwt loop
fchr[A]: 0
fchr[C]: 844862932
fchr[G]: 1429875684
fchr[T]: 2015233940
fchr[$]: 2861327131
Exiting Ebwt::buildToDisk()
Returning from initFromVector
Wrote 957974502 bytes to primary EBWT file: hg19/hg19.rev.1.bt2
Wrote 715331788 bytes to secondary EBWT file: hg19/hg19.rev.2.bt2
Re-opening _in1 and _in2 as input streams
Returning from Ebwt constructor
Headers:
len: 2861327131
bwtLen: 2861327132
sz: 715331783
bwtSz: 715331783
lineRate: 6
offRate: 4
offMask: 0xfffffff0
ftabChars: 10
eftabLen: 20
eftabSz: 80
ftabLen: 1048577
ftabSz: 4194308
offsLen: 178832946
offsSz: 715331784
lineSz: 64
sideSz: 64
sideBwtSz: 48
sideBwtLen: 192
numSides: 14902746
numLines: 14902746
ebwtTotLen: 953775744
ebwtTotSz: 953775744
color: 0
reverse: 1
Total time for backward call to driver() for mirror index: 00:56:19
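Each of the two passes (forward index and mirror index) takes about an hour, so the whole build takes close to two hours.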
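To check the total from the log rather than by eye, the two "Total time for ..." lines can be added up. The following is a minimal sketch, assuming the bowtie2-build output was saved to a file; the function name total_build_time and the file name build.log are hypothetical and not part of bowtie2.

```shell
# Sum the HH:MM:SS values on the "Total time for ..." lines of a saved
# bowtie2-build log and print the result as HH:MM:SS.
# total_build_time is a hypothetical helper, not a bowtie2 command.
total_build_time() {
    awk '/Total time for/ {
             split($NF, t, ":")                      # last field is HH:MM:SS
             secs += t[1] * 3600 + t[2] * 60 + t[3]  # accumulate in seconds
         }
         END {
             printf "%02d:%02d:%02d\n", int(secs / 3600), int((secs % 3600) / 60), secs % 60
         }' "$1"
}

# Example: total_build_time build.log
```

For the log above, the two lines are 00:56:27 and 00:56:19, which add up to 01:52:46.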
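Result:
Once the build has finished, keep the index files; later runs can use them directly without rebuilding. Use pwd to confirm the path of the directory that holds them.
# Copy the prebuilt bowtie2 index directly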
cp /home/training58/hicpro/hg19.*bt2 ./hg19
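After copying, it is worth confirming that the index is complete before aligning against it. The sketch below assumes a small index (the log above reports "Building a SMALL index"), which consists of six files ending in .1.bt2, .2.bt2, .3.bt2, .4.bt2, .rev.1.bt2 and .rev.2.bt2; a large index uses the .bt2l extension instead and would need a different check. The function name check_bt2_index is hypothetical.

```shell
# Check that all six files of a small bowtie2 index exist and are non-empty.
# Argument: the index prefix, e.g. hg19/hg19.
# check_bt2_index is a hypothetical helper, not a bowtie2 command.
check_bt2_index() {
    prefix=$1
    missing=0
    for suffix in 1 2 3 4 rev.1 rev.2; do
        if [ ! -s "${prefix}.${suffix}.bt2" ]; then
            echo "missing or empty: ${prefix}.${suffix}.bt2" >&2
            missing=1
        fi
    done
    return "$missing"    # 0 if complete, 1 otherwise
}

# Example: check_bt2_index ./hg19/hg19 && echo "index looks complete"
```

This only checks that the files are present; it does not verify their contents.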