使用Linux版的MEGA构建某一基因家族的基因进化树_系统运维

最近碰到了个需求，让我构建某一基因家族的进化树，并根据进化关系进行相应的分类。这让我想起了之前上课的时候，一个做进化的老师给我们讲过，如果不是纯做进化方向的课题，MEGA完全够用了。由于windows的内存等有限，做几十个基因还凑合，要是上百个基因就吃不消了，于是就想到了用Linux下的MEGA来做。

https://www.megasoftware.net/

由于是二进制文件，直接解压缩，添加到环境变量就可以用了。

具体请看我这篇文章。 https://www.jianshu.com/p/5fd60c818651

上一步我得到了该基因家族的所有基因家族的蛋白序列，然后我用windows下的MEGA的 muscle 算法进行了比较，【align-build alignment-上一步的基因家族蛋白序列-muscle比对-data-export-FASTA format】

最终我得到了比对后的 multiproteins.fasta 文件。

首先进行参数的解读，相比于其他软件，我觉得这款软件比较好理解，也容易上手。

这里的 .mao 文件尤为重要，较为简单的方法是拿到windows下去设置，具体请看组学大讲堂的这篇推送。

https://www.omicsclass.com/article/568

版本信息

MEGA version 10.1.8

For 64-bit Linux

Build 10200331

参数解读

EXAMPLES

This example performs a multiple sequence alignment on codons (it assumes that you have created the file "Clustal_Codon_Alignment.mao" using the prototyper (megaproto). A fasta file with coding data is used as input and the resulting alignment is output in the MEGA format:

This example shows how to construct a neighbor-joining phylogeny for each of a list of sequence data files.

The analysis will be performed for each file listed in "listOfDataFiles.txt" and all results will be written to

the ~/Documents/outputDirectory/ directory:

megacc -a ~/Documents/NJ_Tree_Settings.mao -l ~/Documents/listOfDataFiles.txt -o ~/Documents/outputDirectory/

LIST FORMAT

When using the -l option, each file to be analyzed must be on its own line. For example:

~/Documents/myData/seqData1.fas

~/Documents/myData/seqData2.fas

~/Documents/myData/seqData3.fas

If the analyses are to use a user-provided Newick tree file, then the tree files are given on the same line as the data files, following two pipe characters. For example:

~/Documents/myData/seqData1.fas || ~/Documents/myData/treeFile1.nwk

~/Documents/myData/seqData2.fas || ~/Documents/myData/treeFile2.nwk

~/Documents/myData/seqData3.fas || ~/Documents/myData/treeFile3.nwk

我的最终使用：

下一步我打算用 ggtree 来美化，具体学习情况，我再更新。

进化树在生物学中，用来表示物种之间的进化关系。生物分类学家和进化论者根据各类生物间的亲缘关系的远近，把各类生物安置在有分枝的树状的图表上，简明地表示生物的进化历程和亲缘关系。在进化树上每个叶子结点代表一个物种，如果每一条边都被...

欢迎分享，转载请注明来源：内存溢出

原文地址: http://outofmemory.cn/yw/7642307.html

使用Linux版的MEGA构建某一基因家族的基因进化树

发表评论

评论列表（0条）