A simple python program to split MSAs in FASTA format.
Go to file
Harrison Deng f08e4c7d35
All checks were successful
ydeng/splitmsa/pipeline/head This commit looks good
Pipeline will not fail if version is the same.
2024-07-21 04:03:36 +00:00
.devcontainer Added some extensions for the devcontainer. 2024-07-21 02:54:01 +00:00
.vscode Prepping for implementation of quick and dirty GUI 2023-04-11 16:13:07 -05:00
splitmsa Added '__init__.py' 2023-04-21 13:58:28 -05:00
tests/resources Updated translation mechanism. 2023-03-30 16:16:00 -05:00
.gitignore Regenerated .gitignore. 2023-03-30 16:13:28 -05:00
environment.yml Loosened version requirements for twine. 2024-07-21 04:01:36 +00:00
Jenkinsfile Pipeline will not fail if version is the same. 2024-07-21 04:03:36 +00:00
pyproject.toml Renamed project to "SplitMSA" and added pipeline file 2023-04-11 12:39:04 -05:00
README.md Renamed project to "SplitMSA" and added pipeline file 2023-04-11 12:39:04 -05:00
setup.cfg Loosened version requirements for twine. 2024-07-21 04:01:36 +00:00
setup.py Renamed project to "SplitMSA" and added pipeline file 2023-04-11 12:39:04 -05:00
tox.ini Incomplete translation mechanism 2023-03-27 16:23:12 -05:00

SplitMSA

Simple FASTA file splitter. Capable of batch trimming a large amount of sequences in the form of a MSA in a FASTA file.

Features

  • Split large fasta files that contain a multiple sequence alignment (MSA) into individual genes
  • Trim off stop codon
  • Batch process multiple genes from one MSA
  • Correct gene start and stop locations based on start and stop codon location
  • Catalogues errors that occurred in human-readable CSV file

Planned

  • Translate MSA into amino acids while maintaining alignment (shows type of mutation if frameshift)
  • Simple to use GUI
  • Run without system-wide python install

Use

Command Line

  1. Install python 3
  2. Install biopython
    • Using pip: pip install biopython or pip3 install biopython
    • Using conda: conda install -c conda-forge biopython
  3. Download msa_splitter.py and run with python3 msa_splitter.py
    • python3 msa_splitter.py -h for help