auto_HMM_beta

Author: Marcin Gradowski, Marianna Krysińska

Description

The script auto_HMM_beta.py automates the processing of protein sequences in FASTA format. It splits sequences into shorter and longer ones based on a specified length threshold, clusters similar sequences using CD-HIT, aligns them with Clustal Omega, and builds Hidden Markov Model (HMM) profiles using HMMER.

Requirements

Python 3.6+
Biopython
CD-HIT
Clustal Omega (clustalo)
HMMER (hmmbuild)

Installation

Clone the repository:

git clone https://github.com/username/auto_HMM_beta.git
cd auto_HMM_beta

Install Python dependencies:
```
pip install biopython
```
Ensure the following tools are installed and accessible in your PATH:
```
cd-hit clustalo hmmbuild
```

Usage

python auto_HMM_beta.py

Default parameters:

cutoff (sequence length): 50
cd_hit_threshold (CD-HIT similarity threshold): 0.80
cd_hit_word_length (CD-HIT word length): 5

You can modify these parameters directly in the script or by specifying command-line arguments if implemented.

Directory Structure

/ShortsAndLong/      # Results of sequence splitting
/Clustered/          # CD-HIT clustering outputs
/ClustalO/           # Aligned FASTA files
/HMM/                # Generated HMM profiles
HMM_names.txt        # List of HMM profile names and metadata

License

BSD 2-Clause "Simplified" License

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
ARTF000044_domains.fasta		ARTF000044_domains.fasta
LICENSE		LICENSE
README.md		README.md
auto_HMM_beta.py		auto_HMM_beta.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

auto_HMM_beta

Description

Requirements

Installation

Usage

Directory Structure

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

auto_HMM_beta

Description

Requirements

Installation

Usage

Directory Structure

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages