NLP Stemming Pipeline

This project demonstrates tokenization and stemming using NLTK's Porter and Lancaster stemmers, containerized with Docker.

Installation

Clone the repository:

git clone https://github.com/learningwithmainsh/Stemming.git
cd Stemming

Project Structure

.
├── Dockerfile
├── requirements.txt
├── stemmer.py
├── README.md

Prerequisites

Ensure you have Docker installed. You can verify by running:

docker --version

Setup and Usage

1. Build the Docker image

docker build -t nlp-stemmer .

2. Run the Docker container

docker run --rm nlp-stemmer

3. Check container logs (optional)

If you want to check logs from a running container:

docker run -d --name nlp-stemmer nlp-stemmer

docker logs nlp-stemmer

Files

Dockerfile: Contains instructions to build the Docker image.
requirements.txt: Lists required Python packages.
stemmer.py: Python script for tokenization and stemming.
README.md: This documentation.

NLTK Stemming Example

The script processes the following sample text:

text = "Running ran easily quickly."

Sample Output

Tokenized Words: ['Running', 'ran', 'easily', 'quickly', '.']

Porter Stemmed Words: ['run', 'ran', 'easili', 'quickli', '.']

Lancaster Stemmed Words: ['run', 'ran', 'easy', 'quick', '.']

Cleanup

To remove all stopped containers and dangling images:

docker system prune -f

Contributing

Feel free to fork this repo and open a pull request with any improvements!

Author

Manish Pandey

Author 👤

Created by Manish Pandey. Feel free to reach out for any queries or collaborations!

Happy Coding! 🚀

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP Stemming Pipeline

Installation

Project Structure

Prerequisites

Setup and Usage

1. Build the Docker image

2. Run the Docker container

3. Check container logs (optional)

Files

NLTK Stemming Example

Sample Output

Cleanup

Contributing

Author

Author 👤

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Dockerfile		Dockerfile
README.md		README.md
requirements.txt		requirements.txt
stemmer.py		stemmer.py

mpandey95/Stemming

Folders and files

Latest commit

History

Repository files navigation

NLP Stemming Pipeline

Installation

Project Structure

Prerequisites

Setup and Usage

1. Build the Docker image

2. Run the Docker container

3. Check container logs (optional)

Files

NLTK Stemming Example

Sample Output

Cleanup

Contributing

Author

Author 👤

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages