EXCGEC: A Benchmark for Edit-Wise Explainable Chinese Grammatical Error Correction

This repository contains the code and data for our AAAI 2025 main paper, "EXCGEC: A Benchmark for Edit-Wise Explainable Chinese Grammatical Error Correction".

This paper introduces the EXGEC task, establishes the EXCGEC benchmark, and provides a comprehensive evaluation suite to advance the study of explainable Grammatical Error Correction.

Features

We propose the EXGEC task and establish the EXCGEC benchmark with a Chinese dataset and comprehensive metrics.
We develop EXGEC baseline models and investigate the performance of various LLMs using the proposed benchmark.
We perform detailed analyses and human evaluation experiments to assess the effectiveness of automatic metrics for error descriptions.

Requirements and Installation

Python version >= 3.10
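
As a quick sanity check before creating the environments, the version requirement above can be verified with a few lines of Python. This is a minimal sketch and not part of the repository; the helper name `meets_minimum` and the constant `MIN_VERSION` are hypothetical.

```python
import sys

# Minimum interpreter version stated in this README.
MIN_VERSION = (3, 10)


def meets_minimum(version_info=sys.version_info, minimum=MIN_VERSION):
    """Return True if the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    found = sys.version.split()[0]
    if meets_minimum():
        print("Python version OK:", found)
    else:
        print("Python >= 3.10 is required; found", found)
```

Running the file with the interpreter you intend to use reports whether it satisfies the requirement. The conda commands in this section pin 3.10.14, which passes the check.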

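# Clone the repository and enter the bundled LLaMA-Factory directory.
git clone https://github.com/THUKElab/EXCGEC.git
cd EXCGEC/LLaMA-Factory

# Evaluation environment.
conda create -n excgec-eval python=3.10.14
conda activate excgec-eval
pip install -r eval_requirements.txt
conda deactivate

# Main environment: installs LLaMA-Factory in editable mode with the metrics extra.
conda create -n excgec python=3.10.14
conda activate excgec
pip install -e ".[metrics]"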

Usage

excgec
├── benchmarks/            # Contains scripts and tools for data processing and benchmark evaluation.
├── evaluation/            # Includes tools and scripts for model evaluation and performance metrics.
├── excgec_generation/     # Holds decoding strategies and related components for EXGEC (Explainable Grammatical Error Correction).
├── exp-cgec/              # Main directory containing executable files for training, fine-tuning, and evaluation.
├── LLaMA-Factory/         # Bundled LLaMA-Factory code, installed during setup and used for fine-tuning.
├── utils/                 # Miscellaneous utility scripts and functions for various tasks.
└── LLM/                   # Stores fine-tuned models and associated code.

To run LoRA fine-tuning on EXCGEC, execute the following from the repository root:

cd exp-cgec
bash excgec_finetuning_lora.sh
