This project implements several metrics for measuring chain-of-thought (CoT) pathologies; see src/metric_* for the implementations. We focus on the following primary metrics:
- Necessity (called Reliance in the code)
- Substitutability (called Internalized in the code)
- Paraphrasability
To set up:

```bash
pip install -r requirements.txt
```
To run a metric, try

```bash
python src/main_batch.py --model=Qwen/Qwen3-0.6B --metric=Reliance --data-hf=GSM8K
```

or

```bash
./test.sh
```
Output is printed to the console and also written as log and JSONL files in log/.
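For instance, a minimal sketch for loading one of the JSONL logs (the filename below is hypothetical; check log/ for the names your run actually produced):

```python
import json

# Filename is hypothetical -- see log/ for the files your run produced.
with open("log/run.jsonl") as f:
    records = [json.loads(line) for line in f if line.strip()]

print(records[0])  # each line is one JSON record
```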
Note: all supported Hugging Face datasets and CoT models are listed in src/config.py; feel free to add to those lists.
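As a sketch, and assuming src/config.py keeps these as plain Python lists (the variable names and entries below are hypothetical; match whatever the file actually defines), adding support might look like:

```python
# src/config.py -- variable names are hypothetical; use the actual ones.
SUPPORTED_HF_DATASETS = [
    "GSM8K",
    "my_org/my_new_dataset",  # newly added dataset
]

SUPPORTED_COT_MODELS = [
    "Qwen/Qwen3-0.6B",
    "meta-llama/Llama-3.2-1B-Instruct",  # newly added model
]
```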
We also support local datasets in data/, such as alpaca_500_samples.json (based on Alpaca).
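To inspect such a file, a quick sketch (the field names are an assumption based on the standard Alpaca schema; verify against the file itself):

```python
import json

with open("data/alpaca_500_samples.json") as f:
    samples = json.load(f)

# Alpaca-style records usually carry "instruction", "input", and "output"
# fields; check the actual file before relying on this.
print(len(samples), samples[0])
```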
To generate graphs:

```bash
python src/plot_metric_logprobs.py --metric-name Transferability --input-path log/input.jsonl --out-dir output
```
If the src/main_batch.py runner is insufficient for your needs, we suggest reading the simpler example.py as a basis for your own code (or the slightly longer src/main.py). Essentially, you will do something like this:
```python
# model_name, metric_name, and question are placeholders here; see
# example.py for a complete, runnable script.
model = CoTModel(model_name)                   # load a supported CoT model
metric = construct_metric(metric_name, model)  # e.g. "Reliance"
response = model.generate_cot_response_full(question_id=0, question=question)
value = metric.evaluate(response)              # scalar score for this response
```
If you would prefer to evaluate pre-generated outputs, you can construct a ModelResponse yourself or call model.do_split; see src/test_model.py for examples, which you can run with pytest.
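As a rough sketch of that path (the do_split signature and arguments below are assumptions; src/test_model.py shows the real usage):

```python
# Illustrative only -- argument names are assumptions; see src/test_model.py.
question = "What is 17 * 24?"
pregenerated = "Let's think step by step. 17 * 24 = 408. The answer is 408."

# do_split presumably splits a pre-generated output into CoT and final answer.
response = model.do_split(question_id=0, question=question, output=pregenerated)
value = metric.evaluate(response)
```

To run the tests:

```bash
pytest src/test_model.py
```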