Skip to content

[Findings of EMNLP 2024] Code for the paper "On Evaluating Explanation Utility for Human-AI Decision Making in NLP"

License

Notifications You must be signed in to change notification settings

utahnlp/nlp-explanation-utility-guideline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

On Evaluating Explanation Utility for Human-AI Decision Making in NLP

Repository for the EMNLP 2024 Findings paper “On Evaluating Explanation Utility for Human-AI Decision Making in NLP” by Fateme Hashemi Chaleshtori, Atreya Ghosal, Alexander Gill, Purbid Bambroo, and Ana Marasović

Environment Setup

Make a new python3.10 environment:

conda create python=3.10 --name expl_utility

Install the dependencies from the requirement file:

pip3 install -r requirements.txt

Developing Models for Tasks

To finetune Flan-T5-3B on ContractNLI, SciFact-Open, ILDC, and EvidenceInference-v2 datasets:

cd model_development/data
sh download_data.sh
cd ..
sh run_ft.sh <dataset_name> -> [scifact-open, contract-nli, evidenceinference, ildc]  <batch_size>  <epoch>  <learning_rate>  <save_dir> 

Other than dataset_name, the rest of the arguments are optionsl. Example:

sh run_ft.sh scifact-open 2 10 0.00005 ./ftmodels/scifact_open/

Developing Deferral Model

Run the following command to finetune Llama-2-13B on the deferral task:

cd deferral
sh run_def.sh

Prompts for zero-shot and few-shot learning with GPT-4 are provided in the deferral_model directory.

Human Study Templates

You can find Qualtrics templates for different studies under the human_study_templates directory. These templates can help replicate the human studies according to our guidelines.

To quickly view the format of the human studies, follow the links provided in the human_study_templates directory.

About

[Findings of EMNLP 2024] Code for the paper "On Evaluating Explanation Utility for Human-AI Decision Making in NLP"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published