PRESQUE: Pragmatic Reasoning for Quantifier Semantics

Here is the repository for "Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models" in EMNLP 2023.

Environment

conda env create -f presque.yaml
conda activate presque

Data

data/hvd_quantifier_perception.json: The human perception of 5 quantifiers (no, few, some, most, all).

Configuration: key is the quantifier, value is a list over all percentage values. Each entry in the each percentage value results represents whether the percentage value is selected by an annotator.
Run python gen_human_perception.py for human perception result.

Human Perception of Quantifiers

QuRe

QuRe is a quantifier dataset that includes Wikipedia sentences and human annotation of generalized quantifiers in target percentage mentions in the sentence. It include three splits regarding the difficulty in measuring the target percentage mention.

Fully: the target percentage scope can be fully determined by other contents in the sentence, like One in ten for 10%
Partial: the percentage scope can be narrowed but not determined by the contents (e.g. an incomplete percentage breakdown).
Indeterminable: there is no information in the content of the sentence to determine the percentage scope.

Load QuRe from Huggingface

from datasets import load_dataset

ds = load_dataset("billli/QuRe")

data/QuRe.json: The QuRe dataset

Metadata sample

{
    "orig_sentence": "In order for a steel to be considered stainless it must have a Chromium content of at least 10.5%.", 
    "percentage": "10.50%", 
    "percentage_index": 0, 
    "math_expr": ">=0.105", 
    "quant_sent": "In order for a steel to be considered stainless it must have some Chromium content.", 
    "quantifier": "some", 
    "quantifier_position": 12, 
    "specificity": "unable", 
    "wiki_entity": "List of blade materials", 
    "topics": "metallurgy; steel; composition"
}

orig_sentence: the original sentence appeared in Wikipedia.
percentage: the percentage mentioned in the orig_sentence.
percentage_index: the index of the mentioned percentage in the orig_sentence.
math_expr: the percentage expression generated.
quant_sent: the annotated quantified sentence.
quantifier_position: the position of quantifier mentioned.
specificity: the difficulty of deciphering the percentage scope of the quantifier from the sentence excluding the quantifier.
wiki_entity: the wikipedia entity that includes orig_sentence.
topics: sentence topics.

PRESQUE

PRESQUE is a pragamatic reasoning based quantifier reasoning framework.

Sample Command

cd code
python presque.py --data_file=../data/QuRe.json  --approximate_window=2 --range_window=2 --display_every=100 --model_version=ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli

Parameters

data_file: the data file.
interval: the interval width β.
granularity: the granularity g.
approximate_window: the window size w for the approximation operator (~).
range_window: the window size w for other operators.
out_dir: the directory to save results, default ../result.
note: the note to save for result names.
model_version: the model used to serve the NLI component.
batch_size: the batch size for running experiments.
display_every: the frequency to display sample output.
device: whether to use GPU, default cuda.

Performance of PRESQUE on QuRe Splits

Metrics

Consecutiveness: whether the top K predictions forms a consecutive percentage span.
MSD: minimal scope distance.
F1: the span-level F1 between the consecutive percentage span of the top K predictions and the golden percentage scope.

Reference

@inproceedings{li-etal-2023-pragmatic,
    title = "Pragmatic Reasoning Unlocks Quantifier Semantics for Foundation Models",
    author = "Li, Yiyuan  and
      Menon, Rakesh  and
      Ghosh, Sayan  and
      Srivastava, Shashank",
    editor = "Bouamor, Houda  and
      Pino, Juan  and
      Bali, Kalika",
    booktitle = "Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing",
    month = dec,
    year = "2023",
    address = "Singapore",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.emnlp-main.38",
    pages = "573--591",
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
code		code
data		data
figures		figures
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
presque.yaml		presque.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PRESQUE: Pragmatic Reasoning for Quantifier Semantics

Environment

Data

Human Perception of Quantifiers

QuRe

Load QuRe from Huggingface

PRESQUE

Sample Command

Parameters

Performance of PRESQUE on QuRe Splits

Reference

About

Releases

Packages

Languages

License

Nativeatom/PRESQUE

Folders and files

Latest commit

History

Repository files navigation

PRESQUE: Pragmatic Reasoning for Quantifier Semantics

Environment

Data

Human Perception of Quantifiers

QuRe

Load QuRe from Huggingface

PRESQUE

Sample Command

Parameters

Performance of PRESQUE on QuRe Splits

Reference

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages