TY - GEN
T1 - Template Filling for Controllable Commonsense Reasoning
AU - Rajagopal, Dheeraj
AU - Khetan, Vivek
AU - Sacaleanu, Bogdan
AU - Gershman, Anatole
AU - Fano, Andrew
AU - Hovy, Eduard
N1 - Publisher Copyright:
© 2023 Asian Federation of Natural Language Processing.
PY - 2023
Y1 - 2023
N2 - Large-scale sequence-to-sequence models have shown to be adept at both multiple-choice and open-domain commonsense reasoning tasks. However, the current formulations do not provide the ability to control the various attributes of the reasoning chain. To enable better controllability, we propose to study the commonsense reasoning as a template filling task (TemplateCSR) - where the language models fills reasoning templates with the given constraints as control factors. As an approach to TemplateCSR, we (i) propose a dataset of commonsense reasoning template-expansion pairs for healthcare and well-being domain and (ii) introduce ITO, an instruction fine-tuned sequence-to-sequence model that performs commonsense reasoning across concepts in the template. Our experiments show that our approach outperforms baseline both in generation metrics and factuality metrics. We also present a detailed error analysis on our approach's ability to reliably perform template based commonsense reasoning.
AB - Large-scale sequence-to-sequence models have shown to be adept at both multiple-choice and open-domain commonsense reasoning tasks. However, the current formulations do not provide the ability to control the various attributes of the reasoning chain. To enable better controllability, we propose to study the commonsense reasoning as a template filling task (TemplateCSR) - where the language models fills reasoning templates with the given constraints as control factors. As an approach to TemplateCSR, we (i) propose a dataset of commonsense reasoning template-expansion pairs for healthcare and well-being domain and (ii) introduce ITO, an instruction fine-tuned sequence-to-sequence model that performs commonsense reasoning across concepts in the template. Our experiments show that our approach outperforms baseline both in generation metrics and factuality metrics. We also present a detailed error analysis on our approach's ability to reliably perform template based commonsense reasoning.
UR - https://www.scopus.com/pages/publications/85188535422
UR - https://www.scopus.com/inward/citedby.url?scp=85188535422&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85188535422
T3 - IJCNLP-AACL 2023 - 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, Findings of the Association for Computational Linguistics: IJCNLP-AACL 2023
SP - 250
EP - 260
BT - IJCNLP-AACL 2023 - 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, Findings of the Association for Computational Linguistics
A2 - Park, Jong C.
A2 - Arase, Yuki
A2 - Hu, Baotian
A2 - Lu, Wei
A2 - Wijaya, Derry
A2 - Purwarianti, Ayu
A2 - Krisnadhi, Adila Alfa
PB - Association for Computational Linguistics (ACL)
T2 - 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics: Findings of the Association for Computational Linguistic, IJCNLP-AACL 2023
Y2 - 1 November 2023 through 4 November 2023
ER -