Options
Regular-pattern-sensitive CRFs for Distant Label Interactions
Papay, Sean; Klinger, Roman; Padó, Sebastian (2025): Regular-pattern-sensitive CRFs for Distant Label Interactions, in: Hao Fei, Kewei Tu, Yuhui Zhang, u. a. (Hrsg.), Proceedings of the 1st Joint Workshop on Large Language Models and Structure Modeling (XLLM 2025), Association for Computational Linguistics, S. 26–35, doi: 10.18653/v1/2025.xllm-1.4.
Faculty/Chair:
Author:
Title of the compilation:
Proceedings of the 1st Joint Workshop on Large Language Models and Structure Modeling (XLLM 2025)
Editors:
Fei, Hao
Tu, Kewei
Zhang, Yuhui
Hu, Xiang
Han, Wenjuan
Jia, Zixia
Zheng, Zilong
Cao, Yixin
Zhang, Meishan
Lu, Wei
Siddharth, N.
Zhang, Yue
Conference:
1st Joint Workshop on Large Language Models and Structure Modeling
Publisher Information:
Year of publication:
2025
Pages:
ISBN:
979-8-89176-286-2
Language:
English
Abstract:
While LLMs have grown popular in sequence labeling, linear-chain conditional random fields (CRFs) remain a popular alternative with the ability to directly model interactions between labels. However, the Markov assumption limits them to interactions between adjacent labels. Weighted finite state transducers (FSTs), in contrast, can model distant label–label interactions, but exact label inference is intractable in general. In this work, we present regular-pattern-sensitive CRFs (RPCRFs), a method of enriching standard linear-chain CRFs with the ability to learn long-distance label interactions through userspecified patterns. This approach allows users to write regular-expression label patterns concisely specifying which types of interactions the model should take into account, allowing the model to learn from data whether and in which contexts these patterns occur. The result can be interpreted alternatively as a CRF augmented with additional, non-local potentials, or as a finite-state transducer whose structure is defined by a set of easily-interpretable patterns. Critically, exact training and inference are tractable for many pattern sets. We detail how an RPCRF can be automatically constructed from a set of user-specified patterns, and demonstrate the model’s effectiveness on a sequence of three synthetic sequence modeling datasets.
Keywords:
-
Peer Reviewed:
Yes:
International Distribution:
Yes:
Open Access Journal:
Yes:
Type:
Conferenceobject
Activation date:
August 7, 2025
Versioning
Question on publication
Permalink
https://fis.uni-bamberg.de/handle/uniba/109547