@stumme

LG4AV: Combining Language Models and Graph Neural Networks for Author Verification

, and . Advances in Intelligent Data Analysis XX, page 315--326. Cham, Springer International Publishing, (2022)

Abstract

The verification of document authorships is important in various settings. Researchers are for example judged and compared by the amount and impact of their publications and public figures are confronted by their posts on social media. Therefore, it is important that authorship information in frequently used data sets is correct. The question whether a given document is written by a given author is commonly referred to as authorship verification (AV). While AV is a widely investigated problem in general, only few works consider settings where the documents are short and written in a rather uniform style. This makes most approaches impractical for bibliometric data. Here, authorships of scientific publications have to be verified, often with just abstracts and titles available. To this point, we present LG4AV which combines language models and graph neural networks for authorship verification. By directly feeding the available texts in a pre-trained transformer architecture, our model does not need any hand-crafted stylometric features that are not meaningful in scenarios where the writing style is, at least to some extent, standardized. By the incorporation of a graph neural network structure, our model can benefit from relations between authors that are meaningful with respect to the verification process.

Links and resources

Tags

community

  • @kde-alumni
  • @stumme
  • @stubbemann
  • @dblp
  • @regio
@stumme's tags highlighted