2019, ACL

data: SemEval 2014, SemEval 2014 ABSA, SemEval 2015, SemEval 2016

task: ABSA

propose a span-based extract-then-classify framework, where multiple opinion targets are directly extracted from the sentence under the supervision of target span boundaries, and corresponding polarities are then classified using their span representations.

优点:

  1. 用指针网络选取target,避免了序列标注的搜索空间过大问题
  2. 用span边界+极性的标注方式,解决多极性的target问题

方法

Input:

sentence x =(x1,..., xn) with length n,

Target list T = {t1,..., tm}: each target ti is annotated with its start, end position, and its sentiment polarity

/images/papers/paper0.png

Multi-Target Extractor

/images/papers/paper0-1.png

/images/papers/paper0-2.png

propose an heuristic multi-span decoding algorithm

/images/papers/paper0-3.png

3.3 Polarity Classifier

given a target span r, we calculate a summarized vector v using the attention mechanism (Bahdanau et al., 2014) over tokens in its corresponding bound (si, ej)

/images/papers/paper0-4.png

3.4 Model Variants

Pipeline model:

  • build a multi-target extractor where a BERT encoder is exclusively used.
  • Then, a second backbone network is used to provide contextual sentence vectors for the polarity classifier.
  • Two models are separately trained and combined as a pipeline during inference.

Joint model:

  • Each sentence is fed into a shared BERT backbone network
  • that finally branches into two sibling output layers: one for proposing multiple candidate targets and another for predicting the sentiment polarity over each extracted target.

Collapsed model: combine target span boundaries and sentiment polarities into one label space.

Data

SemEval 2014 ABSA

LAPTOP contains product reviews from the laptop domain in SemEval 2014 ABSA challenges

REST is the union set of the restaurant domain from SemEval 2014, 2015 and 2016

/images/papers/paper0-5.png

效果

Metrices We adopt the precision (P), recall (R), and F1 score

/images/papers/paper0-6.png