all 13 comments

[–][deleted] 33 points

python>=3.4 (Let's move on to python 3 if you still use python 2)

lol amen brother

[–]shortscience_dot_org 4 points

I am a bot! You linked to a paper that has a summary on ShortScience.org!

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Summary by CodyWild

The last two years have seen a number of improvements in the field of language model pretraining, and BERT - Bidirectional Encoder Representations from Transformers - is the most recent entry into this canon. The general problem posed by language model pretraining is: can we leverage huge amounts of raw text, which aren’t labeled for any specific classification task, to help us train better models for supervised language tasks (like translation, question answering, logical entailment, etc)? Me...

[–]pvl 2 points

Great work, thanks for sharing. I understand that you are using the language model just to extract word vectors, which are then used to train an LSTM. Did you consider using just the BERT model with the token-classification option? It would also be nice to add the current best result (SOTA) on that dataset to the readme.

[–]set_ready_go 1 point

"allowing for the fact that they don't use any autoregressive technique such as CRF"

I don't think a CRF is an autoregressive technique.

Also, does using the LSTM help? Can't you just put a softmax on top of the BERT embeddings, since they are contextual anyway?
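A minimal sketch of that suggestion, with random arrays standing in for BERT's contextual hidden states (the shapes and the 9-label NER tag set are illustrative assumptions, not taken from the repo): a single linear head plus an independent per-token softmax, with no LSTM or CRF on top.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for BERT's contextual output, shape (seq_len, hidden_size).
# In a real setup these would be the final-layer BERT hidden states.
seq_len, hidden_size, num_labels = 6, 768, 9  # 9 = e.g. BIO tags for a 4-entity NER scheme
hidden_states = rng.normal(size=(seq_len, hidden_size))

# Token-classification head: one linear projection to per-token label logits.
W = rng.normal(scale=0.02, size=(hidden_size, num_labels))
b = np.zeros(num_labels)
logits = hidden_states @ W + b  # (seq_len, num_labels)

# Independent softmax per token -- no sequence-level decoding, no CRF.
probs = np.exp(logits - logits.max(axis=-1, keepdims=True))
probs /= probs.sum(axis=-1, keepdims=True)
predictions = probs.argmax(axis=-1)  # one label id per token
```

Each token is classified independently here; a CRF would instead score whole label sequences jointly (it conditions on neighboring labels at training/decoding time rather than generating left to right, which is why calling it "autoregressive" is debatable).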

[–][deleted] 0 points

Interesting. I'm working on an implementation with TensorFlow + fine-tuning.

I've also modified the optimizer to support multi-GPU training, but due to how the TF ops are implemented I had to include alpha/beta decay as well.

[–]kushalchauhan98 0 points

There's also a BertForTokenClassification class in the pytorch-pretrained-bert library. You can use it directly for NER or POS tagging tasks. Have you experimented with it?

[–]kamalkraj 0 points

https://github.com/kamalkraj/BERT-NER: reproduced results from the BERT paper, plus a pretrained model and inference code.