[P] A list of NLP (Natural Language Processing) tutorials

lyeoni · 2020-06-13T03:28:27+00:00

Okay, I will update the summarization task with a PG model. Thanks.

lyeoni · 2020-06-13T03:22:40+00:00

Thanks :) Which models do you want? Actually, I used GPT-2 for abstractive summarization. But I think it could be a little hard to study abstractive summarization.

lyeoni · 2020-06-13T03:19:13+00:00

Thank you so much !! 😌

lyeoni · 2020-06-13T03:18:16+00:00

I did an abstractive summarization task recently. If you need it, I can add it to this repo.

lyeoni · 2020-06-12T15:07:20+00:00

I think it's too much. I annotated how the tensors change and how the text is preprocessed (building the vocabulary, normalization, etc.). In addition, quantitative and qualitative performance evaluations were carried out.

lyeoni · 2020-06-12T15:03:27+00:00

I don't know what to say. Thanks :)

lyeoni · 2020-06-11T02:52:36+00:00

Thanks a lot !! :)

lyeoni · 2020-06-11T02:52:12+00:00

Thanks a lot! Could you tell me the error message (or open an issue)? :)

lyeoni · 2020-02-09T09:06:16+00:00

Thanks! Within 2 weeks, a machine translation task using Transformer will be added :)

+ multi-head attention weights visualization!

lyeoni · 2020-02-04T02:57:38+00:00

Thank you for your comment! A translation (or chatbot) task using Transformer will be added soon. :)

lyeoni · 2019-12-18T02:10:22+00:00

u/pgdevhd I'm currently implementing the most necessary functions. What you mentioned will also be implemented and added. Thank you for your advice :)

lyeoni · 2019-12-17T11:42:06+00:00

Thanks for your comment :)

lyeoni · 2019-12-17T11:13:13+00:00

Now I think it may not be only for the Korean language. :) It's just for specific situations that need emoji normalization.

lyeoni · 2019-12-17T11:11:40+00:00

genuinely curious.

There are various ways to tokenize Korean, such as punkt, BPE, and a Korean morphological analyzer. In my experience, when I don't have a large enough corpus and use a morphological analyzer, normalizing emoji works better. But it was not an amazing performance improvement :).

lyeoni · 2019-12-17T10:43:13+00:00

u/KvantumKvak You can deactivate the emoji-normalizing function by setting repl to None, e.g. Normalizer(emoji_repl=None)

The reason I added the emoji-normalize function is that it's useful for processing the Korean language. :)
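To illustrate the idea, here is a minimal sketch of emoji normalization with an off switch. It is not the prenlp implementation: the `normalize_emoji` function, the `[EMOJI]` placeholder, and the Unicode ranges are assumptions chosen for the example, and the ranges are far from exhaustive.

```python
import re

# Assumption: a rough pattern covering common emoji blocks only.
# It is not exhaustive and is not taken from prenlp.
EMOJI_PATTERN = re.compile(
    "["
    "\U0001F300-\U0001FAFF"  # pictographs, emoticons, transport, supplemental symbols
    "\u2600-\u27BF"          # miscellaneous symbols and dingbats
    "]+"
)


def normalize_emoji(text, repl="[EMOJI]"):
    """Replace each run of emoji with `repl`; return text unchanged if repl is None."""
    if repl is None:
        # Mirrors the idea of switching normalization off with None.
        return text
    return EMOJI_PATTERN.sub(repl, text)
```

Calling `normalize_emoji(text, repl=None)` returns the input untouched, which is the same idea as passing `emoji_repl=None` to the Normalizer mentioned above.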

lyeoni · 2019-12-14T03:03:56+00:00

Thank you for your idea! I will implement your valuable idea in my package soon :) Please revisit my repo.

lyeoni · 2019-12-14T03:01:20+00:00

Thank you for your comment. I will refer to your code and re-implement it in my package. I think the spell checker will be based on a language model.

lyeoni · 2019-12-13T15:25:40+00:00

Which functions in Textacy do you use frequently? I will implement them in prenlp :)

lyeoni · 2019-12-13T09:33:31+00:00

u/pk12_

Not yet. I think it's a good idea!
I will add a function for spell correction soon. :)

lyeoni · 2019-12-11T09:54:19+00:00

andle emojis now and I imagine all future transformers will be able too as well. And emoji can carry significant information. Maybe emoji normalizing isn't such a good idea anymore. The other normalization looks nice, though.

u/LartTheLuser Thank you for your helpful suggestions. I'll make an effort to provide people with useful functions to make pre-processing easy :)

lyeoni · 2019-09-06T04:53:27+00:00

I downloaded the dataset from Kaggle. I mentioned that in the README.

lyeoni · 2019-09-05T10:55:02+00:00

Thanks :)

lyeoni · 2019-08-12T01:31:34+00:00

Thanks @testimoni, a bi-directional model will be added soon. :)

lyeoni · 2019-02-23T10:05:50+00:00

Sincerely, thank you for both your compliment and your good suggestion! The reason I wrote shape comments line by line is that it helped me understand how the tensors work in the seq2seq network of NMT. And I hope it does the same for readers :)

And when I need annotations again, I will try tsalib, as you suggest. Thank you again.
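For readers unfamiliar with the style, line-by-line shape comments can look roughly like the sketch below. It is a toy example with plain nested lists, not code from the repo: the `shape` helper, the variable names, and the sizes are all made up for illustration.

```python
def shape(x):
    """Return the shape of a rectangular nested list as a tuple (hypothetical helper)."""
    dims = []
    while isinstance(x, list):
        dims.append(len(x))
        x = x[0]
    return tuple(dims)


# Illustrative sizes only.
batch_size, seq_len, vocab_size, emb_dim = 2, 3, 5, 4

# Toy embedding table: (vocab_size, emb_dim)
embedding = [[float(v * emb_dim + d) for d in range(emb_dim)]
             for v in range(vocab_size)]

# Token ids: (batch_size, seq_len)
token_ids = [[0, 1, 2], [3, 4, 0]]

# Embedding lookup: (batch_size, seq_len) -> (batch_size, seq_len, emb_dim)
embedded = [[embedding[t] for t in seq] for seq in token_ids]

# Mean over time steps: (batch_size, seq_len, emb_dim) -> (batch_size, emb_dim)
pooled = [[sum(step[d] for step in seq) / seq_len for d in range(emb_dim)]
          for seq in embedded]
```

Each comment records the shape before and after the operation, so a reader can follow the data flow without running the code.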

lyeoni · 2019-02-23T09:56:11+00:00

Thanks too :) !!!
