Exploiting Deep Representations for Neural Machine Translation

Posted on 2018-11-07 | Edited on 2018-11-08 | In PaperNotes, EMNLP18

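Motivation: Work in CV and NLP has found that aggregating representations across layers yields good results, so this paper brings that idea into NMT, aggregating information from all layers to improve translation quality. Residual connections already draw on information from multiple layers, but this simple one-step fusion is too “shallow”. Author: Zi-Yi Dou, Zhaop ...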

[Talk] Machine Learning Theory

Posted on 2018-10-19 | Edited on 2018-10-21 | In ReportNotes

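Learning Theory: how to formalize a machine learning problem; what it means for a problem to be “learnable”; how to construct learning algorithms. Introduction to generalization. Aim: collect data -> (train) model -> (predict) unknown data. The model should fit unknown data 胡 ...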

[Talk] Personalized Recommendation

Posted on 2018-10-19 | In ReportNotes

Scenarios: personalized news, ranking, advertising, and conversational recommendation. User Model: User Big Data -> User Representation; Explicit vs Implicit. App - User Model: Query lo ...

Zero-shot User Intent Detection via Capsule Neural Networks

Posted on 2018-10-19 | In PaperNotes

Introduction: The task addressed in this paper is user intent detection.

An Optimization View on Dynamic Routing Between Capsules

Posted on 2018-10-05 | Edited on 2018-10-12 | In PaperNotes

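Intro: This paper appeared at the ICLR 2018 workshop. It is only four pages long and is essentially a discussion piece. The authors propose an interpretation of Dynamic Routing, arguing that it optimizes a clustering loss plus a KL divergence, and from that optimization problem they derive their own, theoretically better-founded solution. Experiments give preliminary evidence that the modified Rou ...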

Fine-Grained Attention Mechanism for NMT

Posted on 2018-10-03 | Edited on 2018-10-12 | In PaperNotes

Motivation: The idea comes from the authors' earlier work on word embeddings (Context-dependent word representation for neural machine translation). They found that each dimension of a word embedding vector, in different ...

Building a Blog with Hexo + GitHub + NexT

Posted on 2018-09-28 | Edited on 2018-10-12 | In Study Notes

This post walks through how I set up this blog; the content is largely a synthesis of various online tutorials.

[Talk] Finding a good division of labor: Linguistics and machine learning in NLP

Posted on 2018-09-26 | In ReportNotes

Dims of deep NLP: Robustness, Accuracy, Ambiguity, Efficiency. Parsing configurations. Efficiency: use packing cell by cell in parse chart; use ubertagging ...

Slim Embedding Layers for Recurrent Neural Language Models

Posted on 2018-09-23 | Edited on 2018-09-25 | In PaperNotes

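Motivation: When training an RNN language model, a large vocabulary is the system bottleneck. This paper proposes a space-compression method that reduces the total parameter count by sharing structured parameters in the input and output embedding layers, while retaining the original representational capacity. Methods: Random Parameter Sharing: Input Embeddin ...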

EMNLP18 Best Paper: Phrase-Based & Neural Unsupervised Machine Translation

Posted on 2018-09-21 | Edited on 2018-09-23 | In PaperNotes, EMNLP18

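Motivation: Use large amounts of monolingual data to improve MT quality. Prior work shares some common principles. Initialization: initialize the MT system with a bilingual dictionary. Language modeling: leverage a strong language model by treating the seq2seq system as a denoising autoencoder ...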