Attention Is All You Need 一些好的资料

Attention Is All You Need 一些好的资料

The encoders are all identical in structure (yet they do not share weights). Each one is broken down into two sub-layers:

https://kexue.fm/archives/4765

https://jalammar.github.io/illustrated-transformer/

http://nlp.seas.harvard.edu/2018/04/03/attention.html

https://colab.research.google.com/github/tensorflow/tensor2tensor/blob/master/tensor2tensor/notebooks/hello_t2t.ipynb#scrollTo=r6GPPFy1fL2N
相关阅读:
SQLServer之删除用户自定义数据库用户
 Oracle expdp/impdp导出导入命令及数据库备份
 ORACLE EXP/IMP的使用详解
 《JAVA与模式》之抽象工厂模式
 Oracle中的Temporary tablespace的作用
 Oracle常用函数笔记
 java Map及Map.Entry详解
 LinkedHashMap和HashMap的比较使用
 win7 64系统安装oracle客户端使用PL/SQL Developer工具
 PL/SQL 创建视图语法
原文地址：https://www.cnblogs.com/zle1992/p/10062804.html