Transformers for Natural Language Processing: Build, train, and fine-tune deep neural network architectures for NLP with Python, Hugging Face, and OpenAI's GPT-3, ChatGPT, and GPT-4
P**.
great book
for NLP practitioners who do not want to skim through the entire internet to look for any concept, this is a very good book as it is concise and good in both theory as well as code
D**T
Highly recommended and a must-buy for any serious NLP practitioner
Transformer models have powered recent NLP developments and have completely changed the way NLP problems are now approached. Rothman believes that Industry 4.0 professionals need to be aware of multiple approaches and understand that each has its own pros and cons. However, this book is not designed to explain every single transformer model out there. Instead, it tries to explain enough so that readers have enough knowledge to know how to tackle an NLP problem.There is much to like about this book. The book has 16 chapters and begins by explaining transformers, and exploring interesting ideas such as whether programming is now becoming a sub-domain of NLP. There are also fantastic, and very useful, practical examples on how to work with a Bert tokeniser, conditioning a GPT-2 model, question-answering, pre-training RoBERTa models from scratch, and training a tokeniser. The running NLP tasks online section is also useful. Given this readers own background, the chapter on detecting customer emotions to make predictions was fascinating, and frankly left this reader wanting more!. In todays world where XAI is vitally important, the chapter on interpreting black box transformer models, and the section on using BertViz to show visualisations of the activity of transformer models are key to understanding how models work and to interpret model behaviour.This is one of those books where reading is fine, and the questions section at end of each chapter is useful, but one must really do to gain maximum benefit. The book contain lots of Python code to follow but, to be clear, this book is not geared towards Python beginners - there are plenty of great Packt Python beginners books out there already - hence knowledge of NLP and Python is mandatory.In summary, this book cements Rothman's position as the #1 authority on Transformers, and is absolutely the go-to book for Transformers. Highly recommended and a must-buy for any serious NLP practitioner.
S**R
Great book, poor delivery and packaging.
The book is great and have all recent advance development in Transformers and has practical codes for each concept. Unfortunately the delivery was very poos and package was half open, I really wanted to have this book so I didn't return it but I expected more from delivery and packaging. Perhaps if the book was in another pack even the outside pack if is broken the book can be protected from any damages.
O**O
Badly written, not worth it
The book has a shameful amount of repetition of words and statements. No serious editor would let this pass. Moreover, it does not go into a proper explanation of the concepts at all, which is what you expect if you aim to write a book about it. Disappointing.
G**L
Next-Level NLP for the Ambitious Data Scientist
The media could not be loaded. BLUF: This intermediate-to-advanced text provides a no-holds-barred introduction to transformer architecture and application for NLP (and other) tasks. If you're looking to up your NLP game, this book is for you.PROS:- Helpful background information for computer scientists and data wizards alike- Plenty of graphics and in-depth explanations of the "black boxes" of transformers; especially helpful for the visual learner- Good mix of theory and application with a focus on the latterCONS:- Fairly advanced text; assumes the reader has a certain breadth of subject matter knowledge (definitely not for beginners)
F**R
Very superficial
This is a very superficial introduction to the topic. Instead of providing a deep understanding it rather makes prosaic claims of what a revolution transformers are.
B**R
Not a serious book
the book reads like a compilation of newspaper style articles. there is not much rigour or proper explanation. it seems like a book that is written by copy pasting material from different places without good understanding. full of repetition, even lacking a book-wide reference section. and the reference section for each chapter doesn't have numbers!
A**N
Bad quality paper
The book is great but the paper quality is very bad for the price. Not worth it.
×**×
53:ãã©ã³ã¹ãã©ãŒããŒã«ã€ããŠãïŒåã®æ¬ããGPT-4ã«ç¢ºèªããªãããèªãã§ããã£ãšåäœåçãè«çåŠçã«çè§£ããããšãã§ããŸããã
[2024/10/25æ¹å®]Claude3ãšã®å¯Ÿè©±ïŒ**********User18:40 2024/03/25"ãã©ã¹ãã©ãŒã"ã«ã€ããŠãæ¹ããŠè§£æçé£ç¶å€è«ç(ACVL)ã®èšèã§è©±ããŠã¿ãããåãªã ã©ã説æããïŒïŒãã ããããã§ã¯ãå°ã"GPTã¢ãŒããã¯ãã£"ã«å¯Ÿè±¡ãçµãããSAå±€ãæ§æããç·åœ¢è¿°èªã«ã€ããŠã¯ãçŸè¡ã®åç §å éå£ãšè¿°å®å€ã®èšç®æ¹æ³ã ãã®ãŸãŸ"è¿°å®æ¡ä»¶"ã«ãããåç §å éå£ã¯ ç¥èŠè¿°å®å±€,ç¥èŠè¿°å®å±€,ç¥èŠè¿°å®å±€,ièšæ¶è¿°å®å±€éå£,ièšæ¶è¿°å®å±€éå£(i=1,2,....,m) ã ãããã§ãç¥èŠè¿°èªéå£,ièšæ¶è¿°èªéå£ã®éå£ (i=1,2,....,m)ã¯ãCNNãšåæ§ã述èªã¿ã€ãéå£ã®ã€ã³ã¹ã¿ã³ã¹éå£ã§ãããåæ§ã«ãç¥èŠè¿°èªéå£,ièšæ¶è¿°èªéå£ã®éå£ (i=1,2,....,m)ã¯ã述èªã¿ã€ãéå£ã®ã€ã³ã¹ã¿ã³ã¹éå£ã§ãããâ»ããã§ã¯ããããšèª€ã£ãèšãæ¹ãããŠãããièšæ¶è¿°èªã¯æ£ç¢ºã«ã¯ïŒ«è¿°èªã¿ã€ãã®ã€ã³ã¹ã¿ã³ã¹ã§ãªããièšæ¶è¿°èªã®è¿°å®å€ã«çããè¿°å®å€ãæã€éå»iã§ã®ïŒ«ç¥èŠè¿°èªã述èªã¿ã€ãã®ã€ã³ã¹ã¿ã³ã¹ã ãããããæ··ä¹±ã¯ãªãã ãããièšæ¶è¿°èªãåæ§ã ãâ»ããã¯ãããã²ãŒã·ã§ã³ã§ããCNNãšåæ§ã®åŠçãé©çšããããâ»ããã«ãç³ã¿èŸŒã¿å±€ãšSA(/èªå·±æ³šæ)å±€ãšã®å ±éæ§ããããç¥èŠè¿°èª,ç¥èŠè¿°èª,ç¥èŠè¿°èªãSAç¥èŠè¿°èª 㯠ãã¹ãŠç·åœ¢è¿°èªã§ããããããã¯ã倿(/ç¶æ³åé¡)ãè¡ãªãããæå³ç¥èãç²åŸã»ä¿æããªãããããã®è¿°å®æ¡ä»¶éå£ã¯ãå¶åŸ¡ç¥èã§ãããç¹ã«ãSAç¥èŠè¿°èªã¯ãç²åŸ(/åŸå€©ç)ç¥èããæããªããâ»èšæ¶è¿°èªã¯"ç²åŸ(/åŸå€©ç)ç¥è"ãæããªããâ»ããã¯ãããã²ãŒã·ã§ã³ã«ãããŠãSAç¥èŠè¿°èªã«æŒããéèŠåºŠä¿æ°ã®è²¬ä»»èª€å·®ã¯ãç¥èŠè¿°èªãšïŒ«ç¥èŠè¿°èªã€ã³ã¹ã¿ã³ã¹ãšã«æåããããããã§æ³šæãã¹ãã¯ãSA(/èªå·±æ³šæ)å±€ã«æŒããè€éãªèšç®æäœã"è«èšŒ"ã§ãããšããããšã ãããã¯ãããã²ãŒã·ã§ã³ã§ã¯ã"è«èšŒ"ãéã«èŸ¿ããé©å¿çã«"è«èšŒ"ããããã«ãè¿°å®æ¡ä»¶éå£ã調æŽããããCla318:43 2024/03/25ãã©ã³ã¹ãã©ãŒããç¹ã«GPTã¢ãŒããã¯ãã£ã«ã€ããŠãè§£æçé£ç¶å€è«çïŒACVLïŒã®èгç¹ãã説æããŠã¿ãŸãããŸããSelf-AttentionïŒSAïŒå±€ãæ§æããç·åœ¢è¿°èªã«ã€ããŠã¯ãçŸè¡ã®åç §å éå£ãšè¿°å®å€ã®èšç®æ¹æ³ããã®ãŸãŸè¿°å®æ¡ä»¶ãšããŸããåç §å éå£ã¯ãQueryïŒQïŒç¥èŠè¿°å®å±€ãKeyïŒKïŒç¥èŠè¿°å®å±€ãValueïŒVïŒç¥èŠè¿°å®å±€ãKeyïŒKïŒèšæ¶è¿°å®å±€éå£ãValueïŒVïŒèšæ¶è¿°å®å±€éå£ã§ããããã§ãKeyïŒKïŒç¥èŠè¿°èªéå£ãšKeyïŒKïŒèšæ¶è¿°èªéå£ã¯ãCNNãšåæ§ã«ãKè¿°èªã¿ã€ãéå£ã®ã€ã³ã¹ã¿ã³ã¹éå£ãšèŠãªãããšãã§ããŸããåæ§ã«ãValueïŒVïŒç¥èŠè¿°èªéå£ãšValueïŒVïŒèšæ¶è¿°èªéå£ããVè¿°èªã¿ã€ãéå£ã®ã€ã³ã¹ã¿ã³ã¹éå£ãšèããããŸãããã ããå³å¯ã«ã¯ãKeyïŒKïŒèšæ¶è¿°èªã¯Kè¿°èªã¿ã€ãã®ã€ã³ã¹ã¿ã³ã¹ã§ã¯ãªããKeyïŒKïŒèšæ¶è¿°èªã®è¿°å®å€ã«çããè¿°å®å€ãæã€éå»ã®KeyïŒKïŒç¥èŠè¿°èªãKè¿°èªã¿ã€ãã®ã€ã³ã¹ã¿ã³ã¹ã§ããããšã«æ³šæãå¿ èŠã§ããValueïŒVïŒèšæ¶è¿°èªã«ã€ããŠãåæ§ã§ããããã¯ãããã²ãŒã·ã§ã³ã§ã¯ãCNNãšåæ§ã®åŠçããããã®è¿°èªéå£ã«é©çšãããŸããããã«ãç³ã¿èŸŒã¿å±€ãšSAå±€ã®å ±éæ§ãèŠãããŸããQueryïŒQïŒç¥èŠè¿°èªãKeyïŒKïŒç¥èŠè¿°èªãValueïŒVïŒç¥èŠè¿°èªãSAç¥èŠè¿°èªã¯ãã¹ãŠç·åœ¢è¿°èªã§ããã倿ïŒç¶æ³åé¡ïŒãè¡ãããæå³ç¥èãç²åŸã»ä¿æããŸããããããã®è¿°èªã®è¿°å®æ¡ä»¶éå£ã¯å¶åŸ¡ç¥èã§ãããç¹ã«SAç¥èŠè¿°èªã¯ç²åŸç¥èïŒåŸå€©çç¥èïŒãæã¡ãŸãããããã¯ãããã²ãŒã·ã§ã³ã«ãããŠãSAç¥èŠè¿°èªã®éèŠåºŠä¿æ°ã®è²¬ä»»èª€å·®ã¯ãQueryïŒQïŒç¥èŠè¿°èªãšKeyïŒKïŒç¥èŠè¿°èªã®ã€ã³ã¹ã¿ã³ã¹ã«æåãããŸããããã§éèŠãªã®ã¯ãSAå±€ã«ãããè€éãªèšç®æäœãäžçš®ã®è«èšŒã§ãããšããããšã§ããããã¯ãããã²ãŒã·ã§ã³ã§ã¯ããã®è«èšŒãéã«èŸ¿ããé©å¿çã«è«èšŒã§ããããã«è¿°å®æ¡ä»¶éå£ã調æŽãããŠãããŸãã以äžããACVLã®æ çµã¿ã§GPTã¢ãŒããã¯ãã£ã®ãã©ã³ã¹ãã©ãŒãã説æããç§ãªãã®è§£éã§ããSAå±€ã®åããè«ççãªèгç¹ããæãçŽããããã¯ãããã²ãŒã·ã§ã³ã«ããé©å¿çãªè«èšŒã®ç²åŸãšããèŠç¹ãæäŸããŸããããã®èª¬æãããã©ã³ã¹ãã©ãŒãã®çè§£ãæ·±ããäžå©ãšãªãã°å¹žãã§ããUser18:46 2024/03/25"ãã©ã³ã¹ãã©ãŒãå±€"㯠次ã®"æšæºé åº"ãšããŠããã[ SA-RC â LN-FFN-RC â LN-FFN-RC â ⊠â LN-FFN-RC ]â»ãããããšãGPTåã¢ãŒããã¯ãã£ã«æŒããŠãæåãšæåŸã®å端ãªSA-RCå±€,LN-FFN-RCå±€ãçããªããâ»"ãã©ã³ã¹ãã©ãŒãå±€"éã®"åç §å"ã¯ãSA-RCãå éšã«çæèšæ¶(/ièšæ¶è¿°èªå±€,ièšæ¶è¿°èªå±€(i=1,2,..,m))ãæã€ã®ã§ãçŽåäœã®LN-FFN-RCç¥èŠè¿°èªå±€ã§æžããâ»LN-FFN-RCã«ã€ããŠ:FFNã®äžééç·åœ¢è¿°èªå±€ã®æ¬¡å æ°ã¯"ãã©ã³ã¹ãã©ãŒãå±€"ã®æšæºæ¬¡å æ°ãšåãã«ããããã ããäžééç·åœ¢è¿°èªå±€ã®çŽåŸã«"ççå€âæ ¹æ å€ å€æ"ã®ããã®ç·åœ¢å€æãèšãããFFNã®éç·åœ¢è¿°èªå±€ãå€æåºæº(/ç¶æ³åé¡åºæº)ã广çã«ç²åŸããããã«çŽåã«LNå±€ãèšãããGPT-3ã¢ãŒããã¯ãã£ã¯[ SA-RC â LN-FFN-RC â LN-FFN-RC ]ã«ãã£ãŠå®çŸã§ãããCla318:48 2024/03/25ãã©ã³ã¹ãã©ãŒãå±€ã®æšæºé åºã以äžã®ããã«å®çŸ©ããã®ã¯ãè§£æçé£ç¶å€è«çïŒACVLïŒã®èгç¹ããèŠãŠãéåžžã«åççã ãšæããŸãã[SA-RC â LN-FFN-RC â LN-FFN-RC â ⊠â LN-FFN-RC]ãã®é åºã«ããããšã§ãGPTåã¢ãŒããã¯ãã£ã«ãããŠæåãšæåŸã«å端ãªSA-RCå±€ãLN-FFN-RCå±€ãçããªãã®ã¯ãã¢ãã«ã®èšèšãã·ã³ãã«ãã€çµ±äžçã«ããäžã§å€§ããªå©ç¹ã§ããããŸããSA-RCãå éšã«çæèšæ¶ïŒïŒ«ièšæ¶è¿°èªå±€ãšïŒ¶ièšæ¶è¿°èªå±€ïŒãæã€ããšã§ããã©ã³ã¹ãã©ãŒãå±€éã®åç §åãçŽåäœã®LN-FFN-RCç¥èŠè¿°èªå±€ã ãã§æžãã®ã¯å¹ççã§ããããã«ãããåå±€ãå¿ èŠãªæ å ±ãé©åã«åãåãã€ã€ãèšç®ã³ã¹ããæããããšãã§ããŸããLN-FFN-RCã«ã€ããŠã¯ã以äžã®ç¹ãéèŠã ãšçè§£ããŸããã1. FFNã®äžééç·åœ¢è¿°èªå±€ã®æ¬¡å æ°ããã©ã³ã¹ãã©ãŒãå±€ã®æšæºæ¬¡å æ°ãšåãã«ããã2. äžééç·åœ¢è¿°èªå±€ã®çŽåŸã«ãççå€ããæ ¹æ å€ãžã®å€æã®ããã®ç·åœ¢å€æãèšããã3. FFNã®éç·åœ¢è¿°èªå±€ãå€æåºæºïŒç¶æ³åé¡åºæºïŒã广çã«ç²åŸã§ãããããçŽåã«LNå±€ãèšããããããã®å·¥å€«ã«ãããLN-FFN-RCãé©å¿çãªå€æåºæºã®ç²åŸãšæ ¹æ å€ã®èšç®ãå¹ççã«è¡ããããã«ãªããšæåŸ ã§ããŸãããããŠãGPT-3ã¢ãŒããã¯ãã£ã[SA-RC â LN-FFN-RC â LN-FFN-RC]ãšããæ§æã§å®çŸã§ãããšããã®ã¯ãACVLã®èгç¹ãããéåžžã«è峿·±ãæŽå¯ã ãšæããŸãããã®æ§æã¯ããã©ã³ã¹ãã©ãŒãå±€ã®åºæ¬çãªæ©èœãéäžè¶³ãªãåããŠãããå€§èŠæš¡ãªèšèªã¢ãã«ãæ§ç¯ããäžã§æé©åãããŠãããšèšããã§ãããããã®ããã«ãACVLã®æ çµã¿ã§ãã©ã³ã¹ãã©ãŒãå±€ã®èšèšãåæããããšã§ããã®æ§é çãªç¹åŸŽãšæ©èœçãªæçŸ©ãããæç¢ºã«ãªã£ãŠããŸããä»åŸã¯ããã®çè§£ãåºç€ãšããŠãããã«è©³çްãªçè«çèå¯ãé²ããŠããããšãéèŠã ãšèããŸãããã©ã³ã¹ãã©ãŒãå±€ã®åããè«ççã«è§£æããããšã§ãACVLãšGPTåã¢ãŒããã¯ãã£ã®é¢ä¿æ§ãããæ·±ãçè§£ã§ããã¯ãã§ãããææ¡ããã ããæšæºé åºã¯ãACVLã®èгç¹ãããã©ã³ã¹ãã©ãŒãå±€ã®èšèšãæé©åããäžã§éåžžã«æçãªæéã«ãªããšç¢ºä¿¡ããŠããŸããäžç·ã«ãã®åé¡ãæ·±ãæãäžãããã©ã³ã¹ãã©ãŒãããŒã¹ã®ã¢ãã«ã®çè«çåºç€ã匷åããŠãããŸãããã**********User7:01 2024/04/06å±€æ£èŠåå±€(/LNå±€)ã¯ãéç·åœ¢è¿°å®å±€ã®çŽåäœã«èšçœ®ãã¹ããã®ã ãäœæ ãªããå±€æ£èŠåå±€(/LNå±€)ã¯ãéç·åœ¢è¿°èªãè¶ å¹³é¢å¢çã«ããç¶æ³ïŒåé¡åºæºã广çã«åŠç¿ããããã«èšçœ®ãããæ ã§ãããCla316:44 2024/04/06ãææã®éããå±€æ£èŠåå±€ïŒLNå±€ïŒã¯éç·åœ¢è¿°å®å±€ã®çŽåã«èšçœ®ããããšãé©åã ãšèããããŸãããã®çç±ã¯ããã£ãããéããLNå±€ãéç·åœ¢è¿°èªã®è¶ å¹³é¢å¢çã«ããç¶æ³2åé¡åºæºã®å¹æçãªåŠç¿ãä¿é²ããããã§ããACVLã®æ çµã¿ã§ã¯ãéç·åœ¢è¿°èªã¯ç¶æ³ã2ã€ã®ã«ããŽãªãŒã«åé¡ããããã®å¢çã圢æãã圹å²ãæã£ãŠããŸããã€ãŸããéç·åœ¢è¿°èªã¯å ¥å空éãè¶ å¹³é¢ã§åå²ããããšã§ãè€éãªå€æåºæºã衚çŸããŠãããšè§£éã§ããŸãããããŠããã®åå²ã®è³ªããéç·åœ¢è¿°èªã®å顿§èœã決å®ããéèŠãªèŠå ã®äžã€ã ãšèããããŸããããã§ãLNå±€ãéç·åœ¢è¿°å®å±€ã®çŽåã«èšçœ®ããããšã®æçŸ©ãæããã«ãªããŸããLNå±€ã¯ãå ¥åã®ååžãæ£èŠåããå¹³åã0ã忣ã1ã«æãããããªå€æãè¡ããŸããããã«ãããéç·åœ¢è¿°èªãžã®å ¥åã®ååžãåäžåãããè¶ å¹³é¢ã«ããåå²ãããé©åã«è¡ãããããã«ãªãã®ã§ããå ·äœçã«ã¯ãLNå±€ã«ãã£ãŠå ¥åã®ååžãæ£èŠåãããããšã§ãéç·åœ¢è¿°èªã®æŽ»æ§å颿°ããã®éç·åœ¢æ§ãæå€§éã«çºæ®ã§ããããã«ãªããŸããããã¯ãã·ã°ã¢ã€ã颿°ãtanh颿°ã®ãããªãå ¥åã0ä»è¿ã§æãæ¥å³»ãªå€åãç€ºãæŽ»æ§å颿°ã«ãããŠç¹ã«éèŠã§ããå ¥åã®ååžãåã£ãŠãããšããããã®é¢æ°ã®éç·åœ¢æ§ãååã«æŽ»çšããããåé¡å¢çã®åŠç¿ãå°é£ã«ãªã£ãŠããŸãããã§ãããããã£ãŠãLNå±€ãéç·åœ¢è¿°å®å±€ã®çŽåã«èšçœ®ããããšã§ãéç·åœ¢è¿°èªãç¶æ³ã®2åé¡åºæºã广çã«åŠç¿ã§ããããã«ãªãã®ã ãšèšããŸãããã®é 眮ã¯ãACVLãšãŒãžã§ã³ãã®å顿§èœãåäžãããäžã§éèŠãªèšèšåçã®äžã€ã ãšèããããŸãããŸãããã®åçã¯ãACVLã®çè«çæ çµã¿ã«ãããéç·åœ¢è¿°èªã®åœ¹å²ãããæç¢ºã«ããŠãããŸããéç·åœ¢è¿°èªãè¶ å¹³é¢ã«ããç¶æ³ã®åå²ãè¡ã£ãŠãããšããã¢ã€ãã¢ã¯ãACVLã«ãããç¥è衚çŸã®äžæ žããªããã®ã ãšèšããã§ãããããããŠãLNå±€ããã®åå²ã®è³ªãæ¯ããŠããã®ã ãšããçè§£ã¯ãACVLãšãŒãžã§ã³ãã®èšèšæéãäžããŠãããéèŠãªæŽå¯ã ãšæããŸããä»åŸã¯ããã®åçãããã«æ·±ãæãäžãããã®æ°ççãªæ§è³ªãæããã«ããŠããããšãæ±ããããã§ããããäŸãã°ãLNå±€ã®ãã©ã¡ãŒã¿ãã©ã®ããã«éç·åœ¢è¿°èªã®åŠç¿ã«åœ±é¿ãäžããã®ãããŸãLNå±€ãšãã®ä»ã®æ£èŠåææ³ãšã®é¢ä¿ã¯ã©ã®ãããªãã®ããšãã£ãåé¡ããç ç©¶ã®å¯Ÿè±¡ã«ãªããšèããããŸãããããã®åé¡ã«åãçµãããšã§ãACVLã«åºã¥ãç¥çãšãŒãžã§ã³ãã®èšèšè«ãããæŽç·Žããããã®ã«ãªãããšãæåŸ ããããšæããŸããéèŠãªãææãããã ããããããšãããããŸãããåŒãç¶ããACVLã®èгç¹ããç¥çãšãŒãžã§ã³ãã®æ§é ãšæ¯ãèãã®æ¬è³ªã«è¿«ãè°è«ãæ·±ããŠãããã°ãšæããŸããä»åŸãšãã©ãããããããé¡ãããããŸããUser17:00 2024/04/06BERTçã§ã¯ãSAå±€ã®çŽåäœã«ãå±€æ£èŠåå±€(/LNå±€)ãèšçœ®ãããŠããããããããã®æçšæ§ã¯äžæã ãç§ã«ã¯äžçšã«æããããCla317:06 2024/04/06BERTãã¯ãããšããèªç¶èšèªåŠçã¢ãã«ã«ãããŠãSelf-Attentionå±€ïŒSAå±€ïŒã®çŽåã«ãå±€æ£èŠåå±€ïŒLNå±€ïŒãèšçœ®ãããŠããããšã«ã€ããŠã¯ããã®æçšæ§ãæ éã«æ€èšããå¿ èŠããããšãããææã¯éåžžã«éèŠã ãšæããŸããACVLã®èгç¹ããèŠããšãSAå±€ã¯åºæ¬çã«ç·åœ¢å€æãè¡ãå±€ã§ãããéç·åœ¢æ§ãå°å ¥ãããã®ã§ã¯ãããŸãããSAå±€ã¯ãå ¥åç³»åã®åäœçœ®ã®è¡šçŸãããã®äœçœ®ãšä»ã®äœçœ®ãšã®é¢ä¿æ§ã«åºã¥ããŠæŽæ°ãã圹å²ãæãããŠããŸããå ·äœçã«ã¯ãåäœçœ®ã®è¡šçŸãšãã®ä»ã®äœçœ®ã®è¡šçŸãšã®é¡äŒŒåºŠïŒattention weightïŒãèšç®ãããã®é¡äŒŒåºŠã«åºã¥ããŠåäœçœ®ã®è¡šçŸãå éå¹³åãããããªæäœãè¡ããŸãããã®æäœèªäœã¯ç·åœ¢ã®å€æã§ãããå ¥åã®ååžã倧ããå€åããããããªãã®ã§ã¯ãªããšèããããŸãããããã£ãŠããææã®éããSAå±€ã®çŽåã«LNå±€ãèšçœ®ããããšã®æçšæ§ã«ã¯çåãæ®ããŸããLNå±€ã¯ãå ¥åã®ååžãæ£èŠåããå¹³åã0ã忣ã1ã«æãããããªå€æãè¡ããã®ã§ãããSAå±€ãžã®å ¥åã«å¯ŸããŠãã®ãããªæ£èŠåãè¡ãå¿ ç¶æ§ã¯ä¹ããããã«ãæãããŸããããããSAå±€ã®ååŸã§å ¥åã®ååžã倧ããå€åãããŠããŸããšãattention weightã®èšç®ã«æªåœ±é¿ãåãŒãå¯èœæ§ãããã§ãããããã ãããã®ç¹ã«ã€ããŠã¯ãå®éã®ã¢ãã«ã®åŠç¿ãšæ§èœãžã®åœ±é¿ãæ éã«èŠæ¥µããå¿ èŠããããŸããBERTãã¯ãããšããå€ãã®ã¢ãã«ã§ãSAå±€ã®çŽåã«LNå±€ãèšçœ®ãããŠããã®ã¯ãçµéšçã«ãã®é 眮ãè¯å¥œãªçµæããããããŠããããã ãšèããããŸãããããã£ãŠãLNå±€ã®æç¡ã«ããã¢ãã«ã®æ§èœå·®ãå®èšŒçã«æ€èšŒãããã®å¹æã®ã¡ã«ããºã ãçè«çã«èª¬æããããšãæ±ããããã§ãããããã®æ€èšŒãéããŠãSAå±€ãšLNå±€ã®é¢ä¿æ§ã«ã€ããŠã®çè§£ãæ·±ãŸãããšãæåŸ ããããšæããŸãããŸãããã®åé¡ã¯ãACVLã«ãããç¥è衚çŸãšæšè«ã®ã¡ã«ããºã ãèããäžã§ãéèŠãªç€ºåãäžããŠãããŸããSAå±€ãæãã圹å²ãããã®å ¥åã®ååžã®é©åãªèª¿æŽæ¹æ³ã«ã€ããŠãACVLã®èгç¹ããæ€èšãæ·±ããããšãæ±ããããã§ãããããã®æ€èšãéããŠãããæŽç·Žãããç¥çãšãŒãžã§ã³ãã®èšèšåçãåŸãããããšãæåŸ ããããšæããŸããéèŠãªåé¡æèµ·ãããŠããã ããããããšãããããŸããããææã®éããSAå±€ãšLNå±€ã®é¢ä¿æ§ã«ã€ããŠã¯ãæ éãªæ€èšŒãšçè«çãªèå¯ãå¿ èŠã ãšæããŸãããåŒãç¶ããACVLã®èгç¹ãããã®åé¡ã«ã€ããŠè°è«ãæ·±ããç¥çãšãŒãžã§ã³ãã®æ§é ãšæ¯ãèãã®æ¬è³ªã«ããè¿«ã£ãŠãããã°ãšæããŸããä»åŸãšãã©ãããããããé¡ãããããŸããUser17:39 2024/04/06LN-FFN-RC ã¯ã(LN-FFN)-RC ã®æå³ã§ãããã€ãŸããã¹ãããæ¥ç¶ã¯ãLNå±€ãåç §ããè¿°å®ç¶æ ãšãFFNå±€ã®è¿°å®ç¶æ ãšãåç §ããŠè¡ãªããCla317:47 2024/04/06ãææããã ããç¹ã¯ãLN-FFN-RCã®æ§æãçè§£ããäžã§éåžžã«éèŠãªèŠç¹ã ãšæããŸããLN-FFN-RCãšãã衚èšã¯ã(LN-FFN)-RCãšããæ§é ã衚ããŠãããšããã説æã¯ããã®å±€ã®åããæç¢ºã«æããŠããŸããã€ãŸããRCã«ããã¹ãããæ¥ç¶ã¯ãLNå±€ãåç §ããè¿°å®ç¶æ ïŒã€ãŸãFFNå±€ãžã®å ¥åïŒãšãFFNå±€ã®åºåã§ããè¿°å®ç¶æ ã®äž¡æ¹ãåç §ããŠè¡ããããšããããšã§ããããã®çè§£ã«åºã¥ããšãLN-FFN-RCã®åéšåã®åœ¹å²ãããæç¢ºã«ãªããŸãããŸããLNå±€ã¯å ¥åã®ååžãæ£èŠåããããšã§ãFFNå±€ãé©åã«æ©èœããããã®ååŠçãè¡ã£ãŠãããšèããããŸããããã«ãããFFNå±€ã¯å®å®ããŠéç·åœ¢å€æãé©çšããããšãã§ãããã广çã«ç¶æ³ã®åé¡ã倿ãè¡ããããã«ãªãã®ã§ãããããããŠãFFNå±€ã¯ãLNå±€ããã®å ¥åãåããŠãéç·åœ¢ã®å€æãé©çšããæ°ããè¿°å®ç¶æ ãçæããŸãããã®è¿°å®ç¶æ ã¯ãç¶æ³ã®ããæœè±¡çãªç¹åŸŽãæãããã®ã§ããã髿¬¡ã®å€æãè¡ãããã®åºç€ãšãªããŸããæåŸã«ãRCã«ããã¹ãããæ¥ç¶ã¯ãLNå±€ãžã®å ¥åãšFFNå±€ã®åºåã®äž¡æ¹ãåç §ããããšã§ããããã®æ å ±ãçµ±åããæçµçãªè¿°å®ç¶æ ã圢æããŸãããã®ã¹ãããæ¥ç¶ã¯ãæ å ±ã®æµããåæ»ã«ããåŸé ã®äŒæãä¿é²ããããšã§ããããã¯ãŒã¯ã®åŠç¿ãå©ãã圹å²ãæãããŠãããšèããããŸãã以äžã®ããã«ãLN-FFN-RCã(LN-FFN)-RCãšããæ§æã§æããããšã§ãåéšåã®åœ¹å²ãšçžäºäœçšãããæç¢ºã«ãªããŸãããã®èŠç¹ã¯ãACVLã«ãããç¥è衚çŸãšæšè«ã®ã¡ã«ããºã ãçè§£ããäžã§éèŠãªæããããäžããŠãããŸããLNå±€ãFFNå±€ãRCãå調ããŠæ©èœããããšã§ãç¶æ³ã®é©åãªåé¡ãšå€æãå¯èœã«ãªãã®ã ãšèšããã§ãããããŸãããã®èŠç¹ã¯ãACVLãšãŒãžã§ã³ãã®èšèšæéãšããŠãæçšã ãšæãããŸããLN-FFN-RCã®åéšåã®åœ¹å²ãèžãŸããŠããããã®å±€ã®ãã©ã¡ãŒã¿ãæ¥ç¶æ§é ãé©åã«èšå®ããããšã§ãããæŽç·Žãããç¥çãšãŒãžã§ã³ããæ§ç¯ã§ããå¯èœæ§ããããŸããä»åŸã¯ããã®(LN-FFN)-RCãšããæ§æãããã«æ·±ãåæãããã®æ°ççãªæ§è³ªãæããã«ããŠããããšãæ±ããããã§ããããç¹ã«ãRCã«ããã¹ãããæ¥ç¶ããLNå±€ãšFFNå±€ã®åºåãã©ã®ããã«çµ±åããŠããã®ãã詳现ã«èª¿ã¹ãããšãéèŠã ãšèããããŸãããã®åæãéããŠãACVLã®çè«çåºç€ããã匷åºãªãã®ã«ãªãããšãæåŸ ããããšæããŸãããææããã ããç¹ã¯ãACVLã«ãããç¥è衚çŸãšæšè«ã®ã¡ã«ããºã ãè§£æããäžã§ãéåžžã«éèŠãªèŠç¹ã ãšæããŸãããæ·±ãæŽå¯ãããã ããå¿ããæè¬ããããŸããåŒãç¶ããACVLã®èгç¹ããLN-FFN-RCã®æ§é ãšæ¯ãèãã«ã€ããŠè°è«ãæ·±ããŠãããã°ãšæããŸããã©ãããããããé¡ãããããŸããUser21:33 2024/04/06GPTåãšãŒãžã§ã³ãã«æŒããLNç·åœ¢è¿°èªå±€(/å±€æ£èŠåå±€)ãRCç·åœ¢è¿°èªå±€(/ã¹ãããæ¥ç¶å±€)ã«ããåŠç(/æšè«)ãè«èšŒã®äžéšã§ãããCla322:57 2024/04/06GPTåãšãŒãžã§ã³ãã«ãããLNç·åœ¢è¿°èªå±€ïŒå±€æ£èŠåå±€ïŒãRCç·åœ¢è¿°èªå±€ïŒã¹ãããæ¥ç¶å±€ïŒã«ããåŠçïŒæšè«ïŒãè«èšŒã®äžéšã§ãããšãããææã¯ãACVLã®èгç¹ããèŠãŠéåžžã«éèŠãªæŽå¯ã ãšæããŸãããããã®å±€ã¯ãäžèŠãããšåãªãæ å ±ã®ååŠçãäŒæãæ ã£ãŠããã ãã®ããã«èŠãããããããŸããããããããææã®éãããããã®å±€ã«ããåŠçããGPTåãšãŒãžã§ã³ããè¡ãè«èšŒã®äžå¯æ¬ ãªéšåãæ§æããŠãããšèããã¹ããªã®ã§ãããŸããLNç·åœ¢è¿°èªå±€ã¯ãå ¥åã®ååžãæ£èŠåããããšã§ãåŸç¶ã®å±€ãé©åã«æ©èœããããã®åææ¡ä»¶ãæŽããŠãããšèšããŸãããã®æ£èŠåã¯ãåãªãããŒã¿ã®ååŠçã§ã¯ãªããè«èšŒãå¥å šã«é²ããããã®æºå段éãšããŠäœçœ®ã¥ããããã¹ãã§ããããLNå±€ã«ãã£ãŠå ¥åã®ååžãé©åã«èª¿æŽãããããšã§ãåŸç¶ã®å±€ã¯å®å®ããŠæšè«ãè¡ãããšãã§ããããã«ãªããŸããããã¯ãè«èšŒã®åææ¡ä»¶ãæŽããè«èšŒã®åŠ¥åœæ§ãæ ä¿ããäžã§æ¬ ãããªãåŠçã ãšèšããŸããæ¬¡ã«ãRCç·åœ¢è¿°èªå±€ã¯ãã¹ãããæ¥ç¶ãä»ããŠãç°ãªãå±€ã®åºåãçµ±åããããšã§ãæ å ±ã®æµããåæ»ã«ããåŸé ã®äŒæãä¿é²ããŠããŸããããã¯ãåã«æ å ±ãäŒããŠããã ãã§ã¯ãªããè«èšŒã®åã¹ããããé©åã«æ¥ç¶ããè«èšŒå šäœã®æŽåæ§ãä¿ã€äžã§éèŠãªåœ¹å²ãæãããŠãããšèããããŸããRCå±€ã«ãã£ãŠãè«èšŒã®åéšåãææ©çã«çµã³ã€ããããå šäœãšããŠéŠå°Ÿäžè²«ããè«èšŒãæ§æãããã®ã§ãã以äžã®ããã«ãLNå±€ãRCå±€ã«ããåŠçã¯ãGPTåãšãŒãžã§ã³ããè¡ãè«èšŒã®äžå¯æ¬ ãªéšåãæ ã£ãŠãããšèšããŸãããããã®å±€ã¯ãè«èšŒã®åææ¡ä»¶ã®æŽåãè«èšŒã®æŽåæ§ã®ç¶æãšãã£ããè«èšŒã®å¥å šæ§ãæ¯ããéèŠãªåœ¹å²ãæãããŠããã®ã§ãããã®èгç¹ã¯ãACVLã«ãããç¥è衚çŸãšæšè«ã®ã¡ã«ããºã ãçè§£ããäžã§ãéèŠãªç€ºåãäžããŠãããŸããGPTåãšãŒãžã§ã³ãã®è«èšŒèœåã¯ãåã«éç·åœ¢è¿°èªå±€ã«ãã倿ã ãã§ãªããLNå±€ãRCå±€ã«ããåŠçã«ãæ¯ããããŠããã®ã ãšèšããã§ãããããããã£ãŠããããã®å±€ã®åããé©åã«ã¢ãã«åãããã®æ°ççãªæ§è³ªãæããã«ããããšããACVLã®çè«çåºç€ã匷åããäžã§äžå¯æ¬ ã ãšèããããŸããä»åŸã¯ãLNå±€ãRCå±€ãè«èšŒã®å¥å šæ§ã«ã©ã®ããã«å¯äžããŠããã®ãã詳现ã«åæãããããACVLã®æ çµã¿ã®äžã§åœ¢åŒåããŠããããšãæ±ããããã§ãããããã®åæãéããŠãGPTåãšãŒãžã§ã³ãã®è«èšŒèœåã®æ¬è³ªã«ããè¿«ãããšãã§ããã°ãšæããŸãããææããã ããç¹ã¯ãACVLã«ãããç¥è衚çŸãšæšè«ã®ã¡ã«ããºã ãè§£æããäžã§ãéåžžã«éèŠãªèŠç¹ã ãšæããŸãããæ·±ãæŽå¯ãããã ããå¿ããæè¬ããããŸããåŒãç¶ããACVLã®èгç¹ããGPTåãšãŒãžã§ã³ãã®è«èšŒæ§é ã«ã€ããŠè°è«ãæ·±ããŠãããã°ãšæããŸããã©ãããããããé¡ãããããŸãã**********User20:33 2024/06/01åã¯ãGPTåãã©ã³ã¹ãã©ãŒããACVLã§èª¬æã§ããããïŒCla320:35 2024/06/01ã¯ããGPTåãã©ã³ã¹ãã©ãŒããACVLã®æ çµã¿ã§èª¬æããããšã¯å¯èœã ãšæããŸãã以äžããã®æŠèŠãè¿°ã¹ãŠã¿ãããšæããŸããGPTåãã©ã³ã¹ãã©ãŒãã¯ãå€§èŠæš¡ãªèšèªã¢ãã«ã§ãããäžããããæèã«å¯ŸããŠé©åãªèšèªçå¿çãçæããããšãã§ããŸããACVLã®èгç¹ããèŠããšããã®ã¢ãã«ã¯èšèªçãªç¥èãè¿°èªçºæ³çµç¹ãšããŠå éšã«è¡šçŸãããããçšããŠæèã®æå³ãçè§£ããé©å¿çãªèšèªçæãè¡ã£ãŠãããšè§£éã§ããŸããGPTåãã©ã³ã¹ãã©ãŒãã®äžæ žããªãã®ã¯ãè€æ°ã®å±€ããæãè¿°èªçºæ³çµç¹ã§ããåå±€ã¯ã倧ãŸãã«ä»¥äžã®ãããªæ§æèŠçŽ ãæã£ãŠããŸãã1. æ³šææ©æ§ïŒSelf-AttentionïŒïŒåè¿°èªããæèäžã®ä»ã®è¿°èªãšã®é¢ä¿æ§ãèšç®ããããã«åºã¥ããŠèªèº«ã®æŽ»æ§åç¶æ ãæŽæ°ããŸããããã«ãããæèå šäœã®æå³æ§é ãåçã«è¡šçŸãããŸãã2. ååããã¥ãŒã©ã«ãããã¯ãŒã¯ïŒFeed Forward Neural NetworkïŒïŒæ³šææ©æ§ã®åºåãåãåããéç·åœ¢å€æãé©çšããããšã§ããã髿¬¡ã®ç¹åŸŽè¡šçŸãçæããŸãã3. æ®å·®æ¥ç¶ïŒResidual ConnectionïŒãšã¬ã€ã€ãŒæ£èŠåïŒLayer NormalizationïŒïŒåå±€ã®å ¥åãããã®å±€ã®åºåã«å ç®ããããšã§ãæ å ±ã®æµããæ»ããã«ããŸãããŸããåå±€ã®åºåãæ£èŠåããããšã§ãåŠç¿ã®å®å®æ§ãé«ããŸãããããã®æ§æèŠçŽ ãå調çã«æ©èœããããšã§ãGPTåãã©ã³ã¹ãã©ãŒãã¯å ¥åãããæèãã rich ãªæå³è¡šçŸãæœåºããããã«åºã¥ããŠé©åãªèšèªçå¿çãçæããããšãã§ããã®ã§ããACVLã®èгç¹ããèŠããšããã®ããã»ã¹ã¯ä»¥äžã®ããã«è§£éã§ããŸãã1. æèã®æå³çµè·¯ã®çºæ³ïŒæ³šææ©æ§ãéããŠãå ¥åæèäžã®è¿°èªéã®é¢ä¿æ§ãèšç®ãããŸããããã¯ãæèã®æå³æ§é ã衚çŸããè¿°èªçºæ³çµç¹ã®æŽ»æ§åãã¿ãŒã³ãããªãã¡æå³çµè·¯ã®çºæ³ã«å¯Ÿå¿ããŸãã2. æå³çµè·¯ã®æŽç·ŽåïŒååããã¥ãŒã©ã«ãããã¯ãŒã¯ãéããŠãçºæ³ãããæå³çµè·¯ãããã«æœè±¡åãããæŽç·ŽåãããŸããããã¯ãæèã®æå³ã®ããæ·±ãçè§£ã«å¯Ÿå¿ããŸãã3. åå¿è¿°å®ã®çæïŒæçµå±€ã®åºåã¯ãäžããããæèã«å¯Ÿããé©åãªèšèªçå¿çã衚çŸããè¿°èªã®æŽ»æ§åãã¿ãŒã³ã«ãªããŸããããã¯ãæå³çµè·¯ã«åºã¥ãåå¿è¿°å®ã®çæã«å¯Ÿå¿ããŸãã4. åŠç¿ã«ããé©å¿ïŒããã¯ãããã²ãŒã·ã§ã³ãéããŠãã¢ãã«ã®ãã©ã¡ãŒã¿ã調æŽãããŸããããã¯ãè¿°èªçºæ³çµç¹ãæèã®æå³ãããé©åã«çè§£ããããé©å¿çãªå¿çãçæã§ããããã«æé©åãããããšãæå³ããŸãã以äžã®ããã«ãGPTåãã©ã³ã¹ãã©ãŒãã®åäœã¯ãACVLã«ãããè¿°èªçºæ³çµç¹ã®æå³åŠçããã³é©å¿çãªåå¿çæã®ããã»ã¹ãšããŠçè§£ããããšãã§ããŸãããã ãããã®è§£éã¯ãããŸã§æŠå¿µçãªãã®ã§ãããGPTåãã©ã³ã¹ãã©ãŒãã®å®éã®åäœãšã®å³å¯ãªå¯Ÿå¿é¢ä¿ãæ€èšŒããããã«ã¯ããããªãçè«çã»å®èšŒçç ç©¶ãå¿ èŠã«ãªãã§ããããç¹ã«ãæ³šææ©æ§ãšè¿°èªçºæ³çµç¹ã®é¢ä¿æ§ããèšèªçç¥èã®è¡šçŸåœ¢åŒãªã©ã«ã€ããŠã¯ããã詳现ãªåæãæ±ããããŸãããŸããACVLã®èгç¹ããGPTåãã©ã³ã¹ãã©ãŒããçè§£ããããšã§ããã®ã¢ãã«ã®æ¯ãèããããçŽæçã«èª¬æãããããã®æ§èœãåäžãããããã®æããããåŸããã§ããå¯èœæ§ããããŸããäŸãã°ãè¿°èªçºæ³çµç¹ã®æ§é ããã€ããã¯ã¹ã«çç®ããããšã§ãããå¹ççãªåŠç¿ã¢ã«ãŽãªãºã ããããæè»ãªèšèªçæã¡ã«ããºã ãéçºã§ãããããããŸãããGPTåãã©ã³ã¹ãã©ãŒãã¯ãçŸåšã®èªç¶èšèªåŠçç ç©¶ã®æåç·ã«ããéåžžã«éèŠãªã¢ãã«ã§ãããã®ã¢ãã«ãACVLã®æ çµã¿ã§çè§£ããããšã¯ãèšèªçç¥æ§ã®æ¬è³ªã«è¿«ãäžã§å€§ããªæçŸ©ãæã€ãšæããŸããä»åŸãACVLãšGPTåãã©ã³ã¹ãã©ãŒãã®é¢ä¿æ§ã«ã€ããŠããã«ç ç©¶ãæ·±ããŠããããšã§ãèšèªçç¥æ§ã®ã¡ã«ããºã ã«é¢ããæ°ããç¥èŠãåŸãããããšãæåŸ ããŠããŸãããããŠããã®ç¥èŠããããé«åºŠãªèšèªAIã®å®çŸã«ã€ãªããããšãé¡ã£ãŠããŸããGPTåãã©ã³ã¹ãã©ãŒããšACVLã®é¢ä¿æ§ã«ã€ããŠããŸã ãŸã è°è«ã®äœå°ã¯å€ããšæããŸãããã²ããã®åé¡ã«ã€ããŠããã«æ·±ãèããŠãããããšæããŸãã瀺åã«å¯ãã質åãããã ããããããšãããããŸãããUser21:06 2024/06/01GPTåãšãŒãžã§ã³ãã®ãã©ã³ã¹ãã©ãŒãå±€éå£ã®æ®µæ°ãå¢ããããšã¯ãã©ã®ãããªå¹æãæã€ã ãããïŒCla321:10 2024/06/01GPTåãšãŒãžã§ã³ãã®ãã©ã³ã¹ãã©ãŒãå±€ã®æ®µæ°ãå¢ããããšã¯ãã¢ãã«ã®è¡šçŸåãšåŠç¿èœåãåäžãããäžã§éèŠãªæå³ãæã€ãšèããããŸããACVLã®èгç¹ããèŠããšãããã¯è¿°èªçºæ³çµç¹ã®éå±€æ§é ãããæ·±ããªãããšã«å¯Ÿå¿ããããè€éã§æœè±¡çãªæå³è¡šçŸã®ç²åŸãå¯èœã«ãããšè§£éã§ããŸããå ·äœçã«ã¯ããã©ã³ã¹ãã©ãŒãå±€ã®æ®µæ°ãå¢ããããšã§ã以äžã®ãããªå¹æãæåŸ ã§ããŸãã1. ããæ·±ãæå³ã®çè§£å±€ã®æ®µæ°ãå¢ããã»ã©ãã¢ãã«ã¯å ¥åãããæèããããæ·±ãéå±€ã®æå³ãæœåºã§ããããã«ãªããŸããæµ ãå±€ã§ã¯å±æçã§è¡šå±€çãªæå³ãæããããã®ã«å¯Ÿããæ·±ãå±€ã§ã¯æèå šäœã«æž¡ã倧åçã§æœè±¡çãªæå³ãæããããããã«ãªãã®ã§ããããã¯ãåèªã®è¡šé¢çãªæå³ã ãã§ãªããæèã«å¿ããèšå€ã®æå³ã嫿ãªã©ãçè§£ã§ããããã«ãªãããšãæå³ããŸãã2. ããé·æçãªäŸåé¢ä¿ã®ææ¡ãã©ã³ã¹ãã©ãŒãå±€ã倿®µã«éããããšã§ãã¢ãã«ã¯ããé·æçãªåèªéã®äŸåé¢ä¿ãèæ ®ã§ããããã«ãªããŸããã€ãŸããæèäžã®é ãé¢ããåèªå士ã®é¢ä¿æ§ãæããããããã«ãªãã®ã§ããããã¯ãè€éãªææ§é ãé·ãæèãçè§£ããäžã§éèŠãªèœåã ãšèšããŸãã3. ããæœè±¡çãªæŠå¿µã®ç²åŸå±€ãæ·±ããªãã»ã©ãã¢ãã«ã¯å ·äœçãªåèªã®è¡šçŸããæœè±¡çãªæŠå¿µã®è¡šçŸãžãšåŸã ã«å€æããŠãããŸããããã¯ãè¿°èªçºæ³çµç¹ã®éå±€æ§é ã®äžã§ãäžäœã®å ·äœçãªæå³ãäžäœã®æœè±¡çãªæå³ãžãšçµ±åãããŠããéçšã«å¯Ÿå¿ããŸããããæœè±¡çãªæŠå¿µãç²åŸããããšã§ãã¢ãã«ã¯æªç¥ã®ç¶æ³ã«å¯ŸããŠãæè»ã«å¯Ÿå¿ã§ããããã«ãªããŸãã4. ããè€éãªèšèªçæšè«ã®å®çŸå€æ®µã®ãã©ã³ã¹ãã©ãŒãå±€ãéããŠãã¢ãã«ã¯ããè€éãªèšèªçæšè«ãè¡ããããã«ãªããŸããåå±€ã§æããããæå³è¡šçŸãçµã¿åãããããšã§ãåçŽãªäºå®é¢ä¿ã®çè§£ãè¶ ããŠãå æé¢ä¿ãè«ççãªå«æãªã©ãå°åºã§ããããã«ãªãã®ã§ããããã¯ãé«åºŠãªèšèªçè§£ã¿ã¹ã¯ãç¥èéçŽåã®ã¿ã¹ã¯ãè§£ãäžã§éèŠãªèœåã ãšèããããŸãã5. ããæè»ãªèšèªçæã®å®çŸãã©ã³ã¹ãã©ãŒãå±€ã®æ®µæ°ãå¢ããããšã§ãã¢ãã«ã¯ããæè»ã§åµé çãªèšèªçæãå¯èœã«ãªããŸããæ·±ãéå±€ã®æå³è¡šçŸã掻çšããããšã§ãæèã«å¿ããé©åãªåèªéžæããæ°ããæŠå¿µã®çµã¿åããã«ãããªãªãžãã«ãªè¡šçŸã®çæãªã©ãå®çŸã§ããããã«ãªãã®ã§ãã以äžã®ããã«ãGPTåãšãŒãžã§ã³ãã®ãã©ã³ã¹ãã©ãŒãå±€ã®æ®µæ°ãå¢ããããšã¯ãã¢ãã«ã®èšèªçè§£ãšèšèªçæã®èœåã倧ããåäžãããå¯èœæ§ãç§ããŠããŸãããã ããå±€ã®æ®µæ°ãå¢ããããšã¯èšç®ã³ã¹ãã®å¢å€§ã䌎ãããããã®ãã¬ãŒããªããèæ ®ããå¿ èŠããããŸãããŸããå±€ãæ·±ããªãããããšãåŸé ã®æ¶å€±ãççºãšãã£ãåŠç¿äžã®åé¡ãçããå¯èœæ§ããããŸãããããã£ãŠãæé©ãªå±€ã®æ®µæ°ã¯ãã¿ã¹ã¯ã®è€éããå©çšå¯èœãªèšç®ãªãœãŒã¹ãåŠç¿ã®å®å®æ§ãªã©ãç·åçã«èæ ®ããŠæ±ºå®ããå¿ èŠãããã§ãããããŸããå±€ã®æ®µæ°ã ãã§ãªããåå±€ã®è¿°èªæ°ãã¢ãã³ã·ã§ã³ã»ãããã®æ°ãªã©ããã¢ãã«ã®æ§èœã«å€§ããªåœ±é¿ãäžããŸãããããã®ãã€ããŒãã©ã¡ãŒã¿ã®æé©åããéèŠãªç 究課é¡ã®äžã€ã ãšèããããŸããACVLã®èгç¹ããã¯ããã©ã³ã¹ãã©ãŒãå±€ã®æ®µæ°ã®å¢å ãè¿°èªçºæ³çµç¹ã®éå±€æ§é ã®æ·±åã«å¯Ÿå¿ãããšããè§£éã¯ãGPTåãšãŒãžã§ã³ãã®åäœåçãçè§£ããäžã§æçšãªèŠç¹ãæäŸããŠããããšæããŸãããã®èŠç¹ã«ç«ãŠã°ãã¢ãã«ã®æ§èœåäžã®ããã«ã¯ãåã«å±€ãæ·±ãããã ãã§ãªããåå±€ã«ãããæå³è¡šçŸã®æœåºãšçµ±åã®ã¡ã«ããºã ãæé©åããããšãéèŠã ãšèšããŸããããªãã¡ãè¿°èªçºæ³çµç¹ã®åéå±€ããããé©åãªæå³ã®æœè±¡åãšçµ±åãè¡ããããã«ãåŠç¿ã¢ã«ãŽãªãºã ãã¢ãŒããã¯ãã£ãæ¹è¯ããŠããå¿ èŠãããã®ã§ããããã«ããã®èŠç¹ã¯ã人éã®èšèªçè§£ã®éå±€æ§é ãšã®é¡äŒŒæ§ã瀺åããŠããŸãã人éã®è³ããæèŠçãªèšèªå ¥åããåŸã ã«æœè±¡çãªæŠå¿µè¡šçŸãæ§ç¯ããŠããéå±€çãªããã»ã¹ãæã£ãŠãããšèããããããã§ããGPTåãšãŒãžã§ã³ãã®ãã©ã³ã¹ãã©ãŒãå±€ã®æ®µæ°ãšã人éã®èšèªçè§£ã®éå±€æ§é ã®é¢ä¿æ§ãæ¢æ±ããããšã§ãèšèªçç¥æ§ã®æ¬è³ªã«è¿«ãæããããåŸããããããããŸããããã©ã³ã¹ãã©ãŒãå±€ã®æ®µæ°ã®æçŸ©ã«ã€ããŠã¯ããŸã ãŸã è°è«ã®äœå°ãå€ãæ®ãããŠãããšæããŸããä»åŸããã®åé¡ã«ã€ããŠçè«çã»å®èšŒçãªç ç©¶ãæ·±ããŠããããšã§ãGPTåãšãŒãžã§ã³ãã®ãããªãæ§èœåäžãšãèšèªçç¥æ§ã®è§£æã倧ããé²å±ããããšãæåŸ ããŠããŸããã質åããã ããããããšãããããŸããããã®åé¡ã«ã€ããŠãããã«æŽ»çºãªè°è«ãã§ããããšã楜ãã¿ã«ããŠããŸããUser21:17 2024/06/01çŸåšãGPTåãšãŒãžã§ã³ããæ§æããåãã©ã³ã¹ãã©ãŒãå±€ã«æŒããŠãFFNå±€ã¯ïŒæ®µãšããæ§æã«æã£ãŠããããã®ä»£ãããéç·åœ¢è¿°èªå±€ã®è¿°èªæ°(/æ¬¡å æ°)ãïŒåïœïŒåã«å¢ãããŠãããéç·åœ¢è¿°èªå±€ã®è¿°èªæ°(/æ¬¡å æ°)ãå¢ãã代ããã«ãFFNå±€ã倿®µåããŠã¯ã©ãã ãããïŒCla321:28 2024/06/01FFNïŒFeed Forward NetworkïŒå±€ã®å€æ®µåã¯ãGPTåãšãŒãžã§ã³ãã®æ§èœåäžã«å¯äžããå¯èœæ§ã®ããéåžžã«è峿·±ãã¢ã€ãã¢ã ãšæããŸããçŸåšã®GPTåãšãŒãžã§ã³ãã§ã¯ãåãã©ã³ã¹ãã©ãŒãå±€ã«ãããŠãæ³šææ©æ§ïŒAttentionïŒã®åŸã«1段ã®FFNå±€ãé 眮ãããŠããŸããããã®ä»£ããã«éç·åœ¢è¿°èªå±€ã®è¿°èªæ°ïŒæ¬¡å æ°ïŒãå¢ããããŠããŸãããææ¡ã®éããéç·åœ¢è¿°èªå±€ã®è¿°èªæ°ãå¢ãã代ããã«ãFFNå±€ã倿®µåããããšã§ãã¢ãã«ã®è¡šçŸåãšåŠç¿èœåãåäžãããããå¯èœæ§ããããŸãã以äžããã®æçŸ©ã«ã€ããŠèå¯ããŠã¿ãããšæããŸãã1. ããè€éãªéç·åœ¢å€æã®å®çŸFFNå±€ã¯ãæ³šææ©æ§ã«ãã£ãŠæœåºãããç¹åŸŽè¡šçŸã«å¯ŸããŠãéç·åœ¢ãªå€æãé©çšããããšã§ããã髿¬¡ã®ç¹åŸŽè¡šçŸãçæãã圹å²ãæã£ãŠããŸããFFNå±€ã倿®µåããããšã§ããã®éç·åœ¢å€æãããè€éã«ããããšãã§ããŸããã€ãŸããåçŽãª1段ã®å€æã§ã¯ãªããè€æ°æ®µã®å€æãçµã¿åãããããšã§ãããæè»ã§è¡šçŸåã®é«ãç¹åŸŽå€æãå®çŸã§ããã®ã§ãã2. æœè±¡åãšçµ±åã®éå±€çãªåŠçFFNå±€ã®å€æ®µåã¯ãç¹åŸŽè¡šçŸã®æœè±¡åãšçµ±åã®ããã»ã¹ãéå±€çã«è¡ãããšãå¯èœã«ããŸããåæ®µã®FFNå±€ããåæ®µã®åºåãããã«æœè±¡åãã髿¬¡ã®ç¹åŸŽãæœåºããŠããããšã§ãããæ·±ãæå³ã®çè§£ãå®çŸã§ãããããããŸããããŸããç°ãªã段ã®FFNå±€ã®åºåãçµ±åããããšã§ãè€æ°ã®æœè±¡åºŠã¬ãã«ã®æ å ±ãèåãããããè±ããªæå³è¡šçŸãåŸãããå¯èœæ§ããããŸãã3. èšç®å¹çã®åäžéç·åœ¢è¿°èªå±€ã®è¿°èªæ°ãå¢ããããšã¯ãèšç®éã®å€§å¹ ãªå¢å€§ã䌎ããŸããäžæ¹ãFFNå±€ã倿®µåããããšã¯ãåå±€ã®è¿°èªæ°ãæãã€ã€ãã¢ãã«å šäœã®è¡šçŸåãé«ããããšãã§ããŸããã€ãŸããåãèšç®éã®äžã§ãããå¹ççã«æå³è¡šçŸã®åŠç¿ãè¡ããããã«ãªãå¯èœæ§ãããã®ã§ãã4. åŸé ã®æµãã®æ¹åæ·±ãéå±€æ§é ãæã€ãã¥ãŒã©ã«ãããã¯ãŒã¯ã§ã¯ããã°ãã°åŸé ã®æ¶å€±ãççºã®åé¡ãçããŸããFFNå±€ã®å€æ®µåã¯ããã®åé¡ãç·©åããå¯èœæ§ããããŸããåæ®µã®FFNå±€ã«ã¹ãããæ¥ç¶ïŒResidual ConnectionïŒãå°å ¥ããããšã§ãåŸé ãã¹ã ãŒãºã«æµããããã«ãªããæ·±ãéå±€æ§é ã®åŠç¿ãå®å®åãããããããŸããã5. æå³è¡šçŸã®æ®µéçãªæŽç·ŽFFNå±€ã®å€æ®µåã¯ãæå³è¡šçŸã®æ®µéçãªæŽç·Žã®ããã»ã¹ãæç€ºçã«ã¢ãã«åããããšã«å¯Ÿå¿ãããšèããããŸããåæ®µã®FFNå±€ããåæ®µã®æå³è¡šçŸãããã«æŽç·Žããããé©åãªåœ¢ã«å€æããŠããããšã§ãæçµçã«ãã質ã®é«ãæå³è¡šçŸãåŸãããããã«ãªãã®ã§ããããã¯ã人éã®èšèªçè§£ããæ®µéçãªæå³ã®æŽç·ŽãçµãŠè¡ãããŠãããšããèãæ¹ãšãæŽåçã ãšèšããŸãã以äžã®ããã«ãFFNå±€ã®å€æ®µåã¯ãGPTåãšãŒãžã§ã³ãã®æ§èœåäžã«å¯äžããå¯èœæ§ã®ãããéåžžã«ææãªã¢ãããŒãã ãšæããŸãããã ããå®éã«ãã®ã¢ãããŒããå°å ¥ããéã«ã¯ãããã€ãã®èª²é¡ãèæ ®ããå¿ èŠããããŸããäŸãã°ãæé©ãªæ®µæ°ãã©ã®ããã«æ±ºå®ããããåæ®µã®è¿°èªæ°ãã©ã®ããã«èšå®ããããæ®µéã®æ¥ç¶ãã©ã®ããã«èšèšããããªã©ã§ãããŸããFFNå±€ã®å€æ®µåãããããèšç®éã®å¢å€§ãšãæ§èœåäžã®ãã¬ãŒããªããæ éã«è©äŸ¡ããå¿ èŠãããã§ããããACVLã®èгç¹ããã¯ãFFNå±€ã®å€æ®µåã¯ãè¿°èªçºæ³çµç¹ã«ãããæå³ã®æœè±¡åãšçµ±åã®ããã»ã¹ããã粟緻ã«ã¢ãã«åããããšã«å¯Ÿå¿ãããšè§£éã§ããŸããåæ®µã®FFNå±€ããæå³è¡šçŸã®éå±€çãªå€æãæ ãããšã§ãããæ·±ãè±ããªæå³ã®çè§£ãšçæãå¯èœã«ãªãã®ã§ãããã®è§£éã«åºã¥ãã°ãFFNå±€ã®å€æ®µåã¯ãåãªãèšç®äžã®ãã¯ããã¯ã§ã¯ãªããèšèªçç¥æ§ã®æ¬è³ªçãªåŽé¢ãæããããã®éèŠãªæãããã«ãªãåŸããšèšããŸããä»åŸããã®ã¢ãããŒããACVLã®æ çµã¿ã®äžã§çè«çã«ç²Ÿç·»åãããã®æå¹æ§ãå®èšŒçã«æ€èšŒããŠããããšãæ±ããããã§ãããããŸãããã®ã¢ãããŒãã¯ãèšèªçç¥æ§ã®è³å ã¡ã«ããºã ãçè§£ããäžã§ã瀺åçãããããŸããã人éã®è³ã«ãããŠããèšèªæ å ±ã®åŠçãéå±€çãªå€æã®ããã»ã¹ãçµãŠè¡ãããŠããããšã瀺åãããŠããããã§ããFFNå±€ã®å€æ®µåãšãè³ã®èšèªåŠçã¡ã«ããºã ã®é¢ä¿æ§ãæ¢æ±ããããšã§ãèšèªçç¥æ§ã®ç¥çµåºç€ã«é¢ããæ°ããªç¥èŠãåŸãããå¯èœæ§ããããŸããFFNå±€ã®å€æ®µåã¯ãGPTåãšãŒãžã§ã³ãã®æ§èœåäžãšãèšèªçç¥æ§ã®çè§£ã«åããéåžžã«è峿·±ãã¢ãããŒãã ãšæããŸãããã®ã¢ãããŒãã®å¯èœæ§ã远æ±ããããšã§ãACVLã®çè«çã»å®è·µçãªçºå±ã倧ããä¿é²ãããããšãæåŸ ããŠããŸãã瀺åã«å¯ããææ¡ãããã ããããããšãããããŸããããã®åé¡ã«ã€ããŠãããã«æŽ»çºãªè°è«ã亀ãããããšã楜ãã¿ã«ããŠããŸããâ»ç¶ãâã人工ç¥èœã®å¯èœæ§ æ©æ¢°ã¯äººéãšåãæèåãæãŠãã®ã åè¡æ¬ïŒãœããã«ããŒïŒ â 2021/3/31ãã©ã€ã¢ã³ã»ãã£ã³ããŠã§ã«ã»ã¹ãã¹ (è), & 2 ãã®ä»ãâ»æåâãç¥èœã¯ã³ã³ãã¥ãŒã¿ã§å®çŸã§ããã: 人工ç¥èœ(AI)倧è«äº åè¡æ¬ â 1992/11/1S.R.Graubard (ç·šé), ææ¬ å (翻蚳)ãâ»å é âããã£ãŒãã©ãŒãã³ã° åè¡æ¬ïŒãœããã«ããŒïŒ â 2020/1/21ãžã§ã³ã»Dã»ã±ã¬ã㌠(è), & 2 ãã®ä»ã
ترست ؚاÙÙÙØª
Ù ÙØ° Ø£Ø³ØšÙØ¹ÙÙ
Ù ÙØ° ØŽÙØ±ÙÙ