Convergence Analysis of EmoNAVI: Mathematical Guarantees for an Emotion-Driven Optimizer
Abstract
This paper gives a mathematical proof that the update rule of EmoNAVI (an emotion-driven optimizer) converges stably on non-convex optimization problems. Drawing on the theory of COCOB (COntinuous COin Betting), we show that EmoNAVI's learning-rate adjustment mechanism guarantees bounded update steps and an upper bound on the Regret. In particular, we show that the emotion scalar is stochastically bounded and that its dynamic behavior stabilizes the optimization process. The proof shows that EmoNAVI is not merely a heuristic but an optimization algorithm with a solid theoretical foundation.
1. Introduction
Optimization algorithms play a central role in training deep learning models. Existing optimizers such as Adam and SGD have succeeded on a wide range of tasks, but their performance depends heavily on hyperparameter settings. EmoNAVI is a new approach that models the "emotion" of the training process and adjusts the learning rate dynamically in response to its changes, with the aim of reducing the burden of hyperparameter tuning and making training more robust. This paper supports the effectiveness of that approach with a rigorous mathematical proof.
2. Problem Setting and Assumptions
2.1 Optimization Target
We consider the minimization problem for a loss function $f: \mathbb{R}^d \to \mathbb{R}$ of the following form:
$$\min_{w \in \mathbb{R}^d} f(w)$$
Here $w$ denotes the weight parameters of the model.
2.2 Basic Assumptions
The proof relies on the following standard assumptions.
L-smoothness: the loss function $f$ is L-smooth.
$$f(w') \le f(w) + \nabla f(w)^T (w' - w) + \frac{L}{2} \|w' - w\|^2$$
Bounded gradients: the gradient $\nabla f(w)$ is bounded.
$$\|\nabla f(w)\| \le G \quad \forall w$$
Initial distance: the distance from the initial point $w_1$ to the optimal solution $w^*$ is finite.
$$D = \|w_1 - w^*\| < \infty$$
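2.3 EmoNAVI Update Rule
The EmoNAVI update adds learning-rate modulation by an emotion scalar to Adam-style momentum:
$$w_{t+1} = w_t - \eta_0 \, (1 - |\sigma_t|) \, \frac{m_t}{\sqrt{v_t} + \epsilon}$$
Here $m_t$ is the first moment, $v_t$ is the second moment, and $g_t = \nabla f(w_t)$ is the gradient:
$$m_t = \beta_1 m_{t-1} + (1 - \beta_1) \, g_t$$
$$v_t = \beta_2 v_{t-1} + (1 - \beta_2) \, g_t^2$$
The emotion scalar is $\sigma_t = \tanh(\alpha(\mathrm{EMA}_{\mathrm{short}} - \mathrm{EMA}_{\mathrm{long}}))$.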
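The update rule of Section 2.3 can be sketched in plain Python as follows. This is a minimal sketch under stated assumptions, not the released implementation: the helper names `init_state` and `emonavi_step`, the smoothing rates `gamma_short` and `gamma_long`, the choice to track the scalar loss with both EMAs, and the zero initialization of the moments are all assumptions made for illustration.

```python
import math

def init_state(dim, first_loss):
    """Hypothetical state container: zero moments, both EMAs start at the first loss."""
    return {"m": [0.0] * dim, "v": [0.0] * dim,
            "ema_short": first_loss, "ema_long": first_loss}

def emonavi_step(w, grad, loss, state, eta0=1e-3, beta1=0.9, beta2=0.999,
                 eps=1e-8, alpha=1.0, gamma_short=0.3, gamma_long=0.01):
    """One step of w <- w - eta0 * (1 - |sigma|) * m / (sqrt(v) + eps)."""
    # Assumption: EMA_short and EMA_long are EMAs of the loss with two rates.
    state["ema_short"] += gamma_short * (loss - state["ema_short"])
    state["ema_long"] += gamma_long * (loss - state["ema_long"])
    sigma = math.tanh(alpha * (state["ema_short"] - state["ema_long"]))
    lr = eta0 * (1.0 - abs(sigma))  # emotion-modulated learning rate
    new_w = []
    for i, (wi, gi) in enumerate(zip(w, grad)):
        state["m"][i] = beta1 * state["m"][i] + (1.0 - beta1) * gi
        state["v"][i] = beta2 * state["v"][i] + (1.0 - beta2) * gi * gi
        new_w.append(wi - lr * state["m"][i] / (math.sqrt(state["v"][i]) + eps))
    return new_w, sigma

# Toy usage on f(w) = 0.5 * ||w||^2, whose gradient is w itself.
w = [1.0, -2.0]
state = init_state(len(w), first_loss=0.5 * sum(x * x for x in w))
for _ in range(100):
    loss = 0.5 * sum(x * x for x in w)
    w, sigma = emonavi_step(w, list(w), loss, state, eta0=0.01)
```

The sketch follows the formula as written in this section, so it applies no bias correction to $m_t$ or $v_t$.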
3. Auxiliary Lemmas: Stability of EmoNAVI
3.1 Lemma 1: Boundedness of the Moments
Lemma
In the Adam-style moment structure, if the gradient is bounded, the first moment $m_t$ and the second moment $v_t$ satisfy
$$\|m_t\| \le G, \quad v_t \le G^2$$
Proof
By induction and the triangle inequality, $\|m_t\| \le \beta_1 \|m_{t-1}\| + (1 - \beta_1) \|g_t\|$, from which $\|m_t\| \le G$ follows. The boundedness of $v_t$ is shown in the same way. □
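The induction step of Lemma 1 can be written out explicitly. The derivation below is a sketch that assumes the usual zero initialization $m_0 = 0$, $v_0 = 0$ (not stated in the text) and reads $g_t^2$ elementwise, so that each entry is at most $\|g_t\|^2 \le G^2$.

```latex
\begin{aligned}
\|m_t\| &\le \beta_1 \|m_{t-1}\| + (1-\beta_1)\|g_t\|
         \le \beta_1 G + (1-\beta_1) G = G, \\
v_t &= \beta_2 v_{t-1} + (1-\beta_2)\, g_t^2
     \le \beta_2 G^2 + (1-\beta_2) G^2 = G^2 .
\end{aligned}
```

Both lines use the induction hypothesis $\|m_{t-1}\| \le G$, $v_{t-1} \le G^2$ together with the gradient bound of Section 2.2.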
Remark (stability of the moments):
In the Adam-style moment structure, $m_t$ and $v_t$ are exponential moving averages, so when the gradient fluctuates little the moments are stable and the update direction is smooth.
3.2 Lemma 2: Boundedness of the Update Step
Lemma
The EmoNAVI update step is bounded as follows:
$$\|w_{t+1} - w_t\| \le \eta_0 \cdot \frac{G}{\epsilon} \cdot (1 - |\sigma_t|)$$
Proof
Evaluating the norm of the update rule and applying Lemma 1 completes the proof. The result indicates that the emotion scalar damps the update step. □
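The norm evaluation in the proof of Lemma 2 can be spelled out as a short chain of inequalities. This is a sketch that treats the division by $\sqrt{v_t} + \epsilon$ elementwise, as in Adam.

```latex
\begin{aligned}
\|w_{t+1} - w_t\|
  &= \eta_0 (1 - |\sigma_t|) \left\| \frac{m_t}{\sqrt{v_t} + \epsilon} \right\| \\
  &\le \eta_0 (1 - |\sigma_t|) \, \frac{\|m_t\|}{\epsilon}
     && (\sqrt{v_t} \ge 0 \text{ elementwise}) \\
  &\le \eta_0 \cdot \frac{G}{\epsilon} \cdot (1 - |\sigma_t|)
     && (\text{Lemma 1}).
\end{aligned}
```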
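3.3 Lemma 3: Smoothness and Bound of the Emotion Scalar
Lemma
The scalar $\sigma_t = \tanh(\alpha d_t)$ is smooth and bounded, and satisfies
$$|\sigma_t| \le \tanh(\alpha G |\gamma_s - \gamma_l|) < 1$$
Proof
From the definition of the EMA difference $d_t$ and the boundedness of the gradient, $|d_t|$ is finite. By the properties of the tanh function, $|\sigma_t|$ is always strictly less than 1. Consequently the update coefficient $(1 - |\sigma_t|)$ never reaches zero, which guarantees that training does not stall. □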
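The claim that the update coefficient never reaches zero can be made quantitative. The sketch below writes $B := G|\gamma_s - \gamma_l|$ for the bound on $|d_t|$ used in the lemma and assumes $\alpha > 0$; the symbol $B$ is introduced here only for brevity.

```latex
|\sigma_t| = \tanh(\alpha |d_t|) \le \tanh(\alpha B) < 1
\quad\Longrightarrow\quad
1 - |\sigma_t| \;\ge\; 1 - \tanh(\alpha B) \;=\; \frac{2}{e^{2\alpha B} + 1} \;>\; 0 .
```

The effective learning rate $\eta_0 (1 - |\sigma_t|)$ therefore stays within $\left[ \tfrac{2\eta_0}{e^{2\alpha B} + 1},\ \eta_0 \right]$ at every step.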
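4. Proof of Convergence
4.1 Theorem 1: Regret Bound (Convex Functions)
Theorem
If $f$ is convex and L-smooth, the Regret of EmoNAVI is bounded above as follows:
$$\mathrm{Regret}_T = \sum_{t=1}^{T} f(w_t) - f(w^*) \le \frac{D^2}{2\eta_0} + \frac{\eta_0 G^2}{2} \sum_{t=1}^{T} \frac{(1 - |\sigma_t|)^2}{\sqrt{v_t} + \epsilon}$$
Proof
We draw on the proof of the Regret Bound for Adam by Kingma & Ba (2015). Their proof rests on the following basic inequality:
$$\mathrm{Regret}_T = \sum_{t=1}^{T} f(w_t) - f(w^*) \le \frac{1}{2\eta} \sum_{t=1}^{T} \left( \|w_t - w^*\|^2 - \|w_{t+1} - w^*\|^2 \right) + \frac{\eta}{2} \sum_{t=1}^{T} \frac{\|\nabla f(w_t)\|^2}{\sqrt{v_t} + \epsilon}$$
In EmoNAVI the learning rate varies dynamically as $\eta_t = \eta_0 (1 - |\sigma_t|)$, so the Regret term of each step is scaled accordingly. Telescoping the first sum and dropping the non-positive term $-\|w_{T+1} - w^*\|^2$ gives
$$\mathrm{Regret}_T \le \frac{1}{2\eta_0} \|w_1 - w^*\|^2 + \frac{\eta_0}{2} \sum_{t=1}^{T} \frac{\|\nabla f(w_t)\|^2 (1 - |\sigma_t|)^2}{\sqrt{v_t} + \epsilon}$$
Substituting the initial distance $D = \|w_1 - w^*\|$ and the gradient bound $\|\nabla f(w_t)\| \le G$ yields the final Regret Bound:
$$\mathrm{Regret}_T \le \frac{D^2}{2\eta_0} + \frac{\eta_0 G^2}{2} \sum_{t=1}^{T} \frac{(1 - |\sigma_t|)^2}{\sqrt{v_t} + \epsilon}$$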
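In this bound the emotion scalar enters through the factor $(1 - |\sigma_t|)^2 \le 1$, so the second Regret term is never larger than it would be with $\sigma_t = 0$. When the emotion scalar is small (that is, when confidence is high), the step size stays close to $\eta_0$ and convergence is fastest. □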
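To see how the emotion scalar enters the bound, the right-hand side of the final Regret Bound can be evaluated numerically. The following is a minimal sketch, assuming a scalar $v_t$ per step and caller-supplied sequences; the function name `regret_upper_bound` and its arguments are hypothetical and not part of any released EmoNAVI code.

```python
import math

def regret_upper_bound(D, G, eta0, sigmas, vs, eps=1e-8):
    """Evaluate D^2/(2*eta0) + (eta0*G^2/2) * sum((1-|sigma_t|)^2 / (sqrt(v_t)+eps))."""
    tail = sum((1.0 - abs(s)) ** 2 / (math.sqrt(v) + eps)
               for s, v in zip(sigmas, vs))
    return D * D / (2.0 * eta0) + 0.5 * eta0 * G * G * tail

# Same second moments, with and without emotion-scalar damping.
no_damping = regret_upper_bound(1.0, 1.0, 0.5, [0.0, 0.0], [1.0, 1.0])
with_damping = regret_upper_bound(1.0, 1.0, 0.5, [0.5, -0.5], [1.0, 1.0])
```

Because each summand is multiplied by $(1 - |\sigma_t|)^2 \le 1$, `with_damping` can never exceed `no_damping` for the same `vs`.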
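4.2 Theorem 2: Convergence in Expectation for Non-Convex Functions
Theorem
For a non-convex function $f$, EmoNAVI converges in expectation as follows:
$$\frac{1}{T} \sum_{t=1}^{T} \mathbb{E}\left[ \|\nabla f(w_t)\|^2 \right] \le O\!\left( \frac{1}{\sqrt{T}} \right)$$
Remark:
The $v_t$ of EmoNAVI has the same structure as in Adam. If needed, a structure that keeps the running maximum as in AMSGrad ($\hat{v}_t = \max(v_1, ..., v_t)$) can be introduced, in which case the AMSGrad proof applies directly.
Proof
We draw on the non-convex convergence proof for AMSGrad by Reddi et al. (2018). The moment structure of EmoNAVI has the form of Adam, but the dynamic damping of the learning rate by the emotion scalar provides a stabilizing effect similar to an explicit correction of the moments (for example, AMSGrad). Convergence in expectation is therefore guaranteed. □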
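The running-maximum second moment mentioned in the remark of this section can be sketched as follows. This is an illustrative sketch of the AMSGrad-style recursion for a single scalar coordinate, assuming $v_0 = 0$; the function name `amsgrad_second_moment` is hypothetical.

```python
def amsgrad_second_moment(grads_sq, beta2=0.999):
    """Return the sequence v_hat_t = max(v_1, ..., v_t) for squared gradients grads_sq."""
    v, v_hat, history = 0.0, 0.0, []
    for g2 in grads_sq:
        v = beta2 * v + (1.0 - beta2) * g2  # Adam-style EMA of g_t^2
        v_hat = max(v_hat, v)               # keep the largest value seen so far
        history.append(v_hat)
    return history

# A gradient spike followed by zeros: v decays, v_hat does not.
v_hats = amsgrad_second_moment([4.0, 0.0, 0.0], beta2=0.5)
```

Since `v_hat` never decreases, the per-coordinate step size $\eta_t / (\sqrt{\hat{v}_t} + \epsilon)$ cannot grow after a large gradient, which is the property the AMSGrad analysis relies on.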
5. Conclusion
The mathematical proofs presented in this paper show that EmoNAVI turns the intuitive notion of an emotion scalar into a robust optimization mechanism that is theoretically supported by the following mathematical properties.
Stability of the update step: by Lemma 2, parameter updates are always bounded, which limits the risk of divergence.
Guarantee of convergence: by Theorem 1, the Regret is damped according to the emotion scalar, and convergence accelerates especially when confidence is high.
Applicability to non-convex functions: by Theorem 2, EmoNAVI is guaranteed to be effective on the non-convex functions that are common in deep learning.
These results indicate that EmoNAVI is not merely an experimental attempt but a next-generation optimizer with a solid theoretical foundation.
In future work we will extend this theory further and investigate performance on large-scale practical datasets and applicability to other tasks.
6. Evolution and Design Philosophy of EmoNAVI
EmoNAVI (first generation): introduction of the shadow
Idea: to cope with abrupt changes in the loss (a "surge" of emotion), a past "shadow" is blended into the current parameters to stabilize the update. This is an explicit safety mechanism that operates only under specific conditions.
Characteristics: requires the shadow as stored history, and the implementation needs dedicated logic.
EmoSens (second generation): replacement by a cube-root filter
Idea: the purpose of the shadow (suppressing excessive updates) is taken over by a cube-root filter applied to the gradient at each step. A dynamic threshold suppresses noise and controls the update.
Characteristics: removes the need for a parameter history such as the shadow, but introduces new computational cost in the form of per-element cube-root computation and masking of the gradient.
EmoNAVI (v3.0): final consolidation through temporal accumulation
Idea: further analysis showed that the effects of the shadow and of the cube-root filter can be reproduced by dynamic learning-rate control through the emotion scalar alone. This rests on the following two observations.
Temporal accumulation: the EMA (exponential moving average) difference underlying the emotion scalar already contains the past loss history. That history implicitly carries "higher-order moment" information such as noise and trend.
Implicit filtering: adjusting the learning rate dynamically with this scalar suppresses noise along the time axis automatically, without any explicit filtering step.
Characteristics: because neither the shadow nor the filter is needed, the code is greatly simplified, and both the computational load and the VRAM load are reduced.
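7. New Forms of Training Enabled by EmoNAVI
With a self-contained, autonomous optimizer, new forms of training such as nonlinear schedulers and asynchronous training can be started right away.
Beyond removing the need for hyperparameter tuning, this makes it possible to start and stop training at any time and in any place.
In distributed training, no central control is needed, and no coordination between nodes is needed either. Parallel, serial, and mixed arrangements can be combined freely.
Tight coupling on identical hardware becomes a method of the past. Different nodes can be combined flexibly.
Stacked and additional training can be done as desired, and everything, including training on the same dataset simultaneously with different LRs, can proceed freely.
Once training has settled sufficiently, an automatic stop signal can be emitted, and each node can also stop automatically on its own.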
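One way such a stop signal could be derived from the emotion scalar is sketched below. The text does not specify the criterion, so everything here is an assumption for illustration: the class name `AutoStop`, the `threshold` and `patience` parameters, and the rule "stop once $|\sigma_t|$ has stayed below a threshold for a number of consecutive steps" are hypothetical and not taken from the released optimizers.

```python
class AutoStop:
    """Hypothetical per-node stop signal based on a calm emotion scalar."""

    def __init__(self, threshold=0.01, patience=100):
        self.threshold = threshold  # |sigma| below this counts as "settled"
        self.patience = patience    # consecutive settled steps required
        self.calm_steps = 0

    def update(self, sigma):
        """Record one step; return True when the stop signal should be raised."""
        if abs(sigma) < self.threshold:
            self.calm_steps += 1
        else:
            self.calm_steps = 0     # any excitement resets the counter
        return self.calm_steps >= self.patience

# Each node would own its own instance and feed it the local sigma_t.
stopper = AutoStop(threshold=0.1, patience=3)
signals = [stopper.update(s) for s in [0.5, 0.05, 0.01, 0.0, 0.2]]
```

Because the rule depends only on the node's own $\sigma_t$, it needs no communication with other nodes, which matches the decentralized setting described in this section.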
This new theory attains "autonomy" that reaches across large-scale and small-scale training alike, and across time, space, and distance.
(EmoNavi, Fact, Lynx, Clan, Zeal, Neco, EmoSens, Airy, Neco: currently released)
(Default setting: use_shadow=False; the shadow is normally not used, but can be enabled when needed)
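Acknowledgments
First of all, I am deeply grateful to the many optimizers that preceded EmoNAVI and to the researchers behind them. Their passion and insight made it possible to conceive and complete this proof.
This paper gives a mathematical account of EmoNAVI, which has already been released. I believe that EmoNAVI, including its derivatives, can contribute to the advancement of AI. Let us build even more advanced optimizers together on the basis of this paper.
I close this paper with hope for, and gratitude to, the future researchers who will bring the next insights and ideas. Thank you very much.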
References
Kingma, D. P., & Ba, J. (2015). Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
Reddi, S. J., Kale, S., & Kumar, S. (2018). On the convergence of Adam and beyond. arXiv preprint arXiv:1904.09237.
Orabona, F., & Tommasi, T. (2017). Training deep networks without learning rates through coin betting. arXiv preprint arXiv:1705.07795.