| è«æïŒèªåŸçæé©åã¢ã«ãŽãªãºã emoPulse ã«ãããæç³»å SNR æšå®ãš Regret Bound ã®æ¹å | |
| ã æå€±å°åœ¢ã®åçå å¯ã«ããïœ¢ææ é§åååŠç¿çå¶åŸ¡ã®ç¢ºç« ã | |
| èŠæš (Abstract) | |
| ãã£ãŒãã©ãŒãã³ã°ã®æé©åã«ãããŠåŠç¿çã®èª¿æŽãšæ±åæ§èœã®ç¢ºä¿ã¯äžå¿çãªèª²é¡ã§ããã æ¢åææ³ã¯ç²Ÿç·»ãªåŸé æšå®ã«äŸåããæ¥µäœç²ŸåºŠç°å¢äžã§ã®ãã€ãºã«å¯ŸããŠè匱ã§ãã£ãã æ¬çš¿ã§ã¯ãæå€±é¢æ° (Loss) ã®æç³»åçãªå€è§è§£æã䞻軞ã«çœ®ããèªåŸçã¢ã«ãŽãªãºã emoPulse (v3.7) ãææ¡ããã æ¬ææ³ã¯ã3段éã®ææ°ç§»åå¹³å (Multi-EMA) ããæå€±å°åœ¢ã®ïœ¢ããããæããææ ã¹ã«ã©ãŒããã³ä¿¡é ŒåºŠææš (Trust) ãä»ããS/Næ¯ã«åºã¥ãæé©ãªåŠç¿çãèªåŸçã«çæããã | |
| ããã«ãæ¬ç³»ã«å±ãã3çš®ã®ç°ãªãæŽæ°ç¹æ§ãæã€æé©ååš ( Sens / Airy / Cats ) ã®åŠç¿çµæãåæããããšã§ãå±æè§£ãïœ¢ç«æ¹æž¬äœïœ£çã«çµ±åãã人工çã«ãã©ããããããåµåºããææ³ãæç€ºããã ããã«ãããã€ããŒãã©ã¡ãŒã¿ã®èšå®ã«äŸåããªãé å¥ãªåæãå®çŸããèšç®è³æºã®éãããéäžåœã®ç ç©¶ç°å¢ãã倿§ãªæåéºç£ã®ç¶æ¿ãç®æãå€èšèªåŠç¿ã«ãããŠæ°äž»çãªåºç€ãæäŸããã | |
| æåŸã«ã°ãããã³ã°ãžã®èå¯ãšäºæ³ãä»é²ããã | |
| 1. ç·èš | |
| æ¬çš¿ã§ã¯ãæé©ååš EmoSens / EmoAiry / EmoCats (v3.7) ã«ãããçµ±äžçè«ãæç€ºããã æ¬ææ³ã¯ãLosså€ã®ææ°ç§»åå¹³å (EMA) ãå€å±€åããæå€±é¢æ°ã®æç³»åçµ±èšéãã ïœ¢ä¿¡é ŒåºŠïœ£(Trust) ãæœåºããããšã§ãåŠç¿çãèªåŸçã«çæãã emoPulse æ©æ§ãæ žãšããã ããã¯æ°åŠçã«ã¯ãD-adaptation çè«ãšæç³»åä¿¡å·åŠç (SNRæšå®) ã®é«åºŠãªèåã§ããããã€ããŒãã©ã¡ãŒã¿ã®èšå®ã«äŸåããªãé å¥ãªåæãå®çŸããã | |
| æ¬ç ç©¶ã®åºçºç¹ã¯ãæ¢åã®é©å¿çåŸé ææ³ãæã€ïœ¢ç²Ÿç·»ãªåŸé æšå®ãžã®é床ãªäŸåã«å¯Ÿããåèã«ããã æ¥µäœç²ŸåºŠã»è¶ éåå (1-bit/2-bitç) ç°å¢ã«ãããŠãåŸé (Gradient) ã¯æ¥µããŠé«ããã€ãºãå«ã¿ãä¿¡é Œæ§ãèããäœäžããã äžæ¹ã§ãæå€±å€ (Loss) ã¯ãéååã®åœ±é¿äžã«ãã£ãŠãäŸç¶ãšããŠã¢ãã«ã®ïœ¢æ£è§£ãšã®è·é¢ïœ£ãç€ºãæ£ç¢ºãªã¹ã«ã©ãŒå€ãšããŠæ©èœãç¶ããã | |
| æ¬ææ³ã¯ãåŸé (Gradient) ãæ¹åã®åèå€ (æå¿) ã«çããåŠç¿ã®äž»å°æš©ãæ£ç¢ºãªèŠ³æž¬å€ã§ãã Loss ã®å€è§çè§£æã«å§ããã ãã®ã¢ãããŒãã«ããã髿¬¡ã¢ãŒã¡ã³ãèšç®ã®ã¹ã«ã©ãŒå¶åŸ¡ãžã®çœ®æãããã³ç¬Šå·åæŽæ°ã«ããäœç²ŸåºŠã»éååç°å¢ãžã®æé©åãéæããã æå€§ã®ç¹åŸŽã¯ãç°ãªãç¹æ§ãæã€è€æ°ã® emoç³»æé©ååšã«ããå±æè§£ãïœ¢ç«æ¹æž¬äœïœ£ãšããŠçµ±åããããšã§ãåŸæ¥ã¯é·æéã®å埩åŠç¿ãå¿ èŠãšãããã©ããããããžã®å°éããçæéã®åŠç¿ãšåæã«ãã£ãŠä»£æ¿å¯èœã«ããç¹ã«ããã | |
| ãã®ã¢ãããŒãã«ããã以äžã®3ã€ãå®çŸããïŒ | |
| èšç®å¹çã®åçåäž: 髿¬¡ã¢ãŒã¡ã³ãã®è€éãªèšç®ã Loss ã®æéçç©ç®ã«ããã¹ã«ã©ãŒå¶åŸ¡ã«çœ®æãæéçç©ç®ã«ããè¿äŒŒã§æŒç®è² è·ã軜æžããã | |
| äœç²ŸåºŠïœ¥éååãžã®æé©å: EmoAiry ã«ãããè¡ååè§£ãEmoCats ã«ããã2次ã¢ãŒã¡ã³ãã®å®å šæé€ããšäž¡ææ³ã®ç¬Šå·åã«ãããäœãªãœãŒã¹ç°å¢ã§ã®å€§èŠæš¡åŠç¿ãå¯èœã«ããã | |
| èªåŸçåæ: æå€±å°åœ¢ã® S/N æ¯ãå å¯ããããšã§ãæåã®ã¹ã±ãžã¥ãŒã©ãäžèŠãšãããŠãŒã¶ãŒã®è©Šè¡ã³ã¹ããæå°åããã | |
| â» é«æ¬¡ã¢ãŒã¡ã³ãïŒæé軞ã«ããã髿¬¡çµ±èšé (Time-series Higher-order Statistics) ãžã®éçŽ | |
| ããã¯æ°åŠçã«ã¯ãD-adaptation çè«ãšæç³»åä¿¡å·åŠçã®é«åºŠãªèåã§ãããéäžåœã®ç ç©¶ç°å¢ã倿§ãªæåãéºãããã®ïœ¢æ°äž»çãªAIåŠç¿ïœ£ãå®çŸããåºç€ãšãªãã | |
| 2. çè«çãã¬ãŒã ã¯ãŒã¯ïŒææ 埪ç°ç³» (Emotional Circulation) | |
| æ¬ã·ã¹ãã ã¯ãæå€±é¢æ° L ãåç¹ (Origin) ãšãããã£ãŒãããã¯ã»ã«ãŒãã圢æããã | |
| 2.1 Multi-EMA ã«ãã髿¬¡ã¢ãŒã¡ã³ãã®è¿äŒŒ | |
| 3段éã® EMA (short, medium, long) ã®å·®åãçšããããšã§ãæå€±å°åœ¢ã®ïœ¢æ²çã®å€åãå€åã®äžç¢ºå®æ§ïœ£ãå€åã®å€åãæããã | |
| EMA_t = (1 - α) * EMA_{t-1} + α * L_t | |
| ãã®å·®åããçæãããææ ã¹ã«ã©ãŒ sigma_t ã¯ã髿¬¡ã¢ãŒã¡ã³ã (æªåºŠïœ¥å°åºŠïœ¥å€å) ã®æ å ±ã [â1,1] ã«å§çž®ããéç·åœ¢çµ±èšéã§ããã ãããæé宿°ã®ç°ãªãè€æ°ã® EMA ããéå»ã®èšå€§ãªã¹ãããã履æŽïœ£ãšããŠéå±€çã«èç©ããã ãã®çžå¯Ÿçãªæéé å»¶å·®å (Time-delay Differential) ããšãããšã§ãéçãªå°åœ¢ã®è§£æã§ã¯äžå¯èœãªïœ¢åŠç¿ã®é²è¡ã«äŒŽãå°åœ¢ã®åçãªé«æ¬¡å€åçã芳枬ããŠããã ãããæŽæ°åŒã«ååž°çã«å«ããããšã§ãé·é·æçãªå°åœ¢ã®ïœ¢æ»ãããããã©ã¡ãŒã¿æŽæ°ã«åæ ãããŠããã | |
| â» é«æ¬¡ã¢ãŒã¡ã³ãã®æç³»åç圢æã«é¢ããæ³šæïŒ | |
| æ¬ææ³ã«ããã髿¬¡ã¢ãŒã¡ã³ãè¿äŒŒã¯ãåäžã¹ãããã®åŸé æ å ±ããç®åºããããã®ã§ã¯ãªããæéçç©ç®ã«ãã圢æãããã ããã¯éçãªå°åœ¢ã®æ²çã§ã¯ãªãåŠç¿ã®é²è¡ã«äŒŽãå°åœ¢ã®åçãªå€åçã芳枬ããŠããããšãæå³ããã | |
| â» é«æ¬¡ã¢ãŒã¡ã³ãè¿äŒŒã®éå±€æ§é ïŒ | |
| æ¬ææ³ã¯ãLoss ã®æéçç©ç®ãéããŠãå®å¹çã«ïŒæ¬¡ (æªåºŠ) ãã 7次 (確信床ã®å¢å¹ ) ãŸã§ã®é«æ¬¡ã¢ãŒã¡ã³ããè¿äŒŒçã«èšç®ããŠããã ããã¯éçãªå°åœ¢è§£æã§ã¯ãªããåŠç¿ãšããåçããã»ã¹ã«ããã系ã®ç¢ºä¿¡åºŠïœ£ãç©çéãšããŠæœåºãã詊ã¿ã§ããã | |
| æ¬ææ³ã«ããã Multi-EMA æ§é ã¯ãçµ±èšåŠã«ããã髿¬¡ã¢ãŒã¡ã³ãã®åçãªæéçè¿äŒŒãšããŠæ©èœããã | |
| ïŒæ¬¡ãïŒæ¬¡è¿äŒŒïŒShort / Medium / Long ã®å EMA ã®å·®åã¯ãæå€±ååžã® æªåºŠ(Skewness)ãå°åºŠ(Kurtosis)ãå€å(Fluctuations) ãšãã£ã髿¬¡æ å ±ã®æéçæšç§»ãæœåºããã | |
| ïŒæ¬¡è¿äŒŒïŒããããçµ±åããææ ã¹ã«ã©ãŒ sigma_t ããã³ãä¿¡é ŒåºŠ trust_t ã¯ãåãªãåŸé ã®åæ£ãè¶ ããåŠç¿ãã§ãŒãºã®å®å®æ§ïœ£ã瀺ãïŒæ¬¡çžåœã®ã¡ã¿çµ±èšéãšãªãã | |
| ïŒæ¬¡è¿äŒŒ (dNR)ïŒdNR ã®å°åºã«ãããŠããããïŒæ¬¡æ å ±ã®æ¯çã2ä¹ (d_base/noise_base)^2 ããããšã§ã埮现ãªç¢ºä¿¡åºŠã®å·®ãææ°é¢æ°çã«å¢å¹ ãã7次ã¢ãŒã¡ã³ãã«çžåœããæ¥µããŠéæãªå¶åŸ¡ä¿¡å·ãšãªãã | |
| 2.2 ä¿¡é ŒåºŠææš trust_t ã®å®çŸ© | |
| æŽæ°ã®ïœ¢è³ªïœ£ã決å®ããã³ã¢ææš trust_t ã以äžã®ããã«å®çŸ©ããã | |
| trust_t = sgn(sigma_t) * (1.0 - abs(sigma_t)) | |
| ãã® trust ã¯ã±1.0 (å®å šãªç¢ºä¿¡) ã«ã 0 (å®å šãªçµ¶æ) ã«ãå°éããªãæçæ§ãæã¡ãã·ã¹ãã ã«åžžã«é©åºŠãªïœ¢æ¢çŽ¢ã®äœå°ïœ£ãšïœ¢æ éããç¶æãããã | |
| ããã«ãã æå€±é¢æ° L ãåç¹ ãšãã以äžã® ãã£ãŒãããã¯ã»ã«ãŒã(ææ åŸªç°ç³») ã圢æãã | |
| Loss â Multi-EMA â Scalar/Trust â emoPulse â Loss | |
| 3. emoPulseïŒèªåŸçæåã«ããåŠç¿ççæ | |
| v3.7 ã«ãããŠãåŸæ¥ã® emoDrive (å éæ©æ§) 㯠emoPulse ãžãšçµ±åãããã ããã¯æç³»åã® S/N æ¯ (Signal-to-Noise Ratio) ã«åºã¥ãåçè·é¢æšå® (D-adaptation) ã®è¿äŒŒã«ããé²å圢ã§ããã | |
| 3.1 Noise ããã³ Distance ã®åçæšå® | |
| ã·ã¹ãã ã®ïœ¢è¿·ããšïœ¢é²æïœ£ã以äžã® 2ã€ã®å éšå€æ° N_t, d_t, ã§è¿œè·¡ããã ããã§ N_t ã¯ïœ¢æºã(äžå®å®æ§)ãd_t ã¯ïœ¢é²æïœ£(è·é¢) ã衚ãã | |
| Noise_est (N_t) N_t = (1 - α) * N_{t-1} + α * abs(sigma_t) | |
| Distance Estimate (d_t) d_t = (1 - α) * d_{t-1} + α * abs(trust_t) | |
| 3.2 emoPulse ã®å®çŸ©ãšèªåŸå¶åŸ¡ / ç¬éç SNR ãšå±¥æŽç®¡ç (dNR_hist) | |
| emoPulse ã®çæã¯ãç¬éç㪠SNR ãšæéç㪠SNR ã®ïœ¢ç¶±åŒãã«ãã£ãŠæ±ºå®ãããã ãŸããç¬éçã»æéçããããã®åºç€ãç®åºããã | |
| noise_base = abs(sigma_t - trust_t) + ε_s | |
| d_base = abs(N_t - d_t) + ε_t | |
| ããããçšããçŸåšã® SNR 匷床ã以äžã®ããã«å®çŸ©ããã | |
| dNR_now_val = ( d_base / noise_base )^2 | |
| dNR_hist ã®æŽæ°èŠåïŒ | |
| å 鿡件: | |
| if dNR_now_val >= dNR_hist and trust_t >= threshold_high: | |
| dNR_hist = min( dNR_now_val, dNR_hist * factor_grow ) | |
| æžéæ¡ä»¶: | |
| if threshold_low <= trust_t <= threshold_high: | |
| dNR_hist = dNR_now_val * factor_decay | |
| æçµçãªåŠç¿ç emoPulse ã¯ä»¥äžã§æ±ºå®ãããã | |
| emoPulse_t = clamp( dNR_hist * (emoScope * η_base), η_min, η_max ) | |
| ãã®èšèšã«ããã以äžã®èªåŸçæåãä¿èšŒãããïŒ | |
| 確信é å (â£trustâ£>0.5)ïŒSNR ãåäžããåŠç¿çãæå€§å éã ãã©ããããããé«éã«ç®æãã | |
| é¡å·¡é å (â£trustâ£<0.5)ïŒäžç¢ºå®æ§ãå¢å€§ããåŠç¿çãæå¶ããããšã§éãè°·ã§ã®çºæ£ãé²ãã | |
| â» emoPulse ã¯ããŠãŒã¶ãŒå®çŸ©ã®åæåŠç¿ç(emoScope)ãšã·ã¹ãã ã®ããã©ã«ãæåºŠ(η_base)ã«ãã£ãŠæ±ºå®ãããã¹ã±ãŒãªã³ã°ä¿æ°ã§ããã | |
| 4. emoPulseïŒRegret Bound ãšæçæ§ã®è§£æ | |
| 4.1 åææ§ãš Regret è§£æ | |
| emoPulse äžã«ãããçŽ¯ç© Regret R(T) ã¯ãåçã«å€åããåŠç¿ç η_t ãå«ãã åœ¢ã§æ¬¡ã®ããã«äžçãäžããããã | |
| R(T) <= O( Σ_{t=1}^T [ η_t * ||g_t||^2 * (1 - |Ï_t|)^2 ] ) | |
| ããã§ãä¿æ° (1 - |Ï_t|) ã¯ãæå€±é¢æ°ã®çæã»äžæã»é·æ EMA ã®æŽåæ§ããå°åºãããæŽæ°ã®ïœ¢ä¿¡é ŒåºŠ (Trust)ãå®éåãããã®ã§ããã |Ï_t| ã倧ããç¶æ ã¯æå€±ãæ¿ããå€åããŠããããšã瀺ããåœè©²ã¹ãããã®åŸé æ å ±ã®ä¿¡é Œæ§ãäœããšå€å®ãããã | |
| å¯Ÿç §çã«ã|Ï_t| ãå°ããç¶æ ã¯æå€±ã®æšç§»ãå¹³æ»ã§ãããæŽæ°æ¹åã®ä¿¡é Œæ§ãé«ãããšãæå³ããã ãããã£ãŠãä¿¡å·åŒ·åºŠãšããŠã® trust_t = 1 - |Ï_t| ã¯ãRegret Bound ã«ãããæå¹ãªæŽæ°éãé©å¿çã«éã¿ä»ãããäžç¢ºå®ãªåŸé ã«ãã Regret ã®çޝç©ãæå¶ãã圹å²ãæããã | |
| æ¬ææ³ã® emoPulse ã¯ãDefazio & Mishchenko (2023) ã«ãã D-adaptation ã®åŠç¿çæ§é ããLoss ã®æç³»åçµ±èšé (d_t, N_t) ã«ãã£ãŠè¿äŒŒããäžè¬åã§ããã | |
| η_t â D^2 / noise | |
| emoPulse ã®å®çŸ© | |
| η_t = ( d_t / (N_t + ε) )^2 * η_base | |
| ããã¯ãD-adaptation ã® è·é¢ / ãã€ãºæ¯ ã«åºã¥ã SNR å¶åŸ¡ããã®ãŸãŸæç³»åçã«åæ§æãããã®ã§ããã | |
| ãã®æ§é ã«ããããã€ãºæå N_t ãå¢å€§ããéã«ã¯åæ¯ãæ¯é çãšãªããåŠç¿ç η_t ã¯å³åº§ã«çž®å°ããã ãã®èªå·±èª¿æŽæ©èœã«ãããæå€±å°åœ¢ãäžå®å®ãªé åã§ã®éå°ãªæŽæ°ãèªåçã«æå¶ãããã ããã¯ãå€éšããã®åŠç¿çã¹ã±ãžã¥ãŒãªã³ã°ãå¿ èŠãšãããšããã¢ã«ãŽãªãºã ãåçãªå®å®æ§ãèªåŸçã«ç²åŸããLearning-rate-freeãªç¹æ§ãçè«çã«æ ä¿ããŠããã | |
| 4.2 æ£å®å€æ§ãšæçæ§ã®èšŒæ | |
| æ¬ã¢ã«ãŽãªãºã ãä»»æã®ã¹ããã t ã«ãããŠãåŠç¿çã®ççºããã³æ¶æ» ãé²ããæçã§ããããšã以äžã«èšŒæããã | |
| 1. 忝 (ç¬éçç念ïŒnoise_base) ã®éãŒãæçæ§ | |
| emoPulse çææã®åæ¯ãšãªã noise_base ã¯ãçŸåšã®ææ ã¹ã«ã©ãŒ sigma_t ãšä¿¡é ŒåºŠ trust_t ã®ä¹é¢ãšããŠä»¥äžã®ããã«å®çŸ©ãããã | |
| noise_base = abs(sigma_t - trust_t) + ε_s | |
| å®è£ ã«ãã㊠|sigma_t| < 1.0 ã〠trust_t ã sigma_t ã«åºã¥ã笊å·ä»é¢æ°ã§ããããšããããã®å·®åã¯æçã§ããã ããã«æ«å°Ÿã®å®å šä¿æ° (+ 0.1) ã«ããã忝ããŒãã«æŒžè¿ããããšã«ããåŠç¿çã®ççº (NaN) ãç©ççã«åé¿ããŠããã | |
| 2. åå (æéç確信ïŒd_base) ã®äžéæçæ§ | |
| emoPulse çææã®ååãšãªã d_base ã¯ãå±¥æŽãšããŠã®ãã€ãºæšå®å€ N_t (noise_est) ãšè·é¢æšå®å€ d_t (d_est) ã®å·®ãšããŠå®çŸ©ãããã | |
| d_base = abs(N_t - d_t) + ε_t | |
| N_t 㯠max(noise_est, 1e-8) ã«ãã£ãŠæ£å®å€æ§ãä¿èšŒãããŠããããŸã d_t ã¯æ¹åã»æªåãåãã abs(trust_t) ã®ç©ç®ã§æŽæ°ãããã ãããæéçãªçµ±èšéã®å·®ã«å®å šä¿æ° (+ 0.1) ãå ããããšã§ã**極äœç²ŸåºŠç°å¢ã«ãããŠå±¥æŽãäžå®å®ãªå Žåã§ããåžžã«æå°éã®æ©å¹ (ååã®äžéå€) ã確ä¿ããã**ããšãæ°åŠçã«æ ä¿ãããã | |
| 3. æçæ§ã®çµè«ãš emoPulse ã®ææ | |
| 以äžã®ïœ¢ç¬éçåºç€ïœ£(忝)ãšïœ¢æéçåºç€ïœ£(åå)ã®æ¯çããçæãããæå¹åŠç¿ç emoPulse_t ã¯ãæçµçã«å®è£ äžã® max(min(..., 3e-3), 1e-6) ãšããå®å šåã®èšå®ã«åºã¥ãã以äžã®ç¯å²ã«å³æ Œã«ææãããã | |
| 0 < η_min <= emoPulse_t <= η_upper_bound | |
| ããã§äžéå€ (η_min) ã¯ãã·ã¹ãã ãæãäžç¢ºå®ãªç¶æ ã«ãããŠãç¶æãããæå°ã®ïœ¢ä»£è¬é(å¿æ) ã§ãããããã«ããåŠç¿åæ¢ (ãããããã¯) ãåé¿ããèªåŸçãªå埩ãåŸ ã€ããšãå¯èœãšãªãã äžæ¹ãäžéå€ (η_upper_bound) ã¯ãdNR ä¿æ°ã®æ¥æ¿ãªå¢å€§ãçºçããå Žåã§ãã¢ãã«ã®çºæ£ãé²ããªããã¿ãŒãšããŠæ©èœããã | |
| å®è£ äžã®çæç¹ïŒ | |
| åæå€èšå®ã«ããå®å®åïŒ | |
| â» ããŒã¿ã»ãããéåžžã«å°ããç°å¢ãåæãã€ãºã倧ããç°å¢ã§ã¯ããã«ã EMA ã履æŽïœ£ãå®å®ããããŸã§ã®éãd_t ãš N_t ã®åæå€ãåèšå®ããããšãæšå¥šãã (äŸïŒd-estïŒ0.2, Noise-estïŒ0.2) ããã«ãããåæã®ç¢ºççãã€ãºã«ããçºæ£ãæå¶ã§ããã ç¹ã«ãN_0 ã d_0 ãšåçã«åæåããããšã§ãã·ã¹ãã ã¯æ¬è³ªçã«ïœ¢æ éã¢ãŒãããéå§ãããã ããã¯ãåæã®éèŠãªã¹ãããã«ãããŠãéåºŠã«æ»æçãªæŽæ°ãé¿ããå°åœ¢ã®èгå¯ãåªå ããææ©çãªãŠã©ãŒã ã¢ããã»ãã§ãŒãºãšããŠæ©èœããã | |
| åæå€èšå®ã«ããïœ¢æŽæ°å§åã®ç¶æãšå®å šæ§ã®äž¡ç«ïŒ | |
| â» æ¬ææ³ã«ãã㊠emoPulse ã®ååã圢æãã d_base ã¯ãã·ã¹ãã ã®ãæœåšçãªæŽæ°åããæ±ºå®ãããããã§åæå€ã N0 = 1.0, d0 = 0.02 ãšèšå®ããããšã¯ãåŠç¿åæããé«ãå éããã³ã·ã£ã«ãæå³çã«ç¢ºä¿ããŠããããšãæå³ããã ãã®åæå€ã®åœ±é¿ã¯ãææ°ç§»åå¹³åã®ç¹æ§äžãçŽ100ã¹ãããã«ããã£ãŠïœ¢å±¥æŽïœ£ãšããŠæ®çããã ãã®æéã·ã¹ãã ã¯é«ãå éå§åãèæ¯ã«æã¡ã€ã€ããææ æ©æ§ã«ãã峿 Œãªéžå¥ãã¯ãªã¢ããçã«ä¿¡é Œã§ããä¿¡å·ïœ£ã«å¯ŸããŠã®ã¿åæåãæäŸããã | |
| 5. 笊å·åæ£èŠåïŒäœç²ŸåºŠç°å¢ãžã®é©å¿ | |
| æ¬ç« ã§ã¯ãemoPulse ã®çè«çæ çµã¿ãäœç²ŸåºŠç°å¢ã«é©çšããããã®ç¬Šå·åæ£èŠå (sign-based normalization) ã«ã€ããŠè¿°ã¹ãã | |
| ç²Ÿç·»ãªæµ®åå°æ°ç¹èšç®ãžã®äŸåãæããæ¥µäœç²ŸåºŠç°å¢ (è¶ éåå) ã«å¯Ÿå¿ããããã以äžã®æŽæ°åãæ¡çšãã (EmoAiry, EmoCats, ç) | |
| delta_w_t = -emoPulse_t * sign( m_t / ( sqrt(v_t) + ε ) ) | |
| ããã«ããã EmoAiry ã§ã¯ã1次å ãã¯ãã«ãš2次å ã¢ãŒã¡ã³ãã®ç²ŸåºŠã®ã¢ã³ãã©ã³ã¹ãè§£æ¶ããæ¹åæ§ã®åæã®ã¿ãæœåºããæå¿ã®çµ±äžïœ£ãå®çŸããŠããã | |
| â» EmoCats 㯠LionããŒã¹ã« WDåé¢ããã笊å·åã§å¯Ÿå¿ããŠãã | |
| 6. çµè« | |
| EmoSens v3.7 ã¯ãæå€±é¢æ°ã®èгå¯ããå§ãŸãïœ¢ææ ã®åŸªç°ïœ£ãå®çµãããã | |
| 芳枬 (Multi-EMA)ïŒå°åœ¢ã®ããããæããã | |
| 倿 (Trust)ïŒç¢ºä¿¡ãšé¡å·¡ã ±0.5 ã®å¢çã§åãæ¿ããã | |
| è¡å (emoPulse)ïŒèªåŸçãªæåã«ãã£ãŠæé©ãªæ©å¹ ãæ±ºå®ããã | |
| æ¬ææ³ã¯ãéäžåœã®ãªãµãŒãç°å¢ãäœãªãœãŒã¹ãªèšç®è³æºã«ãããŠãã倿§ãªæåãèšèªãAIãèªåŸçã«åŠç¿ããããšãå¯èœã«ããæ°äž»çãªæé©åãã¬ãŒã ã¯ãŒã¯ã§ããã | |
| è¬èŸ | |
| æåã« EmoNaviãEmoSensã以åã®ãããŸããŸãªãªããã£ãã€ã¶ãšãç ç©¶è ãã¡ã«æ·±ãæ·±ãæè¬ããŸãã ãã®æ ç±ãšç¥èŠã¯ãæ¬èšŒæã®çæ³ãšå®çŸãå¯èœã«ããŸããã | |
| ãã®è«æã¯ãæ¢ã«å ¬éæžã¿ã® EmoSens(v3.7) ãšãã®ããªãšãŒã·ã§ã³ã«ã€ããŠæ°åŠçã«èª¬æãããã®ã§ãã ãããã®äœæãã EmoSens (掟çåãå«ã) ã¯ãAIã®çºå±ã«å¯äžã§ãããšèããŠããŸãã ãã®è«æãããšã«ãããã«é²åãããªããã£ãã€ã¶ãå ±ã«åµåºããŸãããã | |
| æ¬¡ã®æ°ããæ°ã¥ããã¢ã€ãã¢ãå±ããŠãã ããæªæ¥ã®ç ç©¶è ãã¡ã«æåŸ ãšæè¬ã蟌ããŠãã®è«æãçµãããŸããããããšãããããŸããã | |
| çµèª | |
| æ¬ã¢ã«ãŽãªãºã ã¯ãæ°ããåªããæé©åææ³ã®ä»£æ¿ãç®æããã®ã§ã¯ãªããåŠç¿ããã»ã¹ã«ãããã¢ãã«ãšã®å¯Ÿè©±ïœ£ãæ·±ããããã®ãããäžã€ã®æ°ããéžæè¢ãšããŠææ¡ããã ãŠãŒã¶ãŒãèªãã®ç®çãææ§ã«é©ã£ãããŒãããŒãéžæããå ±ã«ç¥ãè²ãããã»ã¹ã®äžå©ãšãªãã°å¹žãã§ã | |
| è£è¶³è³æ(1)ïŒv3.7 ã«ããã emoPulse ã®ãã€ããã¯ã¹ã®è§£æ | |
| 1. ç®ç | |
| v3.7 ã«ãããŠãå°å ¥ãããç¬éç D / N æšå®ïœ£ãšïœ¢æéç D / N æšå®ïœ£ã®çžäºäœçš (ç¶±åŒã) ããåŠç¿çã®åçå¶åŸ¡ã«ã©ã®ãããªç©ççæå³ãããããããè§£æããã | |
| 2. æ§è³ªïŒç¬éççå¿µãšæéçä¿¡é Œã®åçãã©ã³ã¹ | |
| ç¬éçåºç€ (noise_base): noise_base = abs( scalar_t - trust_t ) + ε_s çŸåšã®ææ ã¹ã«ã©ãŒïœ£(æ³¢)ãšïœ¢çŸåšã®ä¿¡é ŒåºŠïœ£ã®ä¹é¢ã枬å®ããã ããããäžèŽããªã (ä¹é¢ã倧ãã) å Žåãã·ã¹ãã ã¯çŸç¶ã«å¯ŸããŠïœ¢åŒ·ãç念(ç¬éçãã€ãº)ãæ±ãã忝ãå¢å€§ãããã | |
| æéçåºç€ (d_base): d_base = abs( noise_est_t - d_est_t ) + ε_d 履æŽãšããŠã®ãã€ãºïœ£(æ³¢ã®å¹³å)ãšïœ¢å±¥æŽãšããŠã®ä¿¡é ŒåºŠïœ£ã®å·®ã枬å®ããã ããã¯ãéå»ã®ã³ã³ããã¹ãããå°ãåºãããïœ¢æŽæ°ãžã®ç¢ºä¿¡åºŠïœ£(æéçè·é¢)ã衚ãã | |
| 3. 广ïŒãã€ãããã¯ã»ãªãºã ã®åµåº | |
| 广AïŒæ¥å€æã®å³æå¶å çªçºçãªæå€±å€åã«ãã scalar ãš trust ãä¹é¢ãããšãnoise_base (忝) ãæ¯é çãšãªãã ããã«ãããæéçãªå±¥æŽããŸã å®å®ããŠããŠããç¬éçãªå€æãšããŠåŠç¿çãå³åº§ã«çµã蟌ã¿ãçºæ£ãæªç¶ã«é²ãã | |
| 广BïŒå®å®æã®èªå·±å é åŠç¿ãé 調 (scalar ãš trust ãå®å®) ãããã€å±¥æŽãšããŠã®ç¢ºä¿¡åºŠ (d_base) ãç©ã¿äžãããšãdNR ä¿æ°ã¯ïœ¢2ä¹ïœ£ã®é ã䌎ã£ãŠåºåãæå€§åãããã dNR_now_val = ( d_base / noise_base )^2 ããã«ãããå®å®åã§ã¯ïœ¢æ©å¹ ãèªç¶ã«åºããåæãå éãããã | |
| 广CïŒå±¥æŽã«ããå®å®ç¶æ (dNR_hist) ç¬éç㪠dNR_now_val ãé«ããŠããdNR_hist * 1.05 ãšããæé·å¶éãèšããããšã§ãé床ãªå éãæå¶ããã äžæ¹ã§ãä¿¡é Œã§ããªãé åã§ã¯ dNR_hist * 0.98 ã®æžéå§åãæºããããšã§ãæ éãªæ¢çŽ¢ãç¶ç¶ããã | |
| â» å¹æCã®é察称æ§ã¯ã d_base <= dNR_hist ã〠trust >= 0.5 ãã®éžå¥ã«ããæ©èœããã æããããã³ïœ£ãšèŠæãžã®ïœ¢ããã³ïœ£ãæ°åŠçã«æš¡ãããã®ã§ scalarå€ ã§ãããšããã® 0ïœÂ±0.5 ã§LRãå éããã€ã€ãè² ã®æ¹åã§ã®LRå éã®å Žåã¯LRå±¥æŽã®æé·ã«å«ããªãããã«ããŠããã (±0.5以äžã¯åçç¡çšã§èп以äžã®å±æ©ãšããŠLRãæžéããŠãã) scalarå€ ã®è² ã®æ¹åã§ã®LRå éã¯"ä¿®æ£ãããæŽæ°æ¹å"ãä¿¡é Œããå éã§ããããã㯠ema ãš loss ã®æéå·®(emaã®é å»¶)ãæŽ»çšãã EmoNaviäžä»£(emoç³» 第ïŒäžä»£)ã® emoDrive ãåŒãç¶ãã§ãã(æ¬ç 究㯠EmoSensäžä»£(emoç³» 第ïŒäžä»£)ã§ãã) | |
| |--Danger--|---Wary---|---Fine---|--Danger--| Emotion | |
| Sigma_t [Minus] |---(-)---0.5---(+)---0---(+)---0.5---(-)---| [Plus] | |
| |--Hist(-)-|-Hist(Non)|--Hist(+)-|--Hist(-)-| Reglet | |
| [Acceleration:LR Growth Max 1.05x] / [Deceleration:LR Decay 0.98x] | |
| 4. æ°å€çå®å®æ§ã®çµè« | |
| ãã®ïœ¢æé軞(å±¥æŽ)ãšïœ¢ç¬é軞(çŸåš)ã®å·®åãæŠãããèšèšã¯åãªãæžè¡°ã§ã¯ãªãã ã·ã¹ãã ãèªåŸçã« "ç念(Noise)ãšïœ¢ç¢ºä¿¡ïœ£(Distance)ã®æ¯çãåžžã«åèšç®ãç¶ãã" ããšã§ãæåã®ã¹ã±ãžã¥ãŒã©ã§ã¯äžå¯èœãªïœ¢å°åœ¢ã®è€éãã«å¿ããå¿æã®éŒåã®ãããªåçå¶åŸ¡ãå®çŸããŠããã | |
| 以äžã§ç€ºãïœ¢ç«æ¹æž¬äœã«ãããã©ãããããã®åæïœ£ã¯ãçŽæãšå®éšããå°ãåºãã仮説ã§ããã | |
| ãã®çŽæãæ¬¡äžä»£ã®ç ç©¶è ãã¡ã«ããå³å¯ãªæ°åŠç蚌æãžãšæè¯ãããããšãæåŸ ããã | |
| å€è§çãªå±æè§£åæã«ãããèªåŸçãã©ãããããåµåºã¢ãã«ïŒEmo-Cubic çµ±åææ³ã®ææ¡ | |
| (Autonomous Flat-Minima Generation via Cubic Positioning of Heterogeneous Optimizers) | |
| ïŒæ°ããåŠç¿ææ³ã®ææ¡ïŒemoç³»ïŒçš®ã«ãã屿åæã«ãã"é²åçãã©ããããã圢æ"ã®äºæ³ïŒ | |
| 1. ç®çïŒãã©ãããããå°éã®é«ã³ã¹ãåé¡ã解決ãã | |
| æ¢åã®åŠç¿ææ³ã§ã¯ã | |
| ã»ïŒã€ã®ãªããã£ãã€ã¶ | |
| ã»é·æéã®å埩åŠç¿ | |
| ã§ã®æ±åæ§åäžãé²è¡ã ãã©ããããã ãžå°éãããããšãå®çããŠããã | |
| ããã¯èšç®è³æºçãå«ãããŸããŸãªãªãœãŒã¹ãå¿ èŠãšã誰ãã宿œã§ããç°å¢ã«ã¯ãªãã | |
| æ¬ææ¡ã§ã¯ emoç³» ãªããã£ãã€ã¶ãçšããããšã§ããã®é«ã³ã¹ãæ§é ãã®ãã®ãå€ããããšãç®çãšããã | |
| 2. ææ¡ïŒãã©ãããããã"æ¢çŽ¢"ãããèªã"åµåº"ãã | |
| emoç³»ïŒçš®(EmoSens, EmoAiry, EmoCats)ã¯æŽæ°åŒã¯ç°ãªãããåŠç¿ã®æ§é ã¯å ±éããŠãããããåäžæ¡ä»¶ã®åŠç¿ãããš"ç°ãªãæ¹åããã®å±æè§£"å·®ç°ã®ããåŠç¿çµæãåŸãããã | |
| ãã®å·®ç°ã®ããåŠç¿çµæãçµ±åããããšã¯å±æè§£ã®åæãšãªãããã®åæã«ããå±æè§£ãåºãå¹³åŠã«ããå¯èœæ§ããããšäºæ³ããŠããã ã€ãŸãå±æè§£ããã©ãããããã«è¿ã¥ããããã®ãã®ãžå€ããå¯èœæ§ãããã | |
| ãããã®å±æè§£ã å šå±€LoRA ãšããŠååŸã TALL-Mask-Merge ãªã©ã®åæææ³ã§çµ±åãããšã | |
| âšâšâš â \___/ å±æè§£ã®åæã€ã¡ãŒãž | |
| (3æ¹åã®å±æè§£) (åæåŸã®å¹³åŠå) | |
| ã»3æ¹åã®å±æè§£ã®"å ±éããŠäœãéšå"ã匷調ããã | |
| ã»3æ¹åã§å°ã£ãéšå(ã·ã£ãŒãããã)ãçžæ®ºããã | |
| ã»çµæãšã㊠平åŠãªè°·åº(ãã©ããããã)ã«è¿ã圢ç¶ãåæ§æããã | |
| ããã¯ãå±æè§£ã ç«æ¹æž¬äœ(3軞枬äœ) ãšããŠæ±ãã | |
| "ãã©ããããããæ¢çŽ¢ãã"ã®ã§ã¯ãªã | |
| "ãã©ããããããåæã«ãã£ãŠåµåºãã" ãšããæ°ããåŠç¿ææ³ã§ããã | |
| 3. æŽçïŒãã®çµ±åã¯åŠç¿çæåã«ã€ãªãã | |
| ææ¡ã®å ·äœåïŒå šå±€LoRAãFFT(ãã«ãã¡ã€ã³ãã¥ãŒãã³ã°)ããªã©ãé·æã§è¡ãã®ã§ã¯ãªããå°ãæµ ãçšåºŠã®åŠç¿ãïŒçš®ã§è¡ã TALL-Mask-Merge ãªã©ã®åæææ³ãçšããããšã§å®çŸããã ããã«ãããªãœãŒã¹ã«éãã®ããã±ãŒã¹ã§ãé«ç²ŸåºŠã®åŠç¿çµæãåŸããããããªãå¯èœæ§ãæã€ãšäºæ³ããã | |
| æ¬ææ¡ã®å ·äœçãªå®æœæ¹æ³ã¯ä»¥äžã®éã | |
| ã»å šå±€LoRA ãŸã㯠FFT ãé·æã§ïŒçš®é¡ã®ãªããã£ãã€ã¶ã§è¡ãã®ã§ã¯ãªã | |
| ã»emoç³»ïŒçš®ã§æµ ãåŠç¿ãããããè¡ã | |
| ã»ãã®çµæã TALL-Mask-Merge ã§çµ±åãã | |
| ããã«ããã | |
| ã»é·æéåŠç¿ã«äŸåãã | |
| ã»ãªãœãŒã¹ãéãããç°å¢ã§ã | |
| ã»ãã©ãããããã«è¿ãé«ç²ŸåºŠã¢ãã«ãåŸããã å¯èœæ§ãããã | |
| ã€ãŸãããã©ããããããâç®æãâã®ã§ã¯ãªããâåµãåºãâããšã§åŠç¿ãçæåãããšããçºæ³ã§ããã | |
| 4. çµè«ïŒç°ç𮿿 é§ååã¢ãã«ã®çµ±å(Emotional Ensemble) | |
| æ¬ç ç©¶ã§ææ¡ããïŒçš®ã®ãªããã£ãã€ã¶(Sens, Airy, Cats)ã¯ããããããç°ãªãæ°åŠçåºåºã«åºã¥ãæå€±å°åœ¢ãå å¯ããã æ¬ç ç©¶ãææ¡ããïœ¢ç«æ¹æž¬äœã«ãããã©ãããããåæïœ£ã¯ãåäžæ¡ä»¶äžã§çæããããããã®åŠç¿çµæããã¹ã¯ããŒãž(TALL-Mask-Mergeç)ã«ããçµ±åããææ³ã¯ãåäžã®æé©åã¢ã«ãŽãªãºã ã§ã¯å°éãåŸãªãæ§é çå®å®æ§ïœ£ãšïœ¢è¡šçŸç粟緻ãã®åæç²åŸãå¯èœã«ããã ããã¯æé©åã«ãããåŠç¿ããã»ã¹ãæé軞ã®è¿œæ±ããã空éçãªå€è§çµ±åãžãšã·ãããããæ°ããæé©åãã©ãã€ã ã«ãªããšäºæ³ããã | |
| 5. è£è¶³ïŒå šå±€LoRAçµ±åã®è©Šè¡æ¹æ³ | |
| ïŒçš®ã®çµ±åã¯å ã¢ãã«ã«ããããã®åŠç¿çµæãçµ±åãããã®æ°ããïŒçš®ã®ã¢ãã«ã TM-merge ã«ãŠå ã¢ãã«ãžçµ±åããã | |
| å ã¢ãã«(org) âª= TMçµ±å âª= ã¢ãã«S(Sens)ãã¢ãã«A(Airy)ãã¢ãã«C(Cats) | |
| LoRAã ãã§çŽæ¥çµ±åããå ã¢ãã«ãžçµ±åãããããïŒã€ã®ã¢ãã«ãå ã¢ãã«ãž TM-merge ã§éå ããã | |
| FFTã§ã¯FFTåŸã®ïŒã¢ãã«ãå ã¢ãã«ãž TM-merge ããã ãã§åçã®å¹æãæã€ãã®ãšäºæž¬ããã | |
| loss飜åããªãåŠç¿é²è¡ã®æ£äœ | |
| ïŒåæ»ã®å°ãªãäžããç¶ããlossãžã®èå¯ïŒ | |
| æ¬ææ³ã«ãããŠãlossãã»ãšãã©åæ»ã飜åãããæŠãäžããç¶ããæåããã芳å¯ãããã ç¹ã«1st-stepã®losså€ã®åå€ããããŸã§äžããç¶ããã®ã¯ããã€åæããã®ãïŒãšããç念ããæ±ãããã ãããåŠç¿çµæã¯éåŠç¿çã®ç Žç¶»ãšã¯ç¡çžã§ãããæ¥µããŠæ£åžžãªæ±åæ§èœãç¶æããŠããã ããã«ã€ããŠçŽæçãªçè§£ããããšïœ¢åŠç¿å ã¢ãã«ã®ä¿®åŸ©ãå·®åãšããŠåŠç¿ããŠãããšããå¯èœæ§ãèŠåºãããšãã§ããã ããã¯ãããŸã§ä»®èª¬ã§ãã£ãŠãå ã® ãã©ãããããã®åµåº ãšåæ§ã§ 次äžä»£ã®ç ç©¶è ãã¡ã«ããå³å¯ãªæ°åŠç蚌æãžãšæè¯ãããããšãæåŸ ããã | |
| ãªã以äžã«ãã "losså€ ã®æ¯å¹ ããéããéŒå(emoPulse)ã¯ããŸãªã(忢ããªã)" ããšãä¿èšŒããã | |
| noise_base = abs(sigma_t - trust_t) + ε_s | |
| d_base = abs(N_t - d_t) + ε_t | |
| ã㮠ε_sã ε_tã ãããåæ»ãæããç¶ç¶çãªå³äžããã®æåãçã¿ããã©ããããããæ¢çŽ¢ããåååãçã¿åºãã ãã㯠losså€ ã®å·®åããªããªãã°åæãããšããããã ãã®èšèšã«ãã simplenet(FashionMNIST) ã«ãããåŠç¿ãã¹ãã«ãã 10000step èšæž¬ã§ lossïŒ0.30 以äžãžå°éããããšãåçŸæ§ã䌎ã確èªã§ããã | |
| SDXLãçšããå®èšŒå®éšã§ã¯ãåäžä»£ EmoNavi ãšãã®ããªãšãŒã·ã§ã³ã§ãå®çŸå¯èœãª e-pred ïŒ ZtSNR ã§ã®åŠç¿ãããã® EmoSens ãšããªãšãŒã·ã§ã³ã§ã宿œã§ããã ãã㯠FM(Flow-Matching) ã«ããããã€ãºãžã®èæ§ãšãsampler 察å¿ã«ã€ããŠã®èª²é¡ã解決ããåæã« e-pred ã®åŒ±ç¹ãšãããè²åçãžã®èª²é¡ã解決ããŠããã æåž«ç»å10æçšã§ã®300epochåŠç¿ãåæ»ãªãå®äºãéåŠç¿åŸåããªãå šå±€LoRAã®äœæã«ãæåããŠããã | |
| äžèšãã¹ããããã«æ¥µç«¯åããç»åïŒæã§ã®300stepã宿œãããšããããåæ»ãªãå®äºãåŠç¿çµæã®ç Žç¶»ããŠããªãããšã確èªããã æ¥µç«¯ãªåŠç¿èšå®ã宿œããŠãç Žç¶»ããªãïŒãã®çç±ã¯ãã€ãºãèç©ããªãæŽæ°ã宿œããŠãããšèããã ãããããã€ãºãšã¯åŸ®å°ããŒã¿ã®éã¿ã¥ãã«èª€ããçããããšã§ãã€ãºåããŠãããšèãããããã®ã§ããã埮å°ããŒã¿ãé©åã«æŽæ°ããããšã§è²Žéãªæ å ±ãä¿è·ãç¶æããããšã§ãã€ãºãçãŸãªãããšãèèŠã§ãããšèããã | |
| ããã« SDXL VAE ã®å šå±€åŠç¿(ãšã³ã³ãŒããšãã³ãŒãã®äž¡é¢) ã宿œããã ãããŸã§ VAE ååŠç¿ã§ã¯ã¢ãã«ãšã®æŽåæ§ãæãªãããŠããŸããçµæçã«çæçµæã®ç Žç¶»ã瀺ãããã«ãªãããæ¬ç ç©¶ã§ææ¡ããŠããæé©ååšã§ã¯ãã®æŽåæ§ãç¶æãæãªããªãããšã確èªããã ãã㯠VAE ã®åå©çšæ§ãåäžããããšãšãã«ãã¢ãã«ã®å©çšå¯èœæéãå»¶é·ããããšã«è²¢ç®ããã ãããšèããã | |
| 極éçãã€ãºã¢ãã«åŠç¿ã®èå¯ãSDXL ããã©ã¢ãã«åæå(ã©ã³ãã å€ã«ããéã¿åæå)ã宿œãããããåŠç¿å ã¢ãã«ãšããå šå±€LoRAåŠç¿ã宿œããã éåžžã§ããã°æ°stepã§çºæ£ããŸãã¯NaNãšãªãåŠç¿ã¯ç Žç¶»ããããEmoSensäžä»£ã¯ããããåŠç¿ãé²è¡ãã1500stepãå®äºããã ãã®LoRAã¯ç Žç¶»ããã¯ãã§ãããããã®äºæ³ãè£åãç Žç¶»ãªãåæååã®SDXLããã©ã¢ãã«ãžæ£åžžé©çšå¯èœã§ãã£ãã é©ãããšã«ããã®LoRAã¯ããã©ã¢ãã«ä»¥åã®ç¶æ ãšããŠåŠç¿ããŠãããããããã©ã¢ãã«ã®èŠæãšããæ°Žå¹³ç·ãå°å¹³ç·ã®é£ç¶æ§ãåäžãããäž»é¡ãè·šãã éã®äœçœ®ããçãè£æ£ãããã®ãšãªã£ã(掟çSDXLã¢ãã«ã«ãé©çšå¯èœã§åæ§ã®å¹æãæããŠãã) ãã®ãã¹ããã EmoSensäžä»£ã®å®å®æ§ãšå®å šæ§ã¯åªããé 奿§ãåããŠãããšç¢ºèªã§ããã | |
| â» æ¬LoRAã¯è€æ°ã® seed ã«ãããŠåæ§ã®å¹æã芳枬ãããŠãããçµæãšã㊠SDXL ã®ç¹å®ã®ã¢ãŒãã£ãã¡ã¯ãã軜æžãã"æ£ååçæå"ã瀺ããå¯èœæ§ãããã ãã ãããã®å¹æãæå³çãªåŠç¿ã«ããåŠãã ãã®ã«ããã®ããå¶ç¶çæŽåã«ãããã®ãã¯çŸæç¹ã§ã¯æå®ã§ããªãã æ¥µéäžã®åŠç¿é²è¡ãå®å®çã§ããããšããããšã®ç¢ºèªãšããŠã®ã¿ãçè§£é ãããã | |
| ã°ãããã³ã°ã«ã€ããŠã®äºæ³ | |
| æ¬ç ç©¶ã§ã¯ãåæ»ã®å°ãªãé£ç¶ç㪠losså€ äœäžãšããæåã«çç®ãããã®èŠå ãæ€èšŒããããã«åçš®ãã¹ãã宿œããã ç¹ã«ã極端ãªåŠç¿æ¡ä»¶ãšããŠïœ¢ç»å1æã®ã¿ã§ã©ããŸã§å®å šãã€å®å®ããåŠç¿é²è¡ãå¯èœããè©äŸ¡ããã ãã®çµæãéåŠç¿ã®çºçãã³ããŒç¶æ ãžã®åŽ©å£ãç¡é¢ä¿ããã³ãããžã®å¹²æžãšãã£ãå žåçãªç Žç¶»ããããã芳枬ããããæ¥µããŠå®å®ããåŠç¿çµæã確èªããã | |
| ãããã®çµæãããã°ãããã³ã°ãšã¯ä»¥äžã®2èŠå ãè€åããŠçããâåæ»çŸè±¡âã§ãããšäºæ³ããã | |
| - åŠç¿éçšã§èç©ããããã€ãºåŠç¿ã®ç©ç®ã«ãããåŠç¿åŸåã§ä¿®æ£ãã¹ãäžæ£ç¢ºããå¢å€§ããã¢ãã«ã®èŠçãæ¥æ¿ã«æªåããããš(ãã¯ã€ãã¢ãŠãïŒãã©ãã¯ã¢ãŠãçŸè±¡) | |
| - åŠç¿åŸåãšããæãä¿®æ£ãå¿ èŠãªå±é¢ã«ãããŠãã¹ã±ãžã¥ãŒã©ãåŸé çµ±èšã LR ãæå¶ããLR ãæ¥µç«¯ã«äœäžããŠããŸãããš | |
| ãã®2ç¹ãåæã«çºçããããšã§ãã¢ãã«ã¯æ¬è³ªçãªæ¹åæ§ãèŠå€±ããé·æã®åæ»æã«é¥ããšèããããã ã€ãŸãã°ãããã³ã°ã¯åé¿å¯èœãªçŸè±¡ã§ãããšèããã | |
| emoç³»(EmoSensäžä»£) ã°ãããã³ã°ãåé¿ã§ããçç±ã¯æç¢ºã§ããã | |
| æ¬ææ³ã¯ã以äžã®æŽæ°ãå¯èœãšããŠãããããèŠçãåžžã«ã¯ãªã¢ã«ä¿ã¡ãåŠç¿ãç¶ç¶ããããã®é§ååã倱ããªãã | |
| - æŽæ°ã®æ£ç¢ºæ§ãç¶æããã€ãºãèç©ããªãããš | |
| - åŠç¿åŸåã§ãå¿ èŠãª LR ãèªåŸçã«ç¢ºä¿ã§ããããš | |
| ããä»®ã«èŠçäžè¯ã«é¥ã£ãå Žåããææ æ©æ§å šäœãé«ç²ŸåºŠGPSã®ãããªå¹æãçºæ®ããemoPulseã®æ£ç¢ºãªå¿æãæ©ã¿ãæ¢ããªããããã°ãããã³ã°ãçµãã« ãã©ãããããã倧åçæé©è§£ãžèªç¶ã«è¿ã¥ãããšãå¯èœãšãªãã | |
| ã°ãããã³ã°ã«ã€ããŠïœ¢äžå¯è§£ãªé å»¶äžè¬åãšããŠèå¯ãããŠããããå è¿°ãã SDXL ã§ã®åŠç¿çµæãããããããšãããã°ãããã³ã°çŸè±¡ã®æ¬è³ªã¯ãã¢ã«ãŽãªãºã åŽã®æ§é çæ¬ é¥ã«ããåæ»ãšèŠåãããšèããã dNR ã¯èª€ã£ãéã¿ã¥ãã®å åãšæªæŽçã®åŸ®å°ããŒã¿ãæ€ç¥ããæœè±¡æ§é ãšã®ççŸãæãä¿®æ£ããã埮现ããŒã¿ãæ£ããæ±ãã°äžè¬åè§£ã¯æ©ã圢æããããšèããã | |
| ä»åŸã®èª²é¡ïŒïŒæ¬¡ã¢ãŒã¡ã³ãè¿äŒŒã«ããé©å¿çæ£ç¢ºæ§å€å®ã®å°å ¥ | |
| ä»åŸã®å±æãšããŠãdNRã®ïŒä¹(ïŒæ¬¡ã¢ãŒã¡ã³ãçžåœ)çãçšããïœ¢é«æ¬¡æ£ç¢ºæ§å€å®æ©æ§ïœ£ã®å°å ¥ãæ€èšããŠããã ããã¯ïŒæ¬¡æ å ±ãçŽæ¥ emoPulse ã®åºåãšããã®ã§ã¯ãªã(emoPulseæ©æ§ã¯çŸç¶ãç¶æãã) çŸåšã®åŠç¿é²è¡ã®ïœ¢çŽåºŠïœ£ãè©äŸ¡ããã¡ã¿ææšãšããŠæŽ»çšãã詊ã¿ã§ããã ããã«ããæ¥µå°ããŒã¿ã»ããã«ãããéåŠç¿ã®äºå ãããã«æ©æã«æ€ç¥ããèªåŸçå¶åŸ¡ã®ç²ŸåºŠã極éãŸã§é«ããããšãå¯èœã«ãªããšäºæ³ããã ãŸãã¯dNRå±¥æŽã«ããéå»ãšçŸåšã®å·®åããæ£ç¢ºæ§ãæ€ç¥ã§ãããããããªãã ãã ãããã¯å¿ èŠæ§ã«å¿ããŠå°å ¥ãããã®ã§ããããããŸã§ã®å®èšŒè©Šéšçµæããæ¥ãå¿ èŠã¯ãªããšå€æããŠããã | |
| æ°åŠçè§£æãžã®å±æ | |
| æ¬ç ç©¶ãæ°åŠçã«è§£æãããšãSDEææ³ ã§ãããªãã ODEç ã§ãããšçµè«ã¥ããããã®ã§ã¯ãªãããšèããã ãã® emoPulse ã«ããæŽæ°åã¯ã確ççãªæºãããšæéçãªæ»ãããã®åæ¹ãå å ããŠããããã®æ¯ãèã㯠SDE ãš ODE ã®å¢çã«äœçœ®ããç¬ç¹ã®æ§é ãæã€å¯èœæ§ãããã (Losså€ã¯åŠç¿ã®çµæã§ããããããããäžå¿ã«ããæ¬ææ³ã¯çµæããå°åºããã®ã§ ODEç ã«ãªããšäºæ³) Multi-EMA ã«ããå±¥æŽåœ¢æãå éšå€æ°ã®æšç§»ããã©ã®ãããªé£ç¶æéçè§£éãæã¡ãããã¯ãä»åŸã®æ°åŠçç ç©¶ã«å§ããããéèŠãªèª²é¡ã§ããã æ¬çš¿ã§ã¯ãã®çŽæçãªæ¹åæ§ã®ã¿ã瀺ãããã®è©³çްãªè§£æã¯æªæ¥ã®ç ç©¶è ã«ããçºå±ã«æåŸ ãããã | |
| åèæç® (References) | |
| Kingma, D. P., & Ba, J. (2014). Adam: A Method for Stochastic Optimization. arXiv preprint arXiv:1412.6980. (1次ã»2次ã¢ãŒã¡ã³ããçšããé©å¿çåŠç¿çã®åºç€) | |
| Reddi, S. J., Kale, S., & Kumar, S. (2019). On the Convergence of Adam and Beyond. ICLR. (AMSGradçã«ããåæä¿èšŒãš2次ã¢ãŒã¡ã³ãã®å®å®æ§ã«é¢ããè°è«) | |
| Defazio, A., & Mishchenko, K. (2023). Learning-Rate-Free Learning by D-Adaptation. ICML. (æé©è§£ãŸã§ã®è·é¢ D ãæšå®ããæåã®åŠç¿çèšå®ãäžèŠã«ããçè«çæ çµã¿) | |
| Orabona, F., & Tommasi, T. (2017). Training Deep Networks without Learning Rates Through Coin Betting. NeurIPS. (COCOB: æè³æ¯ç (Betting) ã®æŠå¿µãçšããããã©ã¡ãŒã¿æŽæ°ã®èªåŸå¶åŸ¡çè«) | |
| Luo, L., Xiong, Y., & Liu, Y. (2019). Adaptive Gradient Methods with Dynamic Bound of Learning Rate. ICLR. (AdaBound: åŠç¿çã®åçã¯ãªããã³ã°ã«ããæ±åæ§èœã®åäž) | |
| Shazeer, N., & Stern, M. (2018). Adafactor: Adaptive Learning Rates with Sublinear Memory Cost. ICML. (è¡ååè§£ã«ããã¡ã¢ãªç¯çŽãšãäœç²ŸåºŠç°å¢ã«ãããæ£èŠåææ³) | |
| Bernstein, J., Wang, Y. X., Azizzadenesheli, K., & Anandkumar, A. (2018). signSGD: Compressed Optimisation for Non-Convex Problems. ICML. (笊å·åã«ããåŸé å§çž®ãšããã€ãºèæ§ã®é«ãæŽæ°åã®èšŒæ) | |
| Chen, S. B., et al. (2023). Symbolic Discovery of Optimization Algorithms. arXiv. (Lion: 笊å·å (Sign) ãš Weight Decay ã®åé¢ã«ããå¹ççãªæ¢çŽ¢ã®èšå·ççºèŠ) | |
| Zeyuan Allen-Zhu. (2017). Natasha: Faster Non-Convex Optimization Than SGD. arXiv. (髿¬¡æ å ±ãå©çšããéåžæé©åã®å éãšãå±æè§£ããã®è±åºçè«) |