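Paper: Time-Series SNR Estimation and an Improved Regret Bound in the Autonomous Optimization Algorithm emoPulse, an Exploration of Second-Moment-Free Updates via the "Geometric Orthogonality of Weights and Gradients", and Beyond Flow-Matching
~ Establishing "emotion-driven" learning-rate control through dynamic introspection of the loss landscape, and a proposal for next-generation optimization through dialogue with the loss landscape ~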
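Abstract
In deep learning optimization, tuning the learning rate and securing generalization performance are central problems. Existing methods depend on precise gradient estimation and are fragile against noise in extremely low-precision environments. This paper proposes emoPulse (v3.7 and later), an autonomous algorithm built around a multi-angle time-series analysis of the loss. The method uses a three-stage exponential moving average (Multi-EMA) to capture the "undulation" of the loss landscape and, through an emotion scalar and a trust metric (Trust), autonomously generates a learning rate based on the S/N ratio.
Next, we propose W-Ref Geometry, an update rule that focuses on the geometric relationship between weights and gradients. By controlling momentum dynamically according to the orthogonality of weights and gradients, it realizes a "second-moment-free" update that keeps no second moment and reacts immediately to changes in the landscape. This reduces VRAM usage and offers a democratic foundation for research environments with limited compute and for multilingual training aimed at sharing many cultures.
We then analyze emoPulse and discuss how it bears on current open problems. In particular, we address the difficulty of adapting Flow-Matching (FM) to LLMs and propose an answer to the question of how the deterministic learning process of FM should be applied to LLMs. The result is a new optimization approach that bridges the two.
Furthermore, by merging the training results of five optimizers of this family with different update characteristics (Sens / Airy / Cats / Tion / Void), we present a method that integrates local solutions in the manner of "multi-source positioning" and artificially creates flat minima. This gives robust convergence that does not depend on hyperparameter settings, and offers a democratic foundation for research environments in developing countries with limited compute and for multilingual training that aims to pass on diverse cultural heritage.
Finally, we append observations and conjectures about grokking.
※ v3.7 does not include EmoTion and EmoVoid (both were newly developed for v3.8). v3.7 and v3.8 differ only in the dNR_hist handling of the emoPulse mechanism described later; everything else is identical.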
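1. Introduction
This paper presents a unified theory for the optimizers EmoSens / EmoAiry / EmoCats / EmoTion / EmoVoid. The core of the method is the emoPulse mechanism, which layers exponential moving averages (EMA) of the loss value, extracts a "trust" signal (Trust) from the time-series statistics of the loss, and generates the learning rate autonomously. Mathematically, it combines D-adaptation theory with time-series signal processing (SNR estimation) and achieves robust convergence that does not depend on hyperparameter settings.
The starting point of this work is a reflection on the excessive dependence on precise gradient estimation in existing adaptive gradient methods. In extremely low-precision, heavily quantized environments (1-bit/2-bit and similar), gradients carry very high noise and their reliability drops sharply. The loss value, by contrast, continues to function as an accurate scalar that indicates the model's "distance from the correct answer" even under quantization.
The method treats the gradient only as a reference for direction (intent) and hands control of training to a multi-angle analysis of the loss, which is an accurate observation. This replaces higher-order moment computation with scalar control and, through sign-based updates, adapts the optimizer to low-precision and quantized environments. Its most distinctive feature is that local solutions obtained by several emo-family optimizers with different characteristics are integrated as "multi-source positioning", so that reaching flat minima, which conventionally requires long iterative training, can be replaced by short training runs followed by merging.
This approach achieves the following three things:
Large gains in computational efficiency: the complex computation of higher-order moments is replaced by scalar control based on temporal accumulation of the loss, and this approximation reduces the computational load.
Optimization for low precision and quantization: matrix factorization in EmoAiry, complete elimination of the second moment in EmoCats, and the original EmoTion and EmoVoid with their "geometric orthogonal update" and complete elimination of the second moment, together with sign-based updates, make large-scale training possible in low-resource environments.
Autonomous convergence: by introspecting the S/N ratio of the loss landscape, manual schedulers become unnecessary and the user's trial-and-error cost is minimized.
※ Higher-order moment approximation: aggregation into higher-order statistics along the time axis (Time-series Higher-order Statistics).
Mathematically, this combines D-adaptation theory with time-series signal processing, and it forms a foundation for "democratic AI training" that serves research environments in developing countries and the preservation of diverse cultures.
※ EmoTion and EmoVoid not only replace higher-order moment computation with scalar control but also use the geometric information carried by the weights themselves as a guide for updates, which yields a lightweight structure that needs no second moment (detailed in Chapter 6).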
2. Theoretical Framework: Emotional Circulation
The system forms a feedback loop with the loss function L as its origin.
2.1 Approximating higher-order moments with Multi-EMA
By taking differences among a three-stage EMA (short, medium, long), the method captures changes in the curvature of the loss landscape, the uncertainty of its fluctuations, and changes in those fluctuations.
EMA_t = (1 - α) * EMA_{t-1} + α * L_t
The "High-order Temporal Difference" generated from these differences is defined as the "emotion scalar". The emotion scalar sigma_t is a nonlinear statistic that compresses higher-order moment information (skewness, kurtosis, fluctuation) into [−1, 1]. Several EMAs with different time constants accumulate the long history of past steps hierarchically. Taking their relative time-delay differential (Time-delay Differential) lets the method observe the dynamic higher-order rate of change of the landscape as training progresses, which a static landscape analysis cannot provide. Because this quantity enters the update rule recursively, the smoothness of the landscape over long horizons is reflected in the parameter updates.
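The three-stage EMA above can be sketched as follows. This is a minimal illustration, not the released implementation: the class name `MultiEMA` and the three smoothing factors are assumptions made for the example, and the mapping from EMA differences to the emotion scalar sigma_t is not specified in this section, so it is left out.

```python
class MultiEMA:
    """Short, medium and long exponential moving averages of the loss."""

    def __init__(self, alphas=(0.3, 0.05, 0.01)):
        # Assumed smoothing factors, chosen only for illustration.
        self.alphas = alphas
        self.values = None

    def update(self, loss):
        if self.values is None:
            # Seed all three averages with the first observed loss.
            self.values = [float(loss)] * len(self.alphas)
        else:
            # EMA_t = (1 - alpha) * EMA_{t-1} + alpha * L_t
            self.values = [(1.0 - a) * v + a * loss
                           for a, v in zip(self.alphas, self.values)]
        return tuple(self.values)


ema = MultiEMA()
for loss in [2.0, 1.5, 1.2, 1.0, 0.9]:
    short, medium, long_ = ema.update(loss)

# While the loss is falling, the fastest EMA sits lowest.
print(short < medium < long_)
```

The gaps between the three averages are the raw material for the time-delay differences described above: a fast EMA far below the slow one indicates a sustained decrease, while frequent crossings indicate an unsettled loss.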
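※ Note on the time-series formation of higher-order moments:
The higher-order moment approximation in this method is not computed from the gradient information of a single step; it is formed by temporal accumulation. In other words, the method observes the dynamic rate of change of the landscape as training progresses rather than the static curvature of the landscape.
※ Hierarchical structure of the higher-order moment approximation:
Through temporal accumulation of the loss, the method effectively approximates higher-order moments from the 3rd order (skewness) up to the 7th order (amplification of confidence). This is not a static landscape analysis but an attempt to extract the "confidence of the system" as a measurable quantity within the dynamic process of training.
The Multi-EMA structure functions as a dynamic, temporal approximation of the higher-order moments of statistics.
3rd to 5th order approximation: the differences among the Short / Medium / Long EMAs extract the temporal evolution of higher-order information about the loss distribution, namely skewness (Skewness), kurtosis (Kurtosis), and fluctuation (Fluctuations).
6th order approximation: the emotion scalar sigma_t and the trust value trust_t, which integrate these, act as a 6th-order-equivalent meta-statistic that indicates the "stability of the training phase" beyond the mere variance of the gradient.
7th order approximation (dNR): in deriving dNR, the ratio of this 6th-order information is squared, (d_base/noise_base)^2. Squaring sharply (quadratically) amplifies small differences in confidence and yields a highly sensitive control signal equivalent to a 7th-order moment.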
2.2 Definition of the trust metric trust_t
The core metric trust_t, which determines the "quality" of an update, is defined as follows.
trust_t = sgn(sigma_t) * (1.0 - abs(sigma_t))
This trust value is designed to be bounded so that it reaches neither ±1.0 (complete confidence) nor 0 (complete despair), which keeps a moderate amount of "room for exploration" and "caution" in the system at all times.
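The definition of trust_t translates directly into code. The sketch below assumes sgn(0) = 0, which the text does not state; the function name is chosen for this example only.

```python
def trust_from_sigma(sigma):
    """trust_t = sgn(sigma_t) * (1 - |sigma_t|)."""
    sign = (sigma > 0) - (sigma < 0)  # sgn(0) is taken as 0 here (assumption)
    return sign * (1.0 - abs(sigma))


for sigma in (0.1, -0.1, 0.9):
    print(sigma, trust_from_sigma(sigma))
```

A small |sigma_t| gives a trust magnitude close to 1, and a large |sigma_t| gives a trust magnitude close to 0, with the sign of sigma_t carried over.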
With the loss function L as the origin, this forms the following feedback loop (the emotional circulation system):
Loss → Multi-EMA → Scalar/Trust → emoPulse → Loss
3. emoPulse: Learning-Rate Generation by Autonomous Pulsation
From v3.7 onward, the earlier emoDrive (acceleration mechanism) has been integrated into emoPulse. emoPulse is an evolved form that approximates dynamic distance estimation (D-adaptation) using the time-series S/N ratio (Signal-to-Noise Ratio).
3.1 Dynamic estimation of Noise and Distance
The system's "hesitation" and "progress" are tracked with two internal variables, N_t and d_t. N_t represents "wavering" (instability) and d_t represents "progress" (distance).
Noise_est (N_t): N_t = (1 - α) * N_{t-1} + α * abs(sigma_t)
Distance Estimate (d_t): d_t = (1 - α) * d_{t-1} + α * abs(trust_t)
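The two estimators are ordinary EMAs over |sigma_t| and |trust_t|. A minimal sketch follows; the smoothing factor `alpha=0.1` is an assumption for illustration, and the starting values 1.0 and 0.02 are the ones quoted in the implementation notes of Section 4.2.

```python
def update_estimates(noise_est, d_est, sigma, trust, alpha=0.1):
    """One EMA step for N_t (noise) and d_t (distance)."""
    noise_est = (1.0 - alpha) * noise_est + alpha * abs(sigma)
    d_est = (1.0 - alpha) * d_est + alpha * abs(trust)
    return noise_est, d_est


# A calm step: small |sigma|, large |trust|.
noise_est, d_est = update_estimates(1.0, 0.02, sigma=0.2, trust=0.8)
print(noise_est, d_est)
```

In a calm step the noise estimate drifts down and the distance estimate drifts up, which is the behavior the text describes as progress accumulating while wavering fades.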
3.2 Definition of emoPulse and autonomous control / instantaneous SNR and history management (dNR_hist)
The value of emoPulse is determined by a "tug-of-war" between the instantaneous SNR and the temporal SNR. First, the instantaneous and temporal bases are computed:
noise_base = abs(sigma_t - trust_t) + ε_s
d_base = abs(N_t - d_t) + ε_t
Using these, the current SNR strength is defined as:
dNR_now_val = ( d_base / noise_base )^2
Update rule for dNR_hist:
Acceleration condition:
if dNR_now_val >= dNR_hist and trust_t >= threshold_high:
    dNR_hist = min( dNR_now_val, dNR_hist * factor_grow )
Deceleration condition:
if threshold_low <= trust_t <= threshold_high:
    dNR_hist = dNR_now_val * factor_decay
The final learning rate emoPulse is then given by:
emoPulse_t = clamp( dNR_hist * (emoScope * η_base), η_min, η_max )
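This design is intended to produce the following autonomous behavior:
Confidence region (∣trust∣ > 0.5): the SNR improves and the learning rate accelerates toward its maximum, heading quickly for flat minima.
Hesitation region (∣trust∣ < 0.5): uncertainty grows, and the learning rate is held down to prevent divergence in sharp valleys.
※ emoPulse is scaled by a coefficient determined by the user-defined initial learning rate (emoScope) and the system's default sensitivity (η_base).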
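The steps of Section 3.2 can be put together in one function. This is a sketch under stated assumptions rather than the released code: the function name, `emo_scope=1.0` and `eta_base=1e-4` are hypothetical; the thresholds (±0.5), the safety terms (0.1), the v3.7 growth and decay factors (1.05, 0.98) and the clamp range (1e-6 to 3e-3) are the values quoted elsewhere in this paper; and the two conditions are chained with `elif`, which is an assumption about how they interact at the boundary.

```python
def emo_pulse_step(sigma, trust, noise_est, d_est, dnr_hist,
                   emo_scope=1.0, eta_base=1e-4,
                   eps_s=0.1, eps_t=0.1,
                   threshold_low=-0.5, threshold_high=0.5,
                   factor_grow=1.05, factor_decay=0.98,
                   eta_min=1e-6, eta_max=3e-3):
    """Return (emoPulse_t, updated dNR_hist) for one step."""
    noise_base = abs(sigma - trust) + eps_s      # instantaneous base
    d_base = abs(noise_est - d_est) + eps_t      # temporal base
    dnr_now = (d_base / noise_base) ** 2

    if dnr_now >= dnr_hist and trust >= threshold_high:
        # Acceleration: history grows, but by at most factor_grow per step.
        dnr_hist = min(dnr_now, dnr_hist * factor_grow)
    elif threshold_low <= trust <= threshold_high:
        # Deceleration: history is reset to a decayed copy of the current value.
        dnr_hist = dnr_now * factor_decay

    pulse = min(max(dnr_hist * emo_scope * eta_base, eta_min), eta_max)
    return pulse, dnr_hist


pulse, hist = emo_pulse_step(sigma=0.2, trust=0.8,
                             noise_est=1.0, d_est=0.02, dnr_hist=1.0)
print(pulse, hist)
```

When trust is strongly negative, neither branch fires and dNR_hist is left unchanged, which matches the later remark in the supplementary material that acceleration in the negative direction is not added to the history.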
4. emoPulse: Analysis of the Regret Bound and Boundedness
4.1 Convergence and regret analysis
Under emoPulse, the cumulative regret R(T) is bounded above as follows, with the dynamically changing learning rate η_t included:
R(T) <= O( Σ_{t=1}^T [ η_t * ||g_t||^2 * (1 - |σ_t|)^2 ] )
Here the coefficient (1 - |σ_t|) quantifies the "trust" (Trust) of an update, derived from the consistency of the short-, medium-, and long-term EMAs of the loss. A large |σ_t| indicates that the loss is fluctuating strongly, and the gradient information of that step is judged to be unreliable.
Conversely, a small |σ_t| means that the loss is evolving smoothly and the update direction is reliable. The signal strength |trust_t| = 1 - |σ_t| therefore weights the effective update size in the regret bound adaptively and suppresses the accumulation of regret caused by uncertain gradients.
The emoPulse of this method is a generalization that approximates the learning-rate structure of D-adaptation by Defazio & Mishchenko (2023) using the time-series statistics of the loss (d_t, N_t).
η_t ∝ D^2 / noise
Definition of emoPulse:
η_t = ( d_t / (N_t + ε) )^2 * η_base
This reconstructs the SNR control of D-adaptation, which is based on the distance-to-noise ratio, directly in time-series form.
With this structure, when the noise component N_t grows, the denominator dominates and the learning rate η_t shrinks immediately. This self-adjustment automatically suppresses excessive updates in regions where the loss landscape is unstable. It provides the theoretical basis for the learning-rate-free property: the algorithm acquires dynamic stability autonomously without any external learning-rate schedule.
4.2 Proof of positivity and boundedness
We show below that, at any step t, the algorithm prevents both explosion and vanishing of the learning rate and remains bounded.
1. Non-zero boundedness of the denominator (instantaneous doubt: noise_base)
noise_base, the denominator used when generating emoPulse, is defined as the divergence between the current emotion scalar sigma_t and the trust value trust_t:
noise_base = abs(sigma_t - trust_t) + ε_s
In the implementation, |sigma_t| < 1.0 and trust_t is a signed function of sigma_t, so this difference is bounded. In addition, the trailing safety term (+ 0.1) prevents the denominator from approaching zero and thereby rules out an exploding learning rate (NaN).
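The bound on noise_base can be made explicit. The short derivation below assumes that trust_t follows the definition in Section 2.2 and that 0 < |σ_t| < 1; it is a worked check of the claim above, not part of the original proof.

```latex
\begin{aligned}
\sigma_t - \mathrm{trust}_t
  &= \sigma_t - \operatorname{sgn}(\sigma_t)\,(1 - |\sigma_t|)
   = \operatorname{sgn}(\sigma_t)\,\bigl(2|\sigma_t| - 1\bigr), \\
\bigl|\sigma_t - \mathrm{trust}_t\bigr|
  &= \bigl|\,2|\sigma_t| - 1\,\bigr| \in [0, 1), \\
\varepsilon_s \;\le\; \mathrm{noise\_base}_t &< 1 + \varepsilon_s .
\end{aligned}
```

The difference vanishes at |σ_t| = 0.5, which is exactly the point where the safety term ε_s is needed to keep the denominator away from zero.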
2. Lower boundedness of the numerator (temporal confidence: d_base)
d_base, the numerator used when generating emoPulse, is defined as the difference between the historical noise estimate N_t (noise_est) and the distance estimate d_t (d_est):
d_base = abs(N_t - d_t) + ε_t
N_t is kept positive by max(noise_est, ν_r), and d_t is updated by accumulating abs(trust_t) regardless of whether the loss is improving or worsening. Adding the safety term (+ 0.1) to the difference of these temporal statistics guarantees that a minimum step size (a lower bound on the numerator) is always available, even when the history is unstable in an extremely low-precision environment.
3. Conclusion on boundedness and the constraint on emoPulse
The effective learning rate emoPulse_t, generated from the ratio of the "temporal base" (numerator) to the "instantaneous base" (denominator), is finally constrained strictly to the following range by the safety guard max(min(..., 3e-3), 1e-6) in the implementation:
0 < η_min <= emoPulse_t <= η_upper_bound
The lower bound (η_min) is the minimum "metabolism" (heartbeat) that is maintained even when the system is in its most uncertain state. It avoids a training stall (deadlock) and lets the system wait for autonomous recovery. The upper bound (η_upper_bound) acts as a limiter that prevents the model from diverging even if the dNR coefficient rises abruptly.
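Implementation notes:
Stabilization through initial values:
※ For very small datasets or environments with large initial noise, we recommend resetting the initial values of d_t and N_t for the period until the Multi-EMA "history" has stabilized (for example, d-est: 0.2, Noise-est: 0.2). This suppresses divergence caused by early stochastic noise. In particular, initializing N_0 to the same value as d_0 makes the system start essentially in a "cautious mode". This acts as an organic warm-up phase that avoids overly aggressive updates during the critical early steps and gives priority to observing the landscape.
Balancing "sustained update pressure" and safety through initial values:
※ In this method, d_base, which forms the numerator of emoPulse, determines the "potential update strength" of the system. Setting the initial values to N0 = 1.0 and d0 = 0.02 means deliberately securing a high acceleration potential from the start of training. Because of the properties of the exponential moving average, the influence of these initial values persists as "history" for roughly 100 steps. During this period the system keeps a high acceleration pressure in the background, while the boost is granted only to "truly reliable signals" that pass the strict screening of the emotion mechanism.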
5. Sign-Based Normalization: Adapting to Low-Precision Environments
This chapter describes sign-based normalization, which applies the theoretical framework of emoPulse to low-precision environments.
To remove the dependence on precise floating-point arithmetic and to support extremely low-precision (heavily quantized) environments, the following update rule is adopted (EmoAiry, EmoCats, and others):
delta_w_t = -emoPulse_t * sign( m_t / ( sqrt(v_t) + ε ) )
In EmoAiry, this resolves the precision imbalance between the one-dimensional vectors and the two-dimensional moments, extracts only the agreement in direction, and achieves a "unity of intent".
※ EmoCats is Lion-based and handles this with sign-based updates and decoupled weight decay (WD).
※ EmoTion / EmoVoid apply the sign operation to their own update rule, the "geometric orthogonal update".
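The sign-based update rule can be sketched element by element. The helper names and the sample values below are illustrative assumptions; note that because sqrt(v_t) + ε is always positive, the sign of the ratio equals the sign of m_t, so every non-zero entry moves by exactly emoPulse_t.

```python
import math


def sign(x):
    """Return -1, 0 or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def sign_update(m, v, pulse, eps=1e-8):
    """delta_w = -pulse * sign(m / (sqrt(v) + eps)), element by element."""
    return [-pulse * sign(mi / (math.sqrt(vi) + eps)) for mi, vi in zip(m, v)]


delta = sign_update(m=[0.3, -2.0, 0.0], v=[0.09, 4.0, 1.0], pulse=1e-3)
print(delta)
```

The entry with m = -2.0 receives the same step size as the entry with m = 0.3, which is what makes the rule insensitive to the precision of the moment estimates.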
6. The Update Rules of the "New Optimization" in EmoTion and EmoVoid, and a Bridge to the Future
Respect for existing methods and the positioning of EmoTion / EmoVoid:
The update algorithms of EmoTion / EmoVoid start from deep respect for the Adam family, the driving force of modern deep learning. The concept of an adaptive learning rate introduced by the Adam family established the conditions under which optimization can be carried out in practice and greatly lowered the barrier to adoption.
EmoTion / EmoVoid inherit that spirit while taking a different approach: they use "geometry (W-Ref Geometry) and emotion (emoPulse) instead of statistics".
A new form of accuracy:
Whereas the Adam family carves out a path meticulously from "past statistics", EmoTion / EmoVoid walk the landscape more supplely through a "dialogue with the current weights" and the "pulse of the loss". The aim is a "natural convergence that suppresses overfitting" while keeping accuracy on a par with the Adam family.
Gentle on resources (VRAM reduction):
Compute is finite, and not everyone has access to abundant high-performance resources. EmoTion hands the role of the second moment, an accurate mechanism that the Adam family keeps with care, over to "scalar control", which cuts the VRAM load roughly in half. EmoVoid keeps neither the first nor the second moment and reflects the orthogonality of W and G directly, which pushes the VRAM load down to the limit. We believe this can become the basis of a "democratic training environment" in which more people can carry out AI training.
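Geometric momentum control with W-Ref Geometry:
The core of both algorithms is a geometric update rule based on the orthogonality (Orthogonality) of the weight vector W and the gradient vector G. Whereas conventional statistical methods depend on the accumulation of past gradients (a shadow), W-Ref Geometry takes the current weight W as the "substance" of reference and derives the freshness (Freshness) of the gradient G from the following cosine similarity ρ (rho):
ρ(rho) = | <W, G> | / ( ||W|| * ||G|| + eps )
The smaller ρ (rho) is (the closer to orthogonal), the more the current gradient is judged to carry "unknown information" that is not contained in the existing weight structure; momentum is reduced and the current gradient is taken in strongly. This geometric "selection of information" achieves two things at once: accurate changes of direction without statistical lag, and a regularization effect from suppressing redundant updates.
Why EmoTion works with the first moment only:
EmoTion does not drop the second moment (variance estimate) merely to save memory. Because W-Ref Geometry bases its updates on the "freshness of direction" rather than the "magnitude" of the gradient, most of the role of the second moment becomes unnecessary. In the direction selection of W-Ref Geometry, the closer the gradient G is to orthogonal to the weight W, the more unknown information it is judged to contain, and momentum is weakened so that the update can steer toward the new direction. Conversely, gradients parallel to W are regarded as redundant and momentum takes priority. This selection based on the "purity of direction" is more direct than variance estimation, is robust to noise, and helps suppress overfitting.
※ EmoVoid keeps neither the first nor the second moment.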
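A more detailed explanation follows. Details of the W-Ref Geometry method
1. Definition of the geometric index ρ (Orthogonality Index)
Whereas conventional optimizers adjust the learning rate using the "magnitude of the gradient" (L2 norm) or its "statistical variance" (second moment), EmoTion defines the "orientation of the gradient vector G relative to the current weight vector W" as the freshness of information.
ρ_t (rho_t) = | <W_t, G_t> | / ( ||W_t|| * ||G_t|| + eps )
Orthogonal state (ρ → 0): the gradient is orthogonal to the current weight structure. This suggests "an entirely new direction of knowledge that the current model does not yet have".
Parallel state (ρ → 1): the gradient points in the same direction as the current weights (or exactly the opposite direction). This suggests that it may be "redundant information that amounts to rescaling the current weights".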
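The index ρ_t can be computed for one parameter tensor (flattened to a list) as in the sketch below. The function name and the default `eps` are assumptions for this example.

```python
import math


def orthogonality_index(w, g, eps=1e-12):
    """rho = |<W, G>| / (||W|| * ||G|| + eps) for two flat vectors."""
    dot = sum(wi * gi for wi, gi in zip(w, g))
    norm_w = math.sqrt(sum(wi * wi for wi in w))
    norm_g = math.sqrt(sum(gi * gi for gi in g))
    return abs(dot) / (norm_w * norm_g + eps)


print(orthogonality_index([1.0, 0.0], [0.0, 3.0]))   # orthogonal state
print(orthogonality_index([1.0, 0.0], [-2.0, 0.0]))  # anti-parallel state
```

Because of the absolute value, a gradient pointing exactly against the weights is treated the same as one pointing along them, which matches the description of the parallel state above.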
2. Adaptive momentum control (Geometric Momentum Blending)
This update rule adjusts momentum dynamically according to the "freshness" of the gradient. It replaces the variance estimate of the conventional second moment with a measure of geometric information overlap.
m_t = beta1 * m_{t-1} + (1 - beta1) * Freshness_t * G_t
where Freshness_t = 1.0 - EMA(rho_t)
Theoretical interpretation: when the gradient is "orthogonal" (fresh), the momentum (the shadow of the past) is temporarily weakened so that the update reacts to (steers toward) the new information immediately. When it is "parallel" (redundant), the momentum is kept and stability takes priority. This can be viewed as reinterpreting "statistical uncertainty" (variance) as "geometric information overlap".
※ Simplification in EmoVoid: EmoVoid removes even this momentum control and multiplies the update vector directly by Freshness. This frees the m_t slot in memory entirely while still performing the geometric selection of information.
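The blending rule above can be sketched as follows. The function name and `beta1=0.9` are assumptions for illustration, and `rho_ema` stands for the already-smoothed value EMA(rho_t), whose smoothing factor this section does not specify.

```python
def blend_momentum(m_prev, grad, rho_ema, beta1=0.9):
    """m_t = beta1 * m_{t-1} + (1 - beta1) * Freshness_t * G_t."""
    freshness = 1.0 - rho_ema
    return [beta1 * mp + (1.0 - beta1) * freshness * gi
            for mp, gi in zip(m_prev, grad)]


m_prev = [0.5, 0.5]
grad = [1.0, -1.0]
print(blend_momentum(m_prev, grad, rho_ema=0.0))  # fresh gradient is mixed in fully
print(blend_momentum(m_prev, grad, rho_ema=1.0))  # redundant gradient is ignored
```

With rho_ema = 1 the new gradient contributes nothing and the buffer only decays by beta1, so the previous direction is what remains.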
3. Sign-based update rule as a substitute for L2 normalization
The final key to making EmoTion and EmoVoid "second-moment-free" (or entirely moment-free) is sign extraction (Sign) together with decoupled Weight Decay. Because the update direction is decided by sign(m_t) alone, the step size of the weights is no longer affected by the "magnitude" of the gradient. This makes stable updates possible that are robust to fluctuations in gradient scale and to noise.
EmoTion update rule:
W_{t+1} = W_t * (1 - emoPulse_t * lambda) - emoPulse_t * sign(m_t)
( emoPulse is the learning rate derived from dNR; lambda is the WeightDecay coefficient )
EmoVoid update rule:
W_{t+1} = W_t − emoPulse_t * sign(G_t) * (1 − ρ_t)
( Thanks to its self-suppression mechanism, EmoVoid can converge stably without an explicit lambda )
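Both update rules can be written out for flat weight lists. This is a sketch of the formulas as printed, not the released optimizers: the function names, `weight_decay=0.01` and the sample values are assumptions.

```python
def sign(x):
    """Return -1, 0 or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def emotion_step(w, m, pulse, weight_decay=0.01):
    """W_{t+1} = W_t * (1 - pulse * lambda) - pulse * sign(m_t)."""
    return [wi * (1.0 - pulse * weight_decay) - pulse * sign(mi)
            for wi, mi in zip(w, m)]


def emovoid_step(w, g, pulse, rho):
    """W_{t+1} = W_t - pulse * sign(G_t) * (1 - rho_t)."""
    return [wi - pulse * sign(gi) * (1.0 - rho) for wi, gi in zip(w, g)]


w = [1.0, -1.0]
print(emotion_step(w, m=[0.2, -0.3], pulse=1e-3))
print(emovoid_step(w, g=[0.2, -0.3], pulse=1e-3, rho=0.25))
```

In the EmoVoid form the factor (1 − ρ_t) shrinks the whole step when the gradient is nearly parallel to the weights, which is the self-suppression the note above refers to.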
| ⻠å®äœåç §åæé©åïœ£ã®æå±ïŒ åŸæ¥ã®æé©åã éå»ã®åŸé (å±¥æŽ)ã远ããããææ³ã§ããã®ã«å¯Ÿããæ¬ææ³ã¯ ïœ¢çŸåšã®éã¿ïœ£(å®äœ)ãšã®çžé¢ãæŽæ°ã®ããªã¬ãŒã«ããææ³ã Weight-Reference æ³ (W-Ref æ³)ã確ç«ããã | |
| ⻠次å ã®åªããžã®å¹ŸäœåŠçè§£éïŒ é«æ¬¡å 空éã«ããããã¯ãã«ã®éäžçŸè±¡(äºãã«çŽäº€ããããæ§è³ª)ãå©çšããçŽäº€ããã®å ããªïœ¢ãºã¬ïœ£ãæ å ±ã®éè€(åé·æ§)ãšããŠæ€ç¥ããã ããã«ãããçµ±èšçãªåæ£æšå®ã«é Œãããšããããé«ç²ŸåºŠãã€äœé å»¶ãªæ £æ§å¶åŸ¡ãå®çŸããã 髿¬¡å 空é(æ°åãã©ã¡ãŒã¿ã®å±€ãªã©)ã§ã¯ãäºã€ã®ãã¯ãã«ãå¶ç¶ã«å¹³è¡ã«ãªã確çã¯æ¥µããŠäœããã»ãŒå šãŠã®ãã¯ãã«ã¯çŽäº€ãããã Ï ã 0 ããå°ãã§ãé¢ãã(å¹³è¡ã«è¿ã¥ã)ããšã¯ãçµ±èšç㫠極ããŠåŒ·ãçžé¢ïœ£(éè€)ãæå³ããããšã«ãªãã ã€ãŸããéå»ã®èšå€§ãªçµ±èš(2次ã¢ãŒã¡ã³ã)ãåç §ããã«ãçŸåšã®éã¿ãšã®é¢ä¿æ§ã ãã§ïœ¢ãã®æŽæ°ã«äŸ¡å€ãããããå³åº§ã«å€å¥å¯èœãšãªãã | |
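※ Resonance with emoPulse: emoPulse controls the pulse along the time axis (when and how much to move), and W-Ref Geometry decides the direction along the space axis (where and how far to move). This integrated, autonomous control of time and space is the core that reconciles VRAM reduction with accurate convergence, and it improves the robustness of training.
4. Lightweight implementation through an approximation of W-Ref Geometry (Approx W-Ref Geometry)
In theory, W-Ref Geometry measures the orthogonality of weights and gradients exactly, as follows:
ρ_t (rho_t) = | <W_t, G_t> | / ( ||W_t|| * ||G_t|| + eps )
In very large models, however, computing the inner product and the norms over every layer, and then the cosine similarity, step by step becomes a bottleneck in both VRAM and compute. The implementation therefore introduces an approximation of W-Ref Geometry. It preserves the "essence" of W-Ref Geometry while bringing the additional VRAM usage close to zero.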
4-1. EmoTion: estimating the freshness of direction from the change in the L1 norm
EmoTion estimates "how strongly the model is trying to move in a new direction" from the change in the L1 norm of the weights as a whole.
g_ratio_t = | L1_t - L1_{t-1} | / ( L1_{t-1} + eps )
Freshness_t = min( g_ratio_t / freshness_scale , freshness_cap )
This Freshness_t is used as the mixing ratio into the first moment (exp_avg). It reproduces, in a lightweight form, the behavior of the exact W-Ref Geometry measurement: a strong reaction to orthogonal directions and retained momentum for parallel directions.
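The approximate freshness can be sketched from the two formulas above. The defaults `freshness_scale=0.01` and `freshness_cap=1.0` are placeholders chosen for this example; the text does not give their values.

```python
def approx_freshness(l1_now, l1_prev,
                     freshness_scale=0.01, freshness_cap=1.0, eps=1e-12):
    """Freshness from the relative change of the weights' L1 norm."""
    g_ratio = abs(l1_now - l1_prev) / (l1_prev + eps)
    return min(g_ratio / freshness_scale, freshness_cap)


print(approx_freshness(100.5, 100.0))  # small relative change
print(approx_freshness(110.0, 100.0))  # large change, limited by the cap
```

Only two scalars per step (the current and previous L1 norm) are needed, which is why this variant adds almost no memory compared with computing the inner product and both norms.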
4-2. EmoVoid: approximation by "direct scaling" of the weight energy
EmoVoid keeps neither the first nor the second moment, so it performs no momentum control of the freshness kind.
g_ratio_t = L1_{t-1} / ( L1_t + eps )
W_t ← W_t * g_ratio_t
Instead, it scales the L1 norm of the weights as a whole directly, which approximately preserves the "purity of direction" of W-Ref Geometry. The scaling in EmoVoid is applied only during the warm-up period and the final stabilization period; at all other times no scaling is applied and the update uses sign(G_t) alone. This establishes the "geometric self-suppression" particular to EmoVoid: the energy of the weights does not run away, bias in the gradient direction is suppressed, and stable convergence is possible without any moments.
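The direct scaling step multiplies the weights by the ratio of the previous L1 norm to the current one, which brings the L1 norm back to its previous value. The helper names below are assumptions, and the decision of when to apply the scaling (warm-up and final stabilization) is left to the caller.

```python
def l1_norm(w):
    """Sum of absolute values of a flat weight list."""
    return sum(abs(x) for x in w)


def rescale_to_previous_l1(w, l1_prev, eps=1e-12):
    """W_t <- W_t * g_ratio_t with g_ratio_t = L1_{t-1} / (L1_t + eps)."""
    g_ratio = l1_prev / (l1_norm(w) + eps)
    return [x * g_ratio for x in w]


w_after_update = [2.0, -2.0]          # L1 norm has grown to 4.0
w_rescaled = rescale_to_previous_l1(w_after_update, l1_prev=2.0)
print(w_rescaled, l1_norm(w_rescaled))
```

The direction of the weight vector is unchanged by the scaling; only its overall size is pulled back, which is the sense in which the weight energy is kept from running away.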
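4-3. Significance of the approximation: the approximate version is designed as an implementation-level optimization, not as a complete version of the theory
The two optimizers differ in how they handle the "time axis" (emoPulse) and the "space axis" (W-Ref Geometry), but in the end both realize "geometric optimization that does not rely on statistics". EmoTion uses momentum control based on Freshness, and EmoVoid uses self-suppression through energy correction, yet both share the core of W-Ref Geometry, namely the "evaluation of the purity of direction".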
5. Requests to computational frameworks (PyTorch and others)
W-Ref Geometry and Approx W-Ref, as proposed in this paper, have the potential to break through the memory-efficiency limits of current deep learning frameworks. We therefore strongly request that tensor libraries such as PyTorch implement the following feature in the future.
Request: a native implementation of a geometric correlation function for weights and gradients, torch.geom_relation(W, G)
At present, computing the orthogonality (ρ) of the weight W and the gradient G requires an inner product, the two norms, and intermediate tensors to hold them, which causes a non-negligible compute overhead and VRAM pressure.
If a native function were implemented that refers to W and G directly at the C++/CUDA level, creates no intermediate tensors, and returns
ρ_t (rho_t) = | <W_t, G_t> | / ( ||W_t|| * ||G_t|| + eps )
(the degree of orthogonality for each individual parameter layer)
as a scalar, then updates based on geometric confidence would become possible with minimal VRAM and without keeping a second moment (variance statistics). We are convinced that this would not merely speed up optimization but would be the last piece that settles the "democratization of large-scale model training" on edge devices and in resource-limited environments.
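7. Theoretical Connection to Flow-Matching and Its Structural Limits
The EmoSens generation (Sens / Airy / Cats / Tion / Void) has the following two implications for Flow-Matching (FM) methods.
(1) This method is the first optimizer in the world to adapt fully to the update structure of Flow-Matching.
(2) At the same time, it exposes the structural limits of Flow-Matching methods and points to what lies beyond them.
1. The structural constraint of "noise intolerance" in Flow-Matching
Flow-Matching aims to reproduce a continuous-time flow field faithfully, so it places strong demands on the smoothness and consistency of the gradient field. This design, however, carries a structural constraint: it is inherently unable to tolerate noise.
- Small disturbances in the gradient lead directly to a breakdown of the flow field
- The reliability of the gradient drops sharply in quantized and low-precision environments
- There is no buffering structure that absorbs noise, so generalization suffers
Indeed, in FM training it is known that a drop in SNR leads directly to divergence and collapse. This is consistent with the experimental results on SDXL, the VAE, and the re-initialized default model described later.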
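2. The opposite design of emoPulse: "accepting and using noise"
Because emoPulse is built around the time-series statistics of the loss, it treats noise not as an error to be removed but as a signal that indicates the progress of training.
- The higher-order moment approximation by Multi-EMA actively uses fluctuations that include noise
- trust_t is a definition of confidence that presupposes the presence of noise
- emoPulse converts noise into a source of learning-rate control through dynamic SNR estimation
With this structure, the emo family follows a design philosophy opposite to that of Flow-Matching: it "gains generalization while tolerating noise".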
3. The paradox that "full adaptation" to Flow-Matching exposes its limits
By adapting fully to the update structure of Flow-Matching, the emo-family optimizers bring the inherent weakness of FM methods into the sharpest relief.
- The smooth gradient field that FM requires is hard to obtain in a real training process
- Noise intolerance is fatal in low-precision and quantized environments
- A noise-driven update rule such as emoPulse fits real training better
In particular, the experimental result that emoPulse overcame the noise fragility of FM methods in SDXL e-pred + ZtSNR training and completed training without stagnation strongly supports this paradox.
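4. The limits of Flow-Matching methods and the transition to next-generation optimization
Flow-Matching has an attractive theoretical framework, the reproduction of an idealized continuous flow, but it is fragile against the noise, quantization, nonlinearity, and dynamic change of higher-order moments that occur in real training. An LLM learns a probability distribution autoregressively and therefore presupposes an SDE-like view of the world, whereas Flow-Matching requires a deterministic ODE, so the two premises collide at a fundamental level.
emoPulse not only fills this gap but also presents a new optimization approach, the "emotional circulation system", which uses noise actively. Because emoPulse dynamically absorbs the fluctuation of autoregressive entropy, FM-like smooth training becomes possible for LLMs as well.
- Full-layer LoRA on SDXL
- Full-layer retraining of the VAE
- Extreme training on a single image
- Stable training of a re-initialized default model
These experimental results (see the supplementary material) show that emoPulse remains stable in the areas where Flow-Matching struggles. This structure is not a successor to Flow-Matching; it is a foundation for next-generation optimization that goes beyond the premises of Flow-Matching itself.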
5. emoPulse inherently has a structure that reduces "SDE → DDE → ODE"
Because the history terms of the Multi-EMA decay exponentially, the delay terms effectively vanish within a finite time, and the solution trajectory of the DDE connects naturally to a smooth ODE approximation.
- SDE-like fluctuation: the instantaneous variation of sigma_t and trust_t
- DDE-like delay: the history dependence of Multi-EMA, dNR_hist, N_t, and d_t
- ODE-like smoothness: the "smooth approximation of the landscape" obtained by integrating the loss over time
In other words, emoPulse naturally carries a three-layer reduction in which an SDE reduces to an ODE by way of a DDE.
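The claim that EMA history terms effectively vanish in finite time can be checked with a short calculation: after k steps, an old value retains a weight of (1 − α)^k. The sketch below computes how many steps are needed for that weight to fall below a tolerance. The function name, the tolerance and the sample values of α are assumptions for illustration.

```python
import math


def steps_until_negligible(alpha, tol=1e-3):
    """Smallest k with (1 - alpha)**k <= tol."""
    return math.ceil(math.log(tol) / math.log(1.0 - alpha))


for alpha in (0.3, 0.05, 0.01):
    print(alpha, steps_until_negligible(alpha))
```

A slower EMA (smaller α) keeps its delay term alive for longer, so the long EMA sets the horizon after which the dynamics behave like the smooth ODE limit described above.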
- The "continuous flow" idea of FM is absorbed into emoPulse
- The "noise intolerance" of FM is overcome by emoPulse
- The "rigor of the SDE" in FM becomes unnecessary
emoPulse integrates SDE fluctuation → DDE delay → ODE smoothness into a single update rule. This three-layer structure naturally unites the stochastic autoregressive fluctuation inherent in LLMs with the smooth continuous flow of Flow-Matching. As a result, Flow-Matching completes its role, and the essence of its smooth continuous flow lives on as an "ODE approximation" inside emoPulse and in new methods that will appear in the future.
8. Conclusion
The EmoSens generation (v3.7 and later) completes a "circulation of emotion" that begins with observing the loss.
Observation (Multi-EMA): captures the undulation of the landscape.
Judgment (Trust): switches between confidence and hesitation at the ±0.5 boundary.
Action (emoPulse): determines the step size through autonomous pulsation.
The method is a democratic optimization framework that lets AI learn diverse cultures and languages autonomously, even in research environments in developing countries and on low-resource compute.
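Acknowledgments
First, I offer my deepest thanks to EmoNavi, to the many optimizers that came before EmoSens, and to the researchers behind them. Their passion and insight made the conception and realization of this proof possible.
This paper gives a mathematical account of the already published EmoSens generation (v3.7 and later) and its variations. I believe that the EmoSens generation I created (including its derivatives) can contribute to the development of AI. Building on this paper, let us create further evolved optimizers together.
I close this paper with expectation and gratitude toward the future researchers who will bring the next insights and ideas. Thank you.
Closing remarks
This algorithm does not aim to replace the excellent optimization methods that already exist. It is proposed as one more option for deepening the "dialogue with the model" during training. I would be glad if it helps users choose a partner that suits their own goals and sensibilities in the process of cultivating knowledge together.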
Supplementary material (1): Analysis of the dynamics of emoPulse in v3.7 and later
1. Purpose
We analyze what the interaction (tug-of-war) between the "instantaneous D / N estimate" and the "temporal D / N estimate", introduced in v3.7, means physically for the dynamic control of the learning rate.
2. Properties: the dynamic balance between instantaneous doubt and temporal trust
Instantaneous base (noise_base): noise_base = abs( scalar_t - trust_t ) + ε_s. This measures the divergence between the "current emotion scalar" (the wave) and the "current trust". When they disagree (the divergence is large), the system holds a "strong doubt" (instantaneous noise) about the present state and increases the denominator.
Temporal base (d_base): d_base = abs( noise_est_t - d_est_t ) + ε_t. This measures the difference between "noise as history" (the average of the waves) and "trust as history". It represents the "confidence in the update" (temporal distance) derived from past context.
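3. Effects: creating a dynamic rhythm
Effect A: immediate braking on sudden change. When a sudden change in the loss makes scalar and trust diverge, noise_base (the denominator) becomes dominant. Even if the temporal history is still stable, the learning rate is throttled at once as an instantaneous judgment, and divergence is prevented.
Effect B: self-acceleration during stable periods. When training is going well (scalar and trust are stable) and the historical confidence (d_base) has built up, the squared term maximizes the output of the dNR coefficient: dNR_now_val = ( d_base / noise_base )^2. In stable regions the step size therefore widens naturally and convergence accelerates.
Effect C: maintaining stability through history (dNR_hist). Even when the instantaneous dNR_now_val is high, the growth limit dNR_hist * μ_g suppresses excessive acceleration. In regions that cannot be trusted, the deceleration pressure dNR_hist * μ_d keeps the exploration cautious.
※ The asymmetry of Effect C works through the gating condition "d_base <= dNR_hist and trust >= 0.5". It models mathematically how the system responds differently to favorable signals and to cautionary signals. In terms of the scalar value, the LR is accelerated in the range 0 to ±0.5, but when the LR is accelerated in the negative direction, that acceleration is not added to the growth of the LR history. (Beyond ±0.5 the LR is decelerated unconditionally, treating the state as a crisis that goes beyond caution.) LR acceleration in the negative direction of the scalar is an acceleration that trusts the "corrected update direction"; it inherits emoDrive from the preceding EmoNavi generation of the emo family, which exploits the time lag between the EMA and the loss (the delay of the EMA). (The present work belongs to the EmoSens generation of the emo family.)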
                 |--Danger--|---Wary---|---Fine---|--Danger--|  Emotion
Sigma_t [Minus]  |---(-)---0.5---(+)---0---(+)---0.5---(-)---|  [Plus]
                 |--Hist(-)-|-Hist(Non)|--Hist(+)-|--Hist(-)-|  Reglet
μ_g and μ_d:
v3.7: [Acceleration: LR Growth Max 1.05x] / [Deceleration: LR Decay 0.98x]
v3.8: [Acceleration: LR Growth Max 1.50x] / [Deceleration: LR Decay 0.80x]
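The practical difference between the two coefficient sets can be illustrated by counting how many consecutive capped growth steps are needed to raise dNR_hist by a given factor. The dictionary layout, the function name and the target factor of 10 are assumptions for this example; the coefficients are the ones listed above.

```python
import math

COEFFS = {
    "v3.7": {"grow": 1.05, "decay": 0.98},
    "v3.8": {"grow": 1.50, "decay": 0.80},
}


def steps_to_grow(factor, grow):
    """Minimum number of capped growth steps to raise dNR_hist by `factor`."""
    return math.ceil(math.log(factor) / math.log(grow))


for version, coeff in COEFFS.items():
    print(version, steps_to_grow(10.0, coeff["grow"]))
```

The larger growth cap lets the history, and with it the learning rate, rise in far fewer steps, at the price of a stronger cut whenever the deceleration branch fires.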
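4. Conclusion on numerical stability
This design, which sets the "time axis" (history) against the "instant axis" (the present), is more than simple decay. Because the system continually recomputes the ratio of "doubt" (Noise) to "confidence" (Distance) on its own, it achieves a heartbeat-like dynamic control that adapts to the complexity of the landscape, something a manual scheduler cannot do.
※ EmoTion and EmoVoid are original designs that were put into practical use in v3.8.
※ The coefficients of dNR_hist differ between v3.7 and v3.8; v3.8 is bolder and produces larger changes than v3.7.
The "synthesis of flat minima by multi-source positioning" presented below is a hypothesis derived from intuition and experiment.
We hope that the next generation of researchers will refine this intuition into a rigorous mathematical proof.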
An autonomous flat-minima generation model based on multi-angle synthesis of local solutions: a proposal for the Emo-multiple integration method
(Autonomous Flat-Minima Generation via multiple Positioning of Heterogeneous Optimizers)
- A proposal for a new training method: a conjecture on "evolutionary flat-minima formation" through local synthesis with the emo family -
1. Purpose: solving the high cost of reaching flat minima
In existing training methods, it is standard practice to improve generalization and to reach flat minima by using
・a single optimizer
・long iterative training
This requires many kinds of resources, compute included, and it is not something everyone is in a position to do.
This proposal aims to change that high-cost structure itself by using the emo-family optimizers.
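2. Proposal: "create" flat minima rather than "search" for them
The emo family (EmoSens, EmoAiry, EmoCats, EmoTion, EmoVoid) uses different update rules but shares a common training structure. Training them under identical conditions therefore yields differing results, that is, "local solutions reached from different directions".
Integrating these differing results amounts to a synthesis of local solutions, and we conjecture that this synthesis may make the local solution wide and flat. In other words, it may bring the local solution closer to a flat minimum, or turn it into one.
If these local solutions are obtained as full-layer LoRAs and integrated with a merging method such as TALL-Mask-Merge, then:
∨∨∨ → \___/   Image of merging local solutions
(local solutions from several directions)   (flattened after merging)
・The "commonly low parts" of the local solutions from several directions are emphasized
・The parts that are sharp in different directions (sharp minima) cancel out
・As a result, a shape close to a flat valley floor (flat minimum) is reconstructed
This treats the local solutions as multi-source positioning (positioning from several directions). It is a new training method that does not "search for flat minima" but instead "creates flat minima through synthesis".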
3. Summary: this integration shortens training
In concrete terms, instead of running full-layer LoRA or FFT (full fine-tuning) for a long time, somewhat shallow training runs are carried out and then combined with a merging method such as TALL-Mask-Merge. We expect that this may make high-accuracy training results attainable even when resources are limited.
The concrete procedure of this proposal is as follows:
・Do not run full-layer LoRA or FFT for a long time with a single kind of optimizer
・Run a shallow training with each of the emo-family optimizers
・Integrate the results with TALL-Mask-Merge
As a result,
・without relying on long training
・even in resource-limited environments
・a high-accuracy model close to flat minima may be obtained.
In short, the idea is to shorten training by "creating" flat minima rather than "aiming for" them.
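The intuition that shared components survive a merge while disagreeing components cancel can be shown with plain averaging of weight deltas. This sketch is deliberately not TALL-Mask-Merge: it is a simple element-wise mean used as a stand-in, and the three delta vectors are toy numbers invented for the illustration.

```python
def average_deltas(deltas):
    """Element-wise mean of several weight deltas (trained model minus base)."""
    n = len(deltas)
    return [sum(column) / n for column in zip(*deltas)]


# Toy deltas: the first coordinate is shared, the other two disagree.
deltas = {
    "Sens": [1.0, 0.5, -0.4],
    "Airy": [1.0, -0.5, 0.1],
    "Cats": [1.0, 0.0, 0.3],
}
merged = average_deltas(list(deltas.values()))
print(merged)
```

The first coordinate, on which all three runs agree, is kept at full strength, while the coordinates on which they disagree shrink toward zero. A mask-based merge refines this idea by deciding per weight which contributions to keep.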
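4. Conclusion: integrating heterogeneous emotion-driven models (Emotional Ensemble)
The optimizers proposed in this work (Sens, Airy, Cats, Tion, Void) each introspect the loss landscape from a different mathematical basis. The "flat-minima synthesis by multi-angle positioning" proposed here integrates these training results, produced under identical conditions, through mask merging (TALL-Mask-Merge and similar). It makes it possible to obtain "structural stability" and "expressive precision" at the same time, which a single optimization algorithm cannot reach. We expect this to become a new optimization paradigm that shifts the training process from a pursuit along the time axis to a spatial, multi-angle integration.
5. Supplement: how to try full-layer LoRA integration
In the emo-family integration, each training result is first merged into the base model, and these new variant models are then integrated into the base model with TM-merge.
Base model (org) ≪= TM integration ≪= Model S (Sens), Model A (Airy), Model C (Cats), Model T (Tion), Model V (Void)
Rather than merging the LoRAs directly with one another and then merging the result into the base model, the new models are folded back into the base model with TM-merge.
For FFT, we expect that simply applying TM-merge of the post-FFT models into the base model has an equivalent effect.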
6. Background: the diversity of landscape introspection across different optimizer lineages
The multi-source positioning (Multi-Positioning) proposed here actively exploits the differences in search behavior that arise from the different "lineages" of the algorithms.
Statistical inheritance group:
EmoSens (Adam type): fine-grained gradient estimation using the first and second moments
EmoAiry (Adafactor type): low-memory, global curvature approximation through matrix factorization
EmoCats (Lion type): robust, noise-tolerant search through sign extraction
These inherit the orthodox essence of existing optimization theory while incorporating the time-series SNR control of emoPulse, which frees them from manual schedulers.
Geometric evolution group:
EmoVoid / EmoTion (W-Ref type):
These keep no statistics and update on the basis of the freshness of purely geometric information, namely the "orthogonality" of weights and gradients.
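What lies behind training that progresses without loss saturation
- Observations on a loss that keeps falling with little stagnation -
With this method, the loss is often observed to keep falling with almost no stagnation or saturation. In particular, the fact that it keeps falling to about half of the first-step loss value raises the concern, "When will it converge?" Yet the training results show no sign of overfitting or collapse and maintain entirely normal generalization performance. An intuitive reading is that the optimizer may be "learning a repair of the base model as a difference". This is only a hypothesis; as with the creation of flat minima above, we hope that the next generation of researchers will refine it into a rigorous mathematical proof.
In addition, the following ensures that "as long as the loss value has any amplitude, the pulse (emoPulse) does not stop":
noise_base = abs(sigma_t - trust_t) + ε_s
d_base = abs(N_t - d_t) + ε_t
The terms ε_s and ε_t produce the behavior of a sustained decrease without stagnation and provide the driving force to keep searching for flat minima. This means that training converges once the differences in the loss value disappear. With this design, in a training test on simplenet (FashionMNIST), we confirmed reproducibly that the loss reached 0.30 or below within 10000 steps.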
| SDXLãçšããå®èšŒå®éšã§ã¯ãåäžä»£ EmoNavi ãšãã®ããªãšãŒã·ã§ã³ã§ãå®çŸå¯èœãª e-pred ïŒ ZtSNR ã§ã®åŠç¿ãããã® EmoSens ãšããªãšãŒã·ã§ã³ã§ã宿œã§ããã ãã㯠FM(Flow-Matching) ã«ããããã€ãºãžã®èæ§ãšãsampler 察å¿ã«ã€ããŠã®èª²é¡ã解決ããåæã« e-pred ã®åŒ±ç¹ãšãããè²åçãžã®èª²é¡ã解決ããŠããã æåž«ç»å10æçšã§ã®300epochåŠç¿ãåæ»ãªãå®äºãéåŠç¿åŸåããªãå šå±€LoRAã®äœæã«ãæåããŠããã | |
| äžèšãã¹ããããã«æ¥µç«¯åããç»åïŒæã§ã®300stepã宿œãããšããããåæ»ãªãå®äºãåŠç¿çµæã®ç Žç¶»ããŠããªãããšã確èªããã æ¥µç«¯ãªåŠç¿èšå®ã宿œããŠãç Žç¶»ããªãïŒãã®çç±ã¯ãã€ãºãèç©ããªãæŽæ°ã宿œããŠãããšèããã ãããããã€ãºãšã¯åŸ®å°ããŒã¿ã®éã¿ã¥ãã«èª€ããçããããšã§ãã€ãºåããŠãããšèãããããã®ã§ããã埮å°ããŒã¿ãé©åã«æŽæ°ããããšã§è²Žéãªæ å ±ãä¿è·ãç¶æããããšã§ãã€ãºãçãŸãªãããšãèèŠã§ãããšèããã | |
| ããã« SDXL VAE ã®å šå±€åŠç¿(ãšã³ã³ãŒããšãã³ãŒãã®äž¡é¢) ã宿œããã ãããŸã§ VAE ååŠç¿ã§ã¯ã¢ãã«ãšã®æŽåæ§ãæãªãããŠããŸããçµæçã«çæçµæã®ç Žç¶»ã瀺ãããã«ãªãããæ¬ç ç©¶ã§ææ¡ããŠããæé©ååšã§ã¯ãã®æŽåæ§ãç¶æãæãªããªãããšã確èªããã ãã㯠VAE ã®åå©çšæ§ãåäžããããšãšãã«ãã¢ãã«ã®å©çšå¯èœæéãå»¶é·ããããšã«è²¢ç®ããã ãããšèããã | |
| 極éçãã€ãºã¢ãã«åŠç¿ã®èå¯ãSDXL ããã©ã¢ãã«åæå(ã©ã³ãã å€ã«ããéã¿åæå)ã宿œãããããåŠç¿å ã¢ãã«ãšããå šå±€LoRAåŠç¿ã宿œããã éåžžã§ããã°æ°stepã§çºæ£ããŸãã¯NaNãšãªãåŠç¿ã¯ç Žç¶»ããããEmoSensäžä»£ã¯ããããåŠç¿ãé²è¡ãã1500stepãå®äºããã ãã®LoRAã¯ç Žç¶»ããã¯ãã§ãããããã®äºæ³ãè£åãç Žç¶»ãªãåæååã®SDXLããã©ã¢ãã«ãžæ£åžžé©çšå¯èœã§ãã£ãã é©ãããšã«ããã®LoRAã¯ããã©ã¢ãã«ä»¥åã®ç¶æ ãšããŠåŠç¿ããŠãããããããã©ã¢ãã«ã®èŠæãšããæ°Žå¹³ç·ãå°å¹³ç·ã®é£ç¶æ§ãåäžãããäž»é¡ãè·šãã éã®äœçœ®ããçãè£æ£ãããã®ãšãªã£ã(掟çSDXLã¢ãã«ã«ãé©çšå¯èœã§åæ§ã®å¹æãæããŠãã) ãã®ãã¹ããã EmoSensäžä»£ã®å®å®æ§ãšå®å šæ§ã¯åªããé 奿§ãåããŠãããšç¢ºèªã§ããã | |
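| ※ The same effect of this LoRA has been observed across multiple seeds, so it may exhibit a "regularization-like behavior" that reduces certain SDXL artifacts. However, it cannot be determined at this point whether the effect was acquired through intentional learning or results from coincidental alignment. Please take this only as confirmation that training progresses stably under extreme conditions. | |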
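| Conjecture on Grokking | |
| In this work we focused on the behavior of a continuous decrease in the loss value with little stagnation, and ran various tests to examine its cause. In particular, as an extreme training condition, we evaluated "how far training can progress safely and stably with only a single image." As a result, none of the typical failures (overfitting, collapse into a copy state, interference with unrelated prompts) was observed, and the training results were extremely stable. | |
| From these results, we conjecture that grokking is a "stagnation phenomenon" caused by the combination of the following two factors. | |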
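| - Noise learned during training accumulates, so the inaccuracy that must be corrected in the second half of training grows and the model's visibility deteriorates sharply (a whiteout / blackout phenomenon) | |
| - In the second half of training, the phase where correction is needed most, the scheduler and gradient statistics suppress the LR and the LR drops to an extremely low value | |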
| When these two occur at the same time, the model is thought to lose its essential sense of direction and fall into a long period of stagnation. In other words, we consider grokking to be an avoidable phenomenon. | |
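The second factor can be illustrated with a standard cosine annealing schedule. This is a generic sketch, not a scheduler taken from this work; the function name `cosine_lr` and the 10,000-step horizon are assumptions chosen for illustration.

```python
import math


def cosine_lr(step, total_steps, base_lr):
    """Standard cosine annealing from base_lr down to 0 (no warmup, no floor)."""
    return 0.5 * base_lr * (1.0 + math.cos(math.pi * step / total_steps))


total = 10_000
# Fraction of the initial LR that remains at selected points in training.
fractions = {s: cosine_lr(s, total, 1.0) for s in (0, 5_000, 9_000, 9_500)}
```

Under this schedule the rate is already below 1% of its initial value at 95% of training, which is the regime where, according to the conjecture above, corrections are needed most.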
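| The reason the emo family (EmoSens generation) can avoid grokking is clear. | |
| Because this method makes the following kinds of update possible, it keeps visibility clear at all times and does not lose the driving force needed to continue learning. | |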
| - Updates remain accurate and do not accumulate noise | |
| - The necessary LR is secured autonomously even in the second half of training | |
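| Even if visibility does become poor, the emotion mechanism as a whole acts like a high-precision GPS and the accurate heartbeat of emoPulse keeps the optimizer moving, so it can approach flat minima and the global optimum naturally, without passing through grokking. | |
| Grokking has been discussed as "inexplicable delayed generalization," but in light of the SDXL training results described above, we consider that the essence of the phenomenon can be regarded as stagnation caused by structural defects on the algorithm side. dNR detects signs of incorrect weighting and unorganized small-scale data, then identifies and corrects contradictions with the abstract structure. We consider that if fine-grained data are handled correctly, a generalizing solution forms early. | |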
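| Future Work: Introducing Adaptive Accuracy Judgment via Higher-Order Moment Approximation | |
| As a future direction, we are considering a "higher-order accuracy judgment mechanism" based on, for example, a higher power of dNR (corresponding to a higher-order moment). The idea is not to use this higher-order information directly as the output of emoPulse (the emoPulse mechanism stays as it is), but to use it as a meta-indicator that evaluates the "purity" of the current training progress. We expect this to detect early signs of overfitting on very small datasets sooner and to raise the precision of autonomous control to its limit. Alternatively, accuracy might be detectable from the difference between past and present values in the dNR history. This would be introduced only as needed, however, and based on the validation results so far we judge that there is no need to hurry. | |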
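The history-based alternative mentioned above could be sketched as follows. Everything here is hypothetical: the class name `DnrHistory`, the window size, and the choice of "mean of the newer half minus mean of the older half" as the difference measure are assumptions for illustration, not part of the emoPulse implementation.

```python
from collections import deque


class DnrHistory:
    """Hypothetical helper: keeps recent dNR values and compares old vs new."""

    def __init__(self, window=8):
        self.window = window
        # Hold two windows: the older half and the newer half.
        self.values = deque(maxlen=2 * window)

    def push(self, dnr):
        self.values.append(float(dnr))

    def drift(self):
        """Mean of the newer half minus mean of the older half; None until full."""
        if len(self.values) < 2 * self.window:
            return None
        vals = list(self.values)
        old, new = vals[: self.window], vals[self.window:]
        return sum(new) / self.window - sum(old) / self.window


hist = DnrHistory(window=2)
for v in (1.0, 1.0, 3.0, 3.0):
    hist.push(v)
drift_value = hist.drift()
```

A meta-indicator of this kind only observes dNR; it does not feed back into the learning rate, in line with the statement that the emoPulse mechanism itself stays unchanged.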
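| Outlook for Mathematical Analysis | |
| We think that a mathematical analysis of this work may conclude that it is an SDE method that nevertheless behaves like an ODE. The emoPulse update rule contains both stochastic fluctuation and temporal smoothness, and its behavior may have a distinctive structure located on the boundary between SDE and ODE. (Since the loss value is the result of training, and this method is built around it and derives its updates from that result, we expect it to be ODE-like.) What continuous-time interpretation the history formed by Multi-EMA and the evolution of the internal variables can have is an important question left to future mathematical research. This paper only indicates the intuitive direction, and we look to future researchers to develop the detailed analysis. | |
| ※ The reduction process SDE → DDE → ODE in this paper is a hypothesis based on physical intuition and experimental facts. We leave the task of describing this transition in rigorous formulas to future researchers. What new mathematical order is hidden in the pulse that emoPulse beats? We believe that filling in this blank space is the true beginning of the "dialogue with the model." | |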
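As one starting point for such a continuous-time reading, a single EMA stage can be written as a forward-Euler step of a first-order linear ODE. The notation below is assumed for illustration: $m_t$ is one EMA of the loss, $L_t$ the loss value, $\alpha$ the EMA coefficient, $\Delta t$ the step size and $\tau$ the resulting time constant. This covers one EMA stage only and makes no claim about the full emoPulse dynamics.

```latex
\begin{align}
m_{t+1} &= (1-\alpha)\, m_t + \alpha\, L_t \\
m_{t+1} - m_t &= \alpha\,\bigl(L_t - m_t\bigr) \\
\frac{dm}{dt} &= \frac{L(t) - m(t)}{\tau}, \qquad \tau = \frac{\Delta t}{\alpha}
\end{align}
```

Stacking several stages with different $\alpha$ gives several time constants $\tau$, which is one way to view the history carried by Multi-EMA.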
| References | |
| Kingma, D. P., & Ba, J. (2014). Adam: A Method for Stochastic Optimization. arXiv preprint arXiv:1412.6980. (Foundation of adaptive learning rates using first- and second-order moments) | |
| Reddi, S. J., Kale, S., & Kumar, S. (2018). On the Convergence of Adam and Beyond. ICLR. (Discussion of convergence guarantees via AMSGrad and related methods, and of the stability of the second-order moment) | |
| Defazio, A., & Mishchenko, K. (2023). Learning-Rate-Free Learning by D-Adaptation. ICML. (Theoretical framework that estimates the distance D to the optimum and removes the need for manual learning-rate tuning) | |
| Orabona, F., & Tommasi, T. (2017). Training Deep Networks without Learning Rates Through Coin Betting. NeurIPS. (COCOB: theory of autonomous control of parameter updates using the concept of betting fractions) | |
| Luo, L., Xiong, Y., Liu, Y., & Sun, X. (2019). Adaptive Gradient Methods with Dynamic Bound of Learning Rate. ICLR. (AdaBound: improved generalization through dynamic clipping of the learning rate) | |
| Shazeer, N., & Stern, M. (2018). Adafactor: Adaptive Learning Rates with Sublinear Memory Cost. ICML. (Memory savings through matrix factorization, and normalization techniques for low-precision environments) | |
| Bernstein, J., Wang, Y. X., Azizzadenesheli, K., & Anandkumar, A. (2018). signSGD: Compressed Optimisation for Non-Convex Problems. ICML. (Gradient compression via sign encoding, and proofs for a highly noise-tolerant update rule) | |
| Chen, X., et al. (2023). Symbolic Discovery of Optimization Algorithms. arXiv. (Lion: symbolic discovery of efficient search through the separation of the sign operation and weight decay) | |
| Allen-Zhu, Z. (2017). Natasha 2: Faster Non-Convex Optimization Than SGD. arXiv. (Acceleration of non-convex optimization using higher-order information, and theory of escaping local solutions) | |