Solving the Production and Storage Problem by Dynamic Programming

Content published/updated: 2026/3/12 7:16:31

Dynamic Programming

I. Development of dynamic programming and its scope of study

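Dynamic programming is a branch of operations research and a mathematical method for optimizing decision processes. In the early 1950s the American mathematician R. E. Bellman and his colleagues, while studying the optimization of multistage decision processes, proposed the well-known principle of optimality. It converts a multistage problem into a series of single-stage problems that are solved one at a time, and it gave rise to a new method for this class of process optimization problems: dynamic programming. Bellman's book Dynamic Programming, published in 1957, was the first work in the field.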

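Since its introduction, dynamic programming has been widely applied in economic management, production scheduling, engineering, and optimal control. For problems such as shortest routes, inventory management, resource allocation, equipment replacement, combination, sequencing, and loading, it is often simpler to apply than other methods.

II. Basic concepts of dynamic programming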

A dynamic programming model of a multistage decision process optimization problem usually includes the following elements.

1. Stage

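A stage is a natural division of the whole process. Stages are usually defined by time order or by spatial features. For a "static" optimization problem that has nothing to do with time or space, a notion of "period" can be imposed artificially, based on the natural features of the problem, so that the static problem becomes dynamic and can be solved stage by stage. The stage variable is usually written $k = 1, 2, \dots, n$.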

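2. State

A state is the initial situation, or objective condition, of the problem under study (also called the system) at each stage. It should describe the features of the process and have the no-aftereffect (memoryless) property: once the state at some stage is given, the evolution of the process after that stage does not depend on the states of the earlier stages. The state is usually also required to be observable, directly or indirectly. A variable that describes the state is called a state variable and is denoted $s$; the set of values it takes is called the state set and is denoted $S$. The range of values the variable is allowed to take is called the set of admissible states. Let $x(k)$ denote the state variable at stage $k$; it may be a number or a vector. Let $X(k)$ denote the set of admissible states at stage $k$.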

A decision process with $n$ stages has $n+1$ state variables; $x(n+1)$ is the result of the evolution of $x(n)$.

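Depending on how the process evolves, the state variable may be discrete or continuous. For ease of computation a continuous variable is sometimes discretized, and for ease of analysis a discrete variable is sometimes treated as continuous.

3. Decision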

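Once the state of a stage is determined, various choices can be made, each of which moves the process to some state of the next stage. Such a choice is called a decision; in optimal control problems it is also called a control. A variable that describes the decision is called a decision variable. The range of values it is allowed to take is called the set of admissible decisions. Let $u_k(x(k))$ denote the decision variable at stage $k$ when the process is in state $x(k)$; it is a function of $x(k)$. Let $U_k(x(k))$ denote the set of admissible decisions for $x(k)$. The decision variable is called the decision for short.

4. Policy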

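A sequence of decisions is called a policy. The policy for the whole process, starting from the initial state $x(1)$, is written

$$p_{1n}(x(1)) = \{u_1(x(1)), u_2(x(2)), \dots, u_n(x(n))\}.$$

The policy for the tail subprocess that runs from the state $x(k)$ at stage $k$ to the terminal state is written

$$p_{kn}(x(k)) = \{u_k(x(k)), \dots, u_n(x(n))\}, \quad k = 2, \dots, n-1.$$

The policies available for selection lie within a certain range, called the set of admissible policies, denoted $P_{1n}(x(1))$, $P_{kn}(x(k))$, and so on.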

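5. State transition equation

In a deterministic process, once the state and the decision at a stage are known, the state at the next stage is completely determined. This law of evolution is expressed by the state transition equation, written

$$x(k+1) = T_k(x(k), u_k), \quad k = 1, 2, \dots, n,$$

where $u_k$ is the decision taken at stage $k$.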
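As an illustration, the state transition can be sketched for a production and storage setting. The sketch assumes that the state is the inventory at the start of a stage, that the decision is the quantity produced in that stage, and that the demand of each stage is known in advance. The function names, the demand figures, the capacity, and the sample policy are all hypothetical and are not taken from the text.

```python
from typing import List

def admissible_decisions(x: int, demand: int, capacity: int) -> List[int]:
    """Production levels u for which the next inventory stays non-negative."""
    lowest = max(0, demand - x)          # at least cover the shortfall
    return list(range(lowest, capacity + 1))

def transition(x: int, u: int, demand: int) -> int:
    """State transition x(k+1) = x(k) + u(k) - d(k) (assumed form)."""
    return x + u - demand

# Trace the states generated by one hypothetical policy.
demands = [2, 3, 2, 4]      # assumed demand d(k) for stages 1..4
policy = [5, 0, 6, 0]       # assumed production decisions u(k)
states = [0]                # assumed initial inventory x(1) = 0
for u, d in zip(policy, demands):
    assert u in admissible_decisions(states[-1], d, capacity=6)
    states.append(transition(states[-1], u, d))

print(states)  # [0, 3, 0, 4, 0]
```

The four stages produce five states, which matches the remark that an n-stage process has n+1 state variables.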

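6. Stage index function

For the state $x(k)$ at stage $k$, carrying out the decision $u_k$ not only moves the system to a new state but also produces a local benefit for stage $k$, which is part of the total benefit. This benefit is called the stage index function (stage effective function) and is written $v_k(x(k), u_k)$.

7. Process index function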

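A quantitative index that measures how well a policy or subpolicy performs is called the process index function (process effective function). It is defined on every tail subprocess starting at stage $k$ and is usually denoted $V_{kn}$, that is,

$$V_{kn} = V_{kn}(x(k), u_k, x(k+1), \dots, x(n+1)), \quad k = 1, 2, \dots, n.$$

When $k = 1$ it is the index function of the whole process. If the state $x(k)$ and the subpolicy $p_{kn}$ are given, then $V_{kn}$ is determined, so $V_{kn}$ is a function of $x(k)$ and $p_{kn}$, written $V_{kn}(x(k), p_{kn})$. Common process index functions are a sum or a product of the stage index functions $v_j$:

$$V_{kn} = \sum_{j=k}^{n} v_j(x(j), u_j), \qquad V_{kn} = \prod_{j=k}^{n} v_j(x(j), u_j).$$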
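The sum and product forms can be illustrated with a few lines of code. This is a minimal sketch that assumes the stage index values along one fixed policy have already been computed; the helper names and all numbers are hypothetical.

```python
from math import prod
from typing import Sequence

def additive_index(stage_values: Sequence[float], k: int = 1) -> float:
    """V_{k,n} as the sum of stage indices v_j for j = k, ..., n (k is 1-based)."""
    return sum(stage_values[k - 1:])

def multiplicative_index(stage_values: Sequence[float], k: int = 1) -> float:
    """V_{k,n} as the product of stage indices v_j for j = k, ..., n."""
    return prod(stage_values[k - 1:])

# Hypothetical stage costs of a four-stage production plan (additive case).
stage_costs = [9.5, 0.0, 11.0, 0.0]
print(additive_index(stage_costs))        # whole process, k = 1: 20.5
print(additive_index(stage_costs, k=3))   # tail subprocess from stage 3: 11.0

# Hypothetical per-stage reliabilities (multiplicative case).
reliabilities = [0.9, 0.8, 0.95]
print(multiplicative_index(reliabilities))  # roughly 0.684
```

Setting k = 1 gives the index of the whole process; a larger k gives the index of the corresponding tail subprocess.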

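8. Optimal index function

The optimal value of the process index function $V_{kn}$ is called the optimal index function (optimum effective function) and is written $f_k(x(k))$. It represents the total benefit obtained by the tail subprocess once the optimal subpolicy has been adopted:

$$f_k(x(k)) = \operatorname*{opt}_{p_{kn} \in P_{kn}(x(k))} V_{kn}(x(k), p_{kn}),$$

where opt is short for optimization and is taken as max or min depending on the problem.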
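Read literally, this definition says that the optimal index is the best value of the process index over all admissible policies. The sketch below applies that reading by brute force to a small production and storage instance, with opt taken as min. The demand, capacity, and cost figures are illustrative assumptions, as are the requirements that inventory never goes negative and that the final inventory is zero; none of them comes from the text.

```python
from itertools import product

DEMAND = (2, 3, 2, 4)               # assumed demand for stages 1..4
CAPACITY = 6                        # assumed production capacity per stage
SETUP, UNIT, HOLD = 3.0, 1.0, 0.5   # assumed setup, unit, and holding costs

def process_index(x1, policy):
    """Total cost of a policy from inventory x1, or None if not admissible."""
    x, total = x1, 0.0
    for u, d in zip(policy, DEMAND):
        x_next = x + u - d
        if x_next < 0:              # demand cannot be met
            return None
        production = SETUP + UNIT * u if u > 0 else 0.0
        total += production + HOLD * x_next
        x = x_next
    return total if x == 0 else None   # assumed: no stock left at the end

def optimal_index(x1):
    """Minimum of the process index over all admissible policies."""
    candidates = []
    for policy in product(range(CAPACITY + 1), repeat=len(DEMAND)):
        value = process_index(x1, policy)
        if value is not None:
            candidates.append((value, policy))
    return min(candidates)

print(optimal_index(0))  # (20.5, (5, 0, 6, 0))
```

Enumeration examines every one of the 7^4 candidate policies here, and the count grows exponentially with the number of stages, which is why a recursive characterization of the optimal index is preferred in practice.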

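III. The principle of optimality and the basic functional equation of dynamic programming

The principle of optimality plays the central role in dynamic programming: "An optimal policy for the whole process has the property that, whatever the past states and decisions were, the remaining decisions must constitute an optimal subpolicy with respect to the state resulting from the earlier decisions."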
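The principle can be seen at work on a small production and storage instance. The sketch below is one possible way to apply it, not a definitive implementation: the best cost from a given stage and inventory level is computed from the best cost of the following stage, using a memoized recursion. The demand, capacity, and cost figures, the zero final inventory requirement, and the function names are all assumptions made for illustration.

```python
from functools import lru_cache

DEMAND = (2, 3, 2, 4)               # assumed demand d(k) for stages 1..4
CAPACITY = 6                        # assumed production capacity per stage
SETUP, UNIT, HOLD = 3.0, 1.0, 0.5   # assumed setup, unit, and holding costs
N = len(DEMAND)

def stage_cost(k, x, u):
    """Stage index: production cost plus holding cost on end-of-stage stock."""
    production = SETUP + UNIT * u if u > 0 else 0.0
    return production + HOLD * (x + u - DEMAND[k - 1])

@lru_cache(maxsize=None)
def f(k, x):
    """Minimum cost and decisions of the tail subprocess from stage k, state x."""
    if k == N + 1:
        # Terminal state: no more decisions; assumed final stock must be zero.
        return (0.0, ()) if x == 0 else (float("inf"), ())
    best = (float("inf"), ())
    # Admissible decisions keep the next inventory non-negative.
    for u in range(max(0, DEMAND[k - 1] - x), CAPACITY + 1):
        tail_cost, tail_policy = f(k + 1, x + u - DEMAND[k - 1])
        cost = stage_cost(k, x, u) + tail_cost
        if cost < best[0]:
            best = (cost, (u,) + tail_policy)
    return best

cost, policy = f(1, 0)
print(cost, policy)  # 20.5 (5, 0, 6, 0)
```

Each call fixes only the current decision and reuses the optimal result of the remaining stages, so every tail of the returned plan is itself optimal for the state it starts from, as the principle requires.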

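The key to solving a problem by dynamic programming is to give a recurrence relation, generally called the basic functional equation.

Noting the no-aftereffect property, the optimal index function for a process index function of sum form is

$$f_k(x(k)) = \operatorname*{opt}_{u_k \in U_k(x(k))} \{ v_k(x(k), u_k) + f_{k+1}(x(k+1)) \}, \quad k = 1, 2, \dots, n.$$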

When $k = n$, $x(n+1)$ is the terminal state of the whole decision process and no further decisions are made after it; therefore,