ÄÚÈÝ·¢²¼¸üÐÂʱ¼ä : 2026/3/12 7:16:31ÐÇÆÚÒ» ÏÂÃæÊÇÎÄÕµÄÈ«²¿ÄÚÈÝÇëÈÏÕæÔĶÁ¡£
¶¯Ì¬¹æ»® Ò»¡¤¶¯Ì¬¹æ»®·¨µÄ·¢Õ¹¼°ÆäÑо¿ÄÚÈÝ
¶¯Ì¬¹æ»®ÊÇÔ˳ïѧµÄÒ»¸ö·ÖÖ§£¬ÊÇÇó½â¾ö²ß¹ý³Ì×îÓÅ»¯µÄÊýѧ·½·¨¡£20ÊÀ¼Í50Äê´ú³õÃÀ¹úÊýѧ¼ÒR.E.BELLMANµÈÈËÔÚÑо¿¶à½×¶Î¾ö²ß¹ý³ÌµÄÓÅ»¯ÎÊÌâʱ£¬Ìá³öÁËÖøÃûµÄ×îÓÅ»¯ÔÀí£¬°Ñ¶à½×¶ÎÎÊÌâת»¯ÎªÒ»ÏµÁеĵ¥½×¶ÎÎÊÌ⣬Öð¸öÇó½â ´´Á¢Á˽â¾öÕâÀà¹ý³ÌÓÅ»¯ÎÊÌâµÄз½·¨¡ª¡ª¶¯Ì¬¹æ»®¡£1957Äê³ö°æµÄËûµÄÃûÖø¡¶Dynamic Proggramming¡·£¬ÕâÊǸÃÁìÓòµÄµÚÒ»±¾Öø×÷¡£
¶¯Ì¬¹æ»®ÎÊÊÀÒÔÀ´£¬ÔÚ¾¼Ã¹ÜÀí¡¤Éú²úµ÷¶È¡¤¹¤³Ì¼¼ÊõºÍ×îÓÅ¿ØÖƵȷ½ÃæµÃµ½Á˹㷺µÄÓ¦Óá£ÀýÈç×î¶Ì·Ïß¡¤¿â´æ¹ÜÀí¡¤×ÊÔ´·ÖÅ䡤É豸¸üС¤×éºÏ¡¤ÅÅÐò¡¤×°ÔصÈÎÊÌ⣬²ÉÓö¯Ì¬¹æ»®·¨Çó½â±ÈÓÃÆäËû·½·¨¸üΪ¼ò±ã¡£ ¶þ¡¤¶¯Ì¬¹æ»®·¨»ù±¾¸ÅÄî
Ò»¸ö¶à½×¶Î¾ö²ß¹ý³Ì×îÓÅ»¯ÎÊÌâµÄ¶¯Ì¬¹æ»®Ä£ÐÍͨ³£°üÀ¨ÒÔϼ¸¸öÒªËØ£º 1£® ½×¶Î
½×¶Î£¨stage£©ÊǶÔÕû¸ö¹ý³ÌµÄ×ÔÈ»»®·Ö¡£Í¨³£¸ù¾Ýʱ¼ä˳Ðò»òÊǿռäÌØÕ÷À´»®·Ö½×¶Î£¬¶ÔÓÚÓëʱ¼ä£¬¿Õ¼äÎ޹صġ°¾²Ì¬¡±ÓÅ»¯ÎÊÌ⣬¿ÉÒÔ¸ù¾ÝÆä×ÔÈ»ÌØÕ÷£¬ÈËΪµÄ¸³Ó衰ʱ¶Î¡±¸ÅÄ½«¾²Ì¬ÎÊÌ⶯̬»¯£¬ÒԱ㰴½×¶ÎµÄ˳Ðò½âÓÅ»¯ÎÊÌâ¡£½×¶Î±äÁ¿Ò»°ãÓÃk=1.2¡.n.±íʾ¡£
1. ״̬
״̬(state)ÊÇÎÒÃÇËùÑо¿µÄÎÊÌ⣨Ҳ½Ðϵͳ£©ÔÚ¹ý¸ö½×¶ÎµÄ³õʼ״̬»ò¿Í¹ÛÌõ¼þ¡£ËüÓ¦ÄÜÃèÊö¹ý³ÌµÄÌØÕ÷²¢ÇÒ¾ßÓÐÎÞºóЧÐÔ£¬¼´µ±Ä³½×¶ÎµÄ״̬¸ø¶¨Ê±£¬Õâ¸ö½×¶ÎÒÔºóµÄ¹ý³ÌµÄÑݱäÓë¸Ã½×¶ÎÒÔǰ¸÷½×¶ÎµÄ״̬Î޹ء£Í¨³£»¹ÒªÇó״̬ÊÇ¿ÉÒÔÖ±½Ó»òÕßÊǼä½Ó¿ÉÒÔ¹Û²âµÄ¡£ÃèÊö״̬µÄ±äÁ¿³ÆÎª×´Ì¬±äÁ¿£¨State Virable£©ÓÃs ±íʾ£¬×´Ì¬±äÁ¿µÄȡֵ¼¯ºÏ³ÆÎª×´Ì¬¼¯ºÏ£¬ÓÃS±íʾ¡£±äÁ¿ÔÊÐíȡֵµÄ·¶Î§³ÆÎªÔÊÐí״̬¼¯ºÏ(set of admissble states).ÓÃx(k)±íʾµÚk½×¶ÎµÄ״̬±äÁ¿£¬Ëü¿ÉÒÔÊÇÒ»¸öÊý»òÕßÊÇÒ»¸öÏòÁ¿¡£ÓÃX(k)±íʾµÚk½×¶ÎµÄÔÊÐí״̬¼¯ºÏ¡£
n ¸ö½×¶ÎµÄ¾ö²ß¹ý³ÌÓÐn+1¸ö״̬±äÁ¿£¬x(n+1)ÊÇx(n)µÄÑݱäµÄ½á¹û¡£
¸ù¾ÝÑݱä¹ý³ÌµÄ¾ßÌåÇé¿ö£¬×´Ì¬±äÁ¿¿ÉÒÔÊÇÀëÉ¢µÄ»òÊÇÁ¬ÐøµÄ¡£ÎªÁ˼ÆËã·½±ãÓÐʱ½«Á¬Ðø±äÁ¿ÀëÉ¢»¯£¬ÎªÁË·ÖÎöµÄ·½±ãÓÐʱÓÖ½«ÀëÉ¢µÄ±äÁ¿ÊÓΪÁ¬ÐøµÄ¡£ 2£® ¾ö²ß
µ±Ò»¸ö½×¶ÎµÄ״̬ȷ¶¨ºó£¬¿ÉÒÔ×ö³ö¸÷ÖÖÑ¡Ôñ´Ó¶øÑݱ䵽ÏÂÒ»½×¶ÎµÄij¸ö״̬£¬ÕâÖÖÑ¡ÔñÊֶγÆÎª¾ö²ß£¨decision£©£¬ÔÚ×îÓÅ¿ØÖÆÎÊÌâÖÐÒ²³ÆÎª¿ØÖÆ£¨control£©ÃèÊö¾ö²ßµÄ±äÁ¿³ÆÎª¾ö²ß±äÁ¿£¨decision virable£©¡£±äÁ¿ÔÊÐíȡֵµÄ·¶Î§³ÆÎªÔÊÐí¾ö²ß¼¯ºÏ£¨set of admissble
decisions£©¡£ÓñíʾµÚk½×¶Î´¦ÓÚ½×¶Îx(k)µÄ¾ö
±íʾx(k)µÄÔÊÐí
¡£
²ß±äÁ¿£¬ËüÊÇx(k)µÄº¯Êý£¬Óþö²ß¼¯ºÏ¾ö²ß±äÁ¿¼ò³Æ¾ö²ß¡£4.²ßÂÔ
¾ö²ß×é³ÉµÄϵÁгÆÎª²ßÂÔ£¨policy£©¡£Óɳõʼ״̬x1¿ªÊ¼µÄÈ«¹ý³ÌµÄ²ßÂÔ¼Ç×÷
.
.
ÓɵÚk½×¶ÎµÄ״̬x(k)¿ªÊ¼µ½ÖÕֹ״̬µÄºó²¿×Ó¹ý³ÌµÄ²ßÂÔ
,
;k=2,¡,n-1.
¿É¹©Ñ¡ÔñµÄ²ßÂÔÓÐÒ»¶¨µÄ·¶Î§£¬³ÆÎªÔÊÐí²ßÂÔ¼¯ºÏ£¨set of admissble polices£©,ÓÃʾ¡£
5.×´Ì¬×ªÒÆ·½³Ì
ÔÚÈ·¶¨ÐÔ¹ý³ÌÖУ¬Ò»µ©Ä³½×¶ÎµÄ״̬ºÍ¾ö²ßΪÒÑÖª£¬Ï½׶εÄ״̬ƫÍêÈ«¿ÉÒÔÈ·¶¨¡£ÓÃ×´Ì¬×ªÒÆ·½³Ì£¨state transfer equations£©±íʾÕâÖÖÑÝ±ä¹æÂÉ£¬Ð´×÷£º
6.½×¶ÎÖ¸±êº¯Êý
¶ÔÓÚk½×¶ÎµÄ״̬x(k)£¬µ±Ö´ÐÐÁ˾ö²ß
ʱ£¬
,
µÈ±í
³ý´øÀ´ÏµÍ³×´Ì¬µÄ×ªÒÆÖ®Í⣬»¹²úÉúµÚk½×¶ÎµÄ¾Ö²¿Àû
Òæ£¬ËüÊÇ×ÜÐ§ÒæµÄÒ»²¿·Ö£¬³ÆÎª½×¶ÎÖ¸±êº¯Êý£¨stage effective fuction£©£¬¼Ç×÷7.¹ý³ÌÖ¸±êº¯Êý
ÓÃÀ´ºâÁ¿²ßÂÔ»òÕßÊÇ×Ó²ßÂÔÖ´ÐÐЧ¹ûµÄÊýÁ¿Ö¸±ê³ÆÎª¹ý³ÌÖ¸±êº¯Êý£¨process effective fuction£©£¬Ëü¶¨ÒåÔÚËùÓÐkºó²¿×Ó¹ý³ÌÉÏ£¬³£ÓÃÓÃ
±íʾ£¬¼´
.
k=1,2,¡,n.
µ±k=1ʱ£¬¾ÍÊÇÈ«¹ý³ÌÖ¸±êº¯Êý¡£ Èç¹û״̬x(k)ºÍ×Ó²ßÂÔÈ·¶¨ÁË£¬ËùÒÔ
ÊÇx(k)ºÍ
¸ø¶¨£¬ÄÇôµÄº¯Êý£¬¼ÇΪ£º ³£¼ûµÄ¹ý³ÌÖ¸±êº¯ÊýÊÇÁ¬ºÍ
ÐÎʽ»òÁ¬»ýÐÎʽ£º
Ò²¾Í±»
8.×îÓÅÖ¸±êº¯Êý ¹ý³ÌÖ¸±êº¯Êý
µÄ×îÓÅÖµ³ÆÎª
×îÓÅÖ¸±êº¯Êý(optimum effective fuction£©£¬¼ÇΪf(x(k).Ëü±íʾ£¬²ÉÈ¡ÁË×îÓÅ×Ó²ßÂÔ
Ö®ºó£¬ºó²¿×Ó¹ý³ÌËù»ñµÃµÄ×ÜÐ§Òæ£¬±íʾΪ£º
ʽÖÐ
optÊÇoptimizationµÄËõд£¬ÒâΪ×îÓÅ»¯£¬¿ÉÒÔ¸ù¾Ý¾ßÌåÎÊÌâÈ¥max»òmin
Èý¡¤¶¯Ì¬¹æ»®·¨µÄ×îÓÅÐÔÔÀíºÍ»ù±¾º¯Êý·½³Ì ÔÚ¶¯Ì¬¹æ»®ÖÐÆðºËÐÄ×÷ÓõÄÊÇ×îÓÅÐÔÔÀí£º¡°×÷ΪÕû¸ö¹ý³ÌµÄ×îÓŲßÂÔ¾ßÓÐÕâÑùµÄÐÔÖÊ£¬ÎÞÂÛ¹ýÈ¥µÄ״̬ºÍ¾ö²ßÈçºÎ£¬Ïà¶ÔÓÚÇ°Ãæ¾ö²ßËùÐγɵÄ״̬¶øÑÔ£¬Óàϵľö²ßϵÁбØÐë¹¹³É×îÓÅ×Ó²ßÂÔ¡£¡±
¶¯Ì¬¹æ»®½â·¨µÄ¹Ø¼üÔÚÓÚ¸ø³öÒ»ÖÖµÝÍÆ¹ØÏµ£¬Ò»°ã°ÑÕâÖÖ¹ØÏµ³ÆÎª»ù±¾º¯Êý·½³Ì£¬
×¢Òâµ½ÎÞºóЧÐÔ£¬×îÓÅÖ¸±êº¯ÊýΪ
µ±k=nʱ£¬ÓÉÓÚx(n+1)ÊÇÕû¸ö¾ö²ß¹ý³ÌµÄÖÕֹ״̬£¬ÒÔºó²»ÔÙ×ö³ö¾ö²ß£¬Òò´Ë£¬