Å·²©abg

ѧÔÚÅ·²©abg

ÄúµÄλÖ㺠Ê×Ò³ ¡¤ ѧÔÚÅ·²©abg ¡¤ ѧÊõÔ˶¯ ¡¤ ÕýÎÄ

¡¾ÊýÀíÎÄ»¯½Ú¡¤Ñ§Êõ½²×ùÔ¤¸æ¡¿ÀíѧԺÊ×½ìÊýÀíÎÄ»¯½ÚѧÊõ±¨¸æµÚ3ÆÚ¡ª¡ªApplying Machine Learning to Queueing Systems: Online Learning, Offline Learning, and Deep Learning

Ö÷½²ÈË :Dr.Yunan Liu ËùÔÚ :Å·²©abgÎ÷ÍÁ³ÇУÇø½ÌËÄ-441 ×îÏÈʱ¼ä : 2024-04-15 11:00:00

±¨¸æÈË£º±±¿¨ÂÞÀ´ÄÉÖÝÁ¢´óѧDr . Yunan Liu

ʱ¼ä£º2024Äê4ÔÂ15ÈÕ11£º00-12£º00

ËùÔÚ£ºÅ·²©abgÎ÷ÍÁ³ÇУÇø½ÌËÄ-441

±¨¸æÕªÒª£º

In this talk, we investigate new ways to apply machine learning methodologies to queueing models with applications to service systems (e.g., call centers and healthcare). Our work will cover three different machine learning paradigms: (i) online learning, (ii) offline learning, and (iii) deep learning. (i) We propose a new online reinforcement learning technique to solve a multi-period pricing and staffing problem in a service queueing system with an unknown demand curve. We develop an algorithm calledgradient-based online-learning in queues(GOLiQ) to dynamically adjust the service price p (and service rate ?) so as to maximize cumulative expected revenues (the sales revenue minus the delay penalty) over a given finite time horizon. (ii) We develop a new simulation-based offline learning algorithm that can be used to determine the required staffing function that achieves time-stable performance for a time-varying queue within a finite time. Our new algorithm, calledsimulation-based offline learning staffing algorithm(SOLSA), organizes the overall learning process into successive cycles each of which consists of two phases: (1) (Exploitation) The decision maker generates relevant queueing data via a decision-aware simulator under a candidate solution, (2) (Exploration) Using the newly collected data, improved staffing plans are prescribed and to be used to configure the simulator in the next cycle.  (iii) We develop a new deep learning method, dubbeddeep learning in non-Markovian queues(DeepLiNQ), which is an offline supervised learning method that learns the system¡¯s intrinsic characteristics using synthetic training data. In real-time applications, DeepLiNQ is built by a set of neuro networks and can be used to recursively provide estimates for the transient system waiting time performance.

ר¼Ò¼ò½é£º

ÁõÓêéª £¬ÃÀ¹ú±±¿¨ÂÞÀ´ÄÉÖÝÁ¢´óѧ¹¤ÒµÓëϵͳ¹¤³Ìϵ¸±½ÌÊÚ¡£ÓÚÇ廪´óѧµçÆø¹¤³Ìϵ»ñµÃѧʿѧλ £¬ÓÚ¸çÂ×±ÈÑÇ´óѧ¹¤Òµ¹¤³ÌÓëÔ˳ﲿ»ñ˶ʿºÍ²©Ê¿Ñ§Î»¡£Ñо¿ÐËȤ°üÀ¨Ëæ»úÄ£×Ó¡¢Ó¦ÓøÅÂÊ¡¢·ÂÕæ¡¢ÅŶÓÂÛ¡¢×îÓÅ¿ØÖƺÍÇ¿»¯Ñ§Ï°µÈ £¬²¢½«ÆäÓ¦ÓÃÓÚ¿Í»§ÁªÂçÖÐÐÄ£¨customer contact centers£©¡¢Ò½ÁƱ£½¡£¨healthcare£©¡¢Éú²úºÍÔËÊäϵͳÖС£ÎÄÕ½ÒÏþ±¾ÊÂÓòÆì½¢ÆÚ¿¯ÉÏ £¬ÈçOperations Research, Production and Operations Management, INFORMS Journal on Computing, IISE Transactions, Naval Research Logistics, Stochastic Systems, European Journal of Operational Research, Queueing Systems¡£ÈÙ»ñÑÇÂíѷѧÕߣ¨Amazon Scholar£© £¬ÓëÑÇÂíÑ·¿Í»§Ð§ÀͲ¿µÄÈ«ÇòÈÝÁ¿ÍýÏëÍŶÓÇ×½üÏàÖú¡£Ð¡ÎÒ˽¼ÒÍøÒ³£ºhttp://yunanliu.wordpress.ncsu.edu

µçÄÔ°æ

¡¾ÍøÕ¾µØͼ¡¿¡¾sitemap¡¿