Spaces:

marimo-team
/

marimo-learn

Runtime error

marimo-learn / queueing /02_queue_formation.py

Greg Wilson

feat: hiding a few cells

8eccedb 8 days ago

9.92 kB

	# /// script
	# requires-python = ">=3.13"
	# dependencies = [
	# "altair",
	# "asimpy",
	# "marimo",
	# "polars==1.24.0",
	# ]
	# ///

	import marimo

	__generated_with = "0.20.4"
	app = marimo.App(width="medium")


	@app.cell(hide_code=True)
	def _():
	import marimo as mo
	import random
	import statistics

	import altair as alt
	import polars as pl

	from asimpy import Environment, Process, Resource

	return Environment, Process, Resource, alt, mo, pl, random, statistics


	@app.cell(hide_code=True)
	def _(mo):
	mo.md(r"""
	# Queue Formation

	## Randomness Creates Waiting Even with Spare Capacity

	We now combine arrivals (Poisson at rate $\lambda$) with a server (exponential service at rate $\mu$) into a complete queue. The system is stable because $\rho = \lambda/\mu < 1$, which means that on average, the server handles more work than arrives.

	Our first question is, how long is the queue? The surprising answer is that even when the server has plenty of spare capacity, customers wait. The mean number of customers in the system (both waiting and being served) is:

	$$L = \frac{\rho}{1 - \rho}$$

	The table below gives some representative values:

	\| $\rho$ \| $L$ \|
	\|:---:\|:---:\|
	\| 0.1 \| 0.11 \|
	\| 0.5 \| 1.00 \|
	\| 0.8 \| 4.00 \|
	\| 0.9 \| 9.00 \|

	When $\rho = 0.5$, half the server's capacity is idle, but there is on average one customer in the system at any moment. That customer either had to wait for a previous customer, or is currently being served. The queue is never consistently empty, even at moderate load.

	The formula also explains why simple queues are so sensitive to utilization: $L$ blows up as $\rho \to 1$. One way to think about this is that the denominator $(1 - \rho)$ is the spare capacity. As spare capacity vanishes, queue length increases.
	""")
	return


	@app.cell
	def _(mo):
	mo.md(r"""
	## Why Queues Form at All

	With deterministic arrivals and service (every customer arrives exactly $1/\lambda$ apart and takes exactly $1/\mu$), a server with $\rho < 1$ would never form a queue: each customer would depart before the next arrived. Randomness changes this. Sometimes three customers arrive close together before the server finishes even one, so the server falls briefly behind. While it recovers, customers wait. These temporary pileups are unavoidable whenever inter-arrival or service times have any variance.

	The probability that exactly $n$ customers are in an M/M/1 system at steady state is:

	$$P(N = n) = (1 - \rho)\,\rho^n \qquad n = 0, 1, 2, \ldots$$

	This is a geometric distribution with success probability $1 - \rho$. The formula says the server is idle (i.e., n=0) with probability $1 - \rho$, which is consistent with the utilization result from the previous scenario. Each additional customer in the system is $\rho$ times less likely than the previous count.

	This formula $L = \rho/(1-\rho)$ is the foundation of the later M/M/1 nonlinearity scenario, which shows the practical consequences of the $(1-\rho)$ denominator. Every queue-length formula in queueing theory has a similar structure: a traffic factor $\rho$ divided by a spare-capacity factor $(1 - \rho)$, possibly multiplied by a variability correction.
	""")
	return


	@app.cell
	def _(mo):
	mo.md(r"""
	## Implementation

	A `Customer` process increments a shared `in_system` counter on arrival and decrements it on departure. A `Monitor` process samples `in_system[0]` every `SAMPLE_INTERVAL` time units. After the simulation, the mean of the samples estimates $L$. The theoretical value $\rho/(1-\rho)$ is computed and compared. By the law of large numbers, this converges to the true steady-state mean as the simulation time grows.

	The simulation sweeps $\rho$ from 0.1 to 0.9, confirming the formula at each load level.
	""")
	return


	@app.cell(hide_code=True)
	def _(mo):
	sim_time_slider = mo.ui.slider(
	start=0,
	stop=100_000,
	step=1_000,
	value=20_000,
	label="Simulation time",
	)

	service_rate_slider = mo.ui.slider(
	start=1.0,
	stop=5.0,
	step=0.01,
	value=2.0,
	label="Service rate",
	)

	sample_interval_slider = mo.ui.slider(
	start=1.0,
	stop=5.0,
	step=1.0,
	value=1.0,
	label="Sample interval",
	)

	seed_input = mo.ui.number(
	value=192,
	step=1,
	label="Random seed",
	)

	run_button = mo.ui.run_button(label="Run simulation")

	mo.vstack([
	sim_time_slider,
	service_rate_slider,
	sample_interval_slider,
	seed_input,
	run_button,
	])
	return (
	sample_interval_slider,
	seed_input,
	service_rate_slider,
	sim_time_slider,
	)


	@app.cell
	def _(
	sample_interval_slider,
	seed_input,
	service_rate_slider,
	sim_time_slider,
	):
	SIM_TIME = int(sim_time_slider.value)
	SERVICE_RATE = float(service_rate_slider.value)
	SAMPLE_INTERVAL = float(sample_interval_slider.value)
	SEED = int(seed_input.value)
	return SAMPLE_INTERVAL, SEED, SERVICE_RATE, SIM_TIME


	@app.cell
	def _(Process, SERVICE_RATE, random):
	class Customer(Process):
	def init(self, server, in_system):
	self.server = server
	self.in_system = in_system

	async def run(self):
	self.in_system[0] += 1
	async with self.server:
	await self.timeout(random.expovariate(SERVICE_RATE))
	self.in_system[0] -= 1

	return (Customer,)


	@app.cell
	def _(Customer, Process, random):
	class Arrivals(Process):
	def init(self, rate, server, in_system):
	self.rate = rate
	self.server = server
	self.in_system = in_system

	async def run(self):
	while True:
	await self.timeout(random.expovariate(self.rate))
	Customer(self._env, self.server, self.in_system)

	return (Arrivals,)


	@app.cell
	def _(Process, SAMPLE_INTERVAL):
	class Monitor(Process):
	"""Samples total customers in system at regular intervals."""

	def init(self, in_system, samples):
	self.in_system = in_system
	self.samples = samples

	async def run(self):
	while True:
	self.samples.append(self.in_system[0])
	await self.timeout(SAMPLE_INTERVAL)

	return (Monitor,)


	@app.cell
	def _(
	Arrivals,
	Environment,
	Monitor,
	Resource,
	SERVICE_RATE,
	SIM_TIME,
	statistics,
	):
	def simulate(rho):
	arrival_rate = rho * SERVICE_RATE
	env = Environment()
	server = Resource(env, capacity=1)
	in_system = [0]
	samples = []
	Arrivals(env, arrival_rate, server, in_system)
	Monitor(env, in_system, samples)
	env.run(until=SIM_TIME)
	sim_L = statistics.mean(samples)
	theory_L = rho / (1.0 - rho)
	return {
	"rho": rho,
	"sim_L": round(sim_L, 4),
	"theory_L": round(theory_L, 4),
	"error_pct": round(100.0 * (sim_L - theory_L) / theory_L, 2),
	}

	return (simulate,)


	@app.cell
	def _(SEED, pl, random, simulate):
	def sweep():
	rows = [simulate(rho) for rho in [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]]
	return pl.DataFrame(rows)

	random.seed(SEED)
	df = sweep()
	df
	return (df,)


	@app.cell
	def _(alt, df):
	df_plot = df.unpivot(
	on=["sim_L", "theory_L"],
	index="rho",
	variable_name="source",
	value_name="L",
	)
	chart = (
	alt.Chart(df_plot)
	.mark_line(point=True)
	.encode(
	x=alt.X("rho:Q", title="Utilization (ρ)"),
	y=alt.Y("L:Q", title="Mean customers in system (L)"),
	color=alt.Color("source:N", title="Source"),
	tooltip=["rho:Q", "source:N", "L:Q"],
	)
	.properties(title="Queue Formation: Simulated vs. Theoretical L = ρ/(1−ρ)")
	)
	chart
	return


	@app.cell
	def _(mo):
	mo.md(r"""
	## Understanding the Math

	### Why is the queue length geometric?

	The M/M/1 queue can be analyzed as a random walk on the non-negative integers. When the server is busy, the queue grows by 1 with each arrival (with rate $\lambda$) and shrinks by 1 with each service completion (with rate $\mu$). The ratio $\lambda/\mu = \rho$ is the probability that the queue grows rather than shrinks at the next event. Under steady state, the probability of being at level $n$ is proportional to $\rho^n$ — because reaching level $n$ requires $n$ consecutive "up" steps. Normalizing so the probabilities sum to 1 gives $(1-\rho)\rho^n$.

	### Deriving $L = \rho/(1-\rho)$ from the geometric distribution

	Given $P(N = n) = (1 - \rho)\rho^n$, the mean is:

	$$L = E[N] = \sum_{n=0}^{\infty} n \cdot (1-\rho)\rho^n = (1-\rho) \sum_{n=0}^{\infty} n\rho^n$$

	A result from basic calculus is that the geometric series $\sum_{n=0}^{\infty} \rho^n = 1/(1-\rho)$. Differentiating both sides with respect to $\rho$:

	$$\sum_{n=0}^{\infty} n\rho^{n-1} = \frac{1}{(1-\rho)^2}$$

	Multiply both sides by $\rho$:

	$$\sum_{n=0}^{\infty} n\rho^n = \frac{\rho}{(1-\rho)^2}$$

	Substituting back:

	$$L = (1-\rho) \cdot \frac{\rho}{(1-\rho)^2} = \frac{\rho}{1-\rho}$$

	### Checking the formula at the boundaries

	When $\rho \to 0$: there are almost no arrivals, so $L \to 0$, i.e., the server is nearly always idle.

	When $\rho \to 1$: the spare capacity is $(1-\rho) \to 0$, so $L \to \infty$, i.e., the queue grows without bound.

	Both limits match physical intuition. Note that the formula is exact (not an approximation) for an M/M/1 queues in steady state.
	""")
	return


	if __name__ == "__main__":
	app.run()