import numpy as np
import matplotlib.pyplot as plt
from scipy import stats
# For reproducibility
37)
np.random.seed(
# Function to perform a single random walk
def random_walk_1D(steps: int, p: float):
"""Simulate a 1D biased random walk."""
= np.random.choice([-1, 1], size=steps, p=[1-p, p])
moves = np.cumsum(moves) # Cumulative sum to get position over time
position return position
A random walk is a mathematical model that describes a path consisting of a series of random steps.
If \(X_n\) represents the position at step \(n\), then:
\(X_n = X_{n−1} + S_n\)
where \(S_n\) is a random step, typically drawn from a probability distribution (e.g., uniform, normal, or Bernoulli).
1D Random Walk (One-Dimensional)
A 1D random walk occurs when movement is restricted to a single axis (e.g., left or right along a line). At each step, the walker moves left (-1) or right (+1) with equal probability (if unbiased).
Example: Stock prices
2D Random Walk (Two-Dimensional)
A 2D random walk allows movement in two perpendicular directions (x and y). At each step, the walker can move up, down, left, or right.
Example: Brownian motion
Applications
1. Simulating a 1D Random Walk with Step Bias
A particle moves along a one-dimensional line (1D). At each time step, it can either move:
- +1 step to the right with probability
p
, or - -1 step to the left with probability
1−p
Suppose p=0.85
(i.e., a bias to the right).
Instructions:
- Write a Python program to simulate a 1D random walk with 1000 steps where:
- The starting position is 0.
- Each step has a probability of p=0.85 to move right and 1−p=0.15 to move left.
- Plot the position vs time graph (time on x-axis, position on y-axis).
- Run the simulation five times and overlay all five random walks in the same graph.
- Calculate and interpret:
- The final position of the particle after 1000 steps.
- The mean and standard deviation of the final position across five simulations.
Answer
def create_random_walk_1():
= 5
num_simulations = 1000
steps # Probability of moving right
= 0.85
prob_right
= range(steps)
times = []
final_positions
= plt.subplots(figsize=(10, 6))
fig, ax
# Run simulations and plot
for i in range(num_simulations):
= random_walk_1D(steps, prob_right)
positions -1]) # Store final position
final_positions.append(positions[=f"Simulation {i+1}")
plt.plot(times, positions, label
# Plot settings
=0, color='red', linestyle='--', alpha=0.3)
ax.axhline(y"Time Step")
ax.set_xlabel("Position")
ax.set_ylabel(f"1D Biased Random Walk (p={prob_right})")
ax.set_title(
ax.legend()True, alpha=0.3)
ax.grid(
# Compute statistics
= np.array(final_positions)
final_positions = np.mean(final_positions)
mean_final_position = np.std(final_positions)
std_final_position = steps * (2 * prob_right - 1)
expected_final_position = stats.t.interval(
confidence_interval_95 0.95, # Confidence level - 95%
# Degrees of freedom
num_simulations, = mean_final_position, # Sample mean
loc # Standard error
= std_final_position / np.sqrt(len(final_positions))
scale
)= (
stats_text f"Number of steps: {steps}\n"
f"Right step probability: {prob_right}\n"
f"Left step probability: {(1-prob_right):.2f}\n"
f"Final positions: {[int(pos) for pos in final_positions]}\n"
f"Mean final position: {mean_final_position:.2f}\n"
f"Standard deviation: {std_final_position:.2f}\n"
f"Expected final position: {expected_final_position:.2f}\n"
"95% CI for mean final position: "
f"[{confidence_interval_95[0]:.2f}, {confidence_interval_95[1]:.2f}]"
)
fig.text(0.5, -0.1, stats_text, ha="center", fontsize=10,
={"facecolor":"lightgray", "alpha":0.5})
bbox=[0, 0.1, 1, 0.95])
fig.tight_layout(rect
plt.show()print(stats_text)
create_random_walk_1()
Number of steps: 1000
Right step probability: 0.85
Left step probability: 0.15
Final positions: [716, 690, 678, 692, 710]
Mean final position: 697.20
Standard deviation: 13.89
Expected final position: 700.00
95% CI for mean final position: [681.23, 713.17]
Interpretation of the Results
- Bias Towards the Right:
- The particle has a high probability (
p = 0.85
) of moving right, leading to a strong rightward drift in position.
- If the movement were unbiased (
p = 0.5
), we would expect the particle to stay near 0 on average.
- The particle has a high probability (
- Final Positions:
- The five simulations resulted in final positions: [716, 690, 678, 692, 710].
- These values are all positive and close to each other, confirming the expected bias.
- The five simulations resulted in final positions: [716, 690, 678, 692, 710].
- Mean vs Expected Position:
- The mean final position from simulations is 697.20, which is close to the expected value of 700.
- The small difference arises due to random fluctuations (noise) in each walk.
- The mean final position from simulations is 697.20, which is close to the expected value of 700.
- Spread of Final Positions:
- The standard deviation is 13.89, indicating that the final position varies by about
±13.89
steps from the mean across different runs.
- Since the standard deviation is relatively small compared to 1000 steps, the bias dominates, keeping walks close to the expected mean.
- The standard deviation is 13.89, indicating that the final position varies by about
- Confidence Interval (95% CI):
- The confidence interval [681.23, 713.17] suggests that, if we were to run many more simulations, the true mean final position would lie within this range 95% of the time.
- Since the expected position 700 is well within this range, our simulations align well with theoretical predictions.
- The confidence interval [681.23, 713.17] suggests that, if we were to run many more simulations, the true mean final position would lie within this range 95% of the time.
2. Comparing 1D Random Walks with and without Drift
Suppose two particles perform 1D random walks starting from position 0: - Particle A: Moves with a drift, i.e., p=0.7 (70% chance to move right). - Particle B: Moves without drift, i.e., p=0.5(equal probability both sides).
Instructions:
- Write a Python program to simulate 1000 steps for each particle.
- Plot both random walks on the same graph with:
- Time on the x-axis.
- Position on the y-axis.
- Different colors for each particle.
- Calculate and display:
- The mean and standard deviation of the final position after 1000 steps.
- Interpret your answer
Answer
def create_random_walk_2():
= 1000
steps = range(steps)
times = []
final_positions
=(10, 6))
plt.figure(figsize
# Particle A: Moves with a drift,
# i.e., p=0.7 (70% chance to move right).
= random_walk_1D(steps, 0.7)
positionsA -1]) # Store final position
final_positions.append(positionsA[=f"Particle A (p=0.7)")
plt.plot(times, positionsA, label
# Particle B: Moves without drift,
# i.e., p=0.5(equal probability both sides).
= random_walk_1D(steps, 0.5)
positionsB -1]) # Store final position
final_positions.append(positionsB[=f"Particle B (p=0.5)")
plt.plot(times, positionsB, label
# Plot settings
=0, color='gray', linestyle='--', alpha=0.3)
plt.axhline(y"Time Step")
plt.xlabel("Position")
plt.ylabel(
plt.title(f"1D Biased Random Walk: Particle A (p=0.7) vs. Particle B (p=0.5)")
plt.legend()True, alpha=0.3)
plt.grid(
# Compute statistics
= np.array(final_positions)
final_positions = np.mean(final_positions)
mean_final_position = np.std(final_positions)
std_final_position = steps * (2 * 0.7 - 1)
expected_final_position_A = steps * (2 * 0.5 - 1)
expected_final_position_B = stats.t.interval(
confidence_interval_95 0.95, # Confidence level - 95%
2, # Degrees of freedom
= mean_final_position, # Sample mean
loc # Standard error
= std_final_position / np.sqrt(len(final_positions))
scale
)= (
stats_text f"Number of steps: {steps}\n"
f"Right step probability: {[0.7, 0.5]}\n"
f"Left step probability: {[0.3, 0.5]}\n"
f"Final positions: {[int(pos) for pos in final_positions]}\n"
f"Mean final position: {mean_final_position:.2f}\n"
f"Standard deviation: {std_final_position:.2f}\n"
f"Expected final position (A): {expected_final_position_A:.2f}\n"
f"Expected final position (B): {expected_final_position_B:.2f}\n"
"95% CI for mean final position: "
f"[{confidence_interval_95[0]:.2f}, {confidence_interval_95[1]:.2f}]"
)
plt.figtext(0.5, -0.15, stats_text, ha="center", fontsize=10,
={"facecolor":"lightgray", "alpha":0.5})
bbox=[0, 0.1, 1, 0.95])
plt.tight_layout(rect
plt.show()print(stats_text)
create_random_walk_2()
Number of steps: 1000
Right step probability: [0.7, 0.5]
Left step probability: [0.3, 0.5]
Final positions: [406, 32]
Mean final position: 219.00
Standard deviation: 187.00
Expected final position (A): 400.00
Expected final position (B): 0.00
95% CI for mean final position: [-349.94, 787.94]
Interpretation of the Answer
- Observed vs. Expected Behavior
- Particle A (with drift) ended up at 406, which is close to its expected final position of 400 (since \(\mathbb{E}[X_A] = (2p - 1) \times N = (2(0.7) - 1) \times 1000 = 400\)).
- Particle B (without drift) ended up at 32, which is close to the expected value of 0.
- The mean final position (219.00) reflects the contribution of both particles, primarily influenced by Particle A.
- Variability & Standard Deviation
- The standard deviation (187.00) indicates a significant spread in possible final positions.
- This is expected, as random walks have inherent variance.
- Confidence Interval Analysis
- The 95% confidence interval for the mean final position is [-349.94, 787.94].
- This wide range suggests that while Particle A tends to drift right, there is considerable randomness in individual runs.
3. Simulating a 2D Random Walk (Unbiased)
A mosquito trapped in a square grid moves randomly: - Up, Down, Left, or Right with equal probability (25%) in each direction. - The mosquito starts at coordinate (0,0).
Instructions:
- Write a Python program to simulate a 2D random walk for 500 steps.
- Plot the path of the mosquito (X vs Y) using a scatter plot or line plot.
- Calculate and display:
- The final position after 500 steps.
- The total distance from the origin after 500 steps.
- Run the simulation 10 times and calculate:
- The average distance from the origin after 500 steps.
- The standard deviation of the distance.
- Interpret your answer
Answer
def create_random_walk_3():
= 500
steps def random_walk_2D(steps: int):
# Define possible moves (Up, Down, Left, Right)
= np.array([[0, 1], [0, -1], [-1, 0], [1, 0]])
moves # Starting position (0,0)
= np.array([0, 0])
position = [position.copy()]
path for _ in range(steps):
= moves[np.random.choice(4)] # Choose a random move
step += step # Update position
position
path.append(position.copy())return np.array(path), position
def plot_walk(path):
=(8, 8))
plt.figure(figsize
plt.plot(0], path[:, 1], marker='o', markersize=2,
path[:, ='-', alpha=0.6)
linestyle0, 0, c='red', marker='o', label='Start (0,0)')
plt.scatter(
plt.scatter(-1, 0], path[-1, 1], c='blue', marker='o',
path[=f'End {tuple(path[-1].tolist())}')
label'X Position')
plt.xlabel('Y Position')
plt.ylabel(
plt.title(('2D Random Walk (500 Steps) - '
'A mosquito trapped in a square grid'))
plt.legend()
plt.grid()
plt.show()
# Run simulation 10 times and gather results
= 10
num_simulations = []
distances = []
final_positions
for _ in range(num_simulations):
= random_walk_2D(steps)
path, final_position # Euclidean distance from origin
= np.linalg.norm(final_position)
distance
distances.append(distance)
final_positions.append(final_position)if _ == 0: # Plot first simulation
plot_walk(path)
# Compute statistics
= np.mean(distances)
average_distance = np.std(distances)
std_dev_distance
= (
stats_text f"Number of steps: {steps}\n"
f"Distances: {[int(dist) for dist in distances]}\n"
f"Final position after 500 steps in first run: {final_position}\n"
f"Distance from origin in first run: {distances[0]:.2f}\n"
f"Average distance from origin over {num_simulations} "
f"runs: {average_distance:.2f}\n"
f"Standard deviation of distance: {std_dev_distance:.2f}"
)# Display results
print(stats_text)
create_random_walk_3()
Number of steps: 500
Distances: [29, 6, 15, 32, 14, 29, 20, 21, 34, 31]
Final position after 500 steps in first run: [25 19]
Distance from origin in first run: 29.97
Average distance from origin over 10 runs: 23.65
Standard deviation of distance: 9.06
Explanation of the Result:
- Final position after 500 steps in the first run:
[25, 19]
- This means that after 500 random steps, the mosquito ended up at coordinates (9 -29) on the grid.
- The displacement is not necessarily centered around (0,0) due to randomness.
- Distance from origin in the first run:
29.97
The Euclidean distance from the origin \((0,0)\) is given by:
\(d = \sqrt{(-7)^2 + (-35)^2} = \sqrt{49 + 1225} = \sqrt{1274} \approx 22.09\)
This measures how far the mosquito moved from the starting point after 500 steps.
- Average distance from origin over 10 runs:
23.65
This is the mean of the final distances over 10 independent simulations.
Even though the mosquito moves randomly, the expected distance from the origin is approximately proportional to the square root of the number of steps:
\(E[d] \approx \sqrt{500} \approx 22.36\)
The observed average of
9.06
is lower, due to random variation in the limited number of simulations.
- Standard deviation of distance:
9.06
- This represents the spread of distances across 10 runs.
4. Comparing 2D Random Walks with Bias vs No Bias
A person walks randomly in a 2D grid but with a slight bias towards the East (right).
In each step: - Move East: 40% probability - Move West: 20% probability - Move North: 20% probability - Move South: 20% probability
The person starts at (0,0). Instructions:
- Write a Python program to simulate:
- 500 steps for the biased random walk.
- 500 steps for an unbiased random walk (equal probability).
- Plot both paths on the same graph with:
- Different colors for each walk.
- Scatter plot showing the final position.
- Calculate and display:
- The final position after 500 steps.
- The total distance from the origin for both walks.
- Run the simulation 10 times and compute:
- The average distance from the origin for both biased and unbiased walks.
- The standard deviation of the distance.
- Interpretation:
- Why does the biased random walk drift to the east?
- How does drift affect the standard deviation of the final position?
- What real-world phenomena could this simulation represent (e.g., wind drift, ocean currents)?
Answer
def create_random_walk_4():
def random_walk(steps, biased=False):
= [0], [0] # Start at origin
x, y
for _ in range(steps):
= np.random.choice(
direction 'E', 'W', 'N', 'S'],
[=[0.4, 0.2, 0.2, 0.2] if biased
pelse [0.25, 0.25, 0.25, 0.25])
if direction == 'E':
-1] + 1)
x.append(x[-1])
y.append(y[elif direction == 'W':
-1] - 1)
x.append(x[-1])
y.append(y[elif direction == 'N':
-1])
x.append(x[-1] + 1)
y.append(y[else: # 'S'
-1])
x.append(x[-1] - 1)
y.append(y[
return x, y
def plot_walks(biased_walk, unbiased_walk):
=(8, 8))
plt.figure(figsize*biased_walk, label='Biased Walk (East)', color='red')
plt.plot(*unbiased_walk, label='Unbiased Walk', color='blue')
plt.plot(
plt.scatter(*biased_walk[:, -1], color='red', marker='o',
='Biased Final Position')
label
plt.scatter(*unbiased_walk[:, -1], color='blue', marker='o',
='Unbiased Final Position')
label0, color='black', linestyle='--', linewidth=0.5)
plt.axhline(0, color='black', linestyle='--', linewidth=0.5)
plt.axvline(
plt.legend()"X Position")
plt.xlabel("Y Position")
plt.ylabel("Comparison of Biased vs Unbiased Random Walks")
plt.title(
plt.grid()
plt.show()
def distance_from_origin(position):
return np.sqrt(position[0]**2 + position[1]**2)
# Run simulations
= 10
num_simulations = []
biased_distances = []
unbiased_distances
for _ in range(num_simulations):
= np.array(random_walk(500, biased=True))
biased_walk = np.array(random_walk(500, biased=False))
unbiased_walk
= biased_walk[:, -1]
biased_final = unbiased_walk[:, -1]
unbiased_final
biased_distances.append(distance_from_origin(biased_final))
unbiased_distances.append(distance_from_origin(unbiased_final))
if _ == 0: # Plot only one example
plot_walks(biased_walk, unbiased_walk)
= (
stats_text "Biased Walk - "
f"Avg Distance: {np.mean(biased_distances):.2f}, "
f"Std Dev: {np.std(biased_distances):.2f}"
"\n"
"Unbiased Walk - "
f"Avg Distance: {np.mean(unbiased_distances):.2f}, "
f"Std Dev: {np.std(unbiased_distances):.2f}"
)# Display results
print(stats_text)
create_random_walk_4()
Biased Walk - Avg Distance: 106.75, Std Dev: 14.60
Unbiased Walk - Avg Distance: 19.53, Std Dev: 9.80
Interpretation of Results
Why does the biased random walk drift east? The biased walk has a higher probability (40%) of moving east compared to the other directions (20% each). Over many steps, this bias accumulates, causing the walker to drift significantly in the eastward direction. The longer the walk, the more pronounced this drift becomes.
How does drift affect the standard deviation?
- The average distance from the origin for the biased walk (106.75) is much greater than the unbiased walk (19.53). This is because the consistent eastward bias creates a large displacement.
- The standard deviation of the biased walk (14.60) is also larger than that of the unbiased walk (9.80). This suggests that while most biased walks follow the general eastward trend, there is still some variability due to the random movements in other directions.
- In contrast, the unbiased walk spreads more symmetrically, resulting in a lower average distance and lower standard deviation.
Disclaimer: For information only. Accuracy or completeness not guaranteed. Illegal use prohibited. Not professional advice or solicitation. Read more: /terms-of-service
Reuse
Citation
@misc{kabui2025,
author = {{Kabui, Charles}},
title = {Random {Walk}},
date = {2025-03-17},
url = {https://toknow.ai/posts/computational-techniques-in-data-science/random-walk/index.html},
langid = {en-GB}
}