Skip to content

Temperature Profiles

Four Cooling Schedules for Different Problem Types

The temperature profile determines how quickly GESA transitions from exploration to exploitation. Not all problems have the same cooling schedule.


The Base Formula

All profiles share the same cooling formula:

Temperature(t) = T₀ × α^t

Where:
  T₀  = initial temperature (always 100)
  α   = cooling rate (the profile variable)
  t   = episode count

The only difference between profiles is α. Small changes in α produce dramatically different behaviour over time.


Profile 1: Fast Cool (Exploitation-First)

α = 0.85   T₀ = 100
Episode CountTemperature
0100.0
544.4
1019.7
203.9
300.8

Use when:

  • The problem domain is well-understood
  • Speed of convergence matters more than global optimality
  • You have prior knowledge about what works

Examples:

  • Browser automation task with a known, stable page structure
  • Repeated report generation for a known client context
  • Optimising a content format that has proven successful

Caution: Fast cooling risks premature convergence. If the optimal strategy hasn't been explored in the first 10–15 episodes, the system may lock into a suboptimal local solution.


Profile 2: Standard Cool (Balanced)

α = 0.95   T₀ = 100
Episode CountTemperature
0100.0
1059.9
2035.8
507.7
1000.6

Use when:

  • Mixed exploration and exploitation are needed
  • Domain is partially understood but not fully mapped
  • Default profile for new deployments

Examples:

  • Content optimisation over a content calendar
  • Sprint-level team intervention planning
  • Trading signal calibration for a familiar market

Profile 3: Slow Cool (Exploration-First)

α = 0.99   T₀ = 100
Episode CountTemperature
0100.0
2081.8
5060.5
10036.6
20013.4
3004.9

Use when:

  • Problem space is unknown or highly variable
  • Premature convergence risk is high
  • You need to genuinely explore before you can exploit

Examples:

  • New market entry strategy
  • Novel product launch
  • Unfamiliar team dynamics or new organisation
  • Any domain where past episodes from other contexts may not transfer

Note: Slow cool requires patience. The system will appear less decisive early on — this is by design. It is exploring the space before committing.


Profile 4: Adaptive Cool (Self-Tuning)

α = f(episode_variance, drift_trajectory)

The most sophisticated profile. Cooling rate adapts dynamically based on what the episode history reveals:

If variance(recent_outcomes) > threshold:
    α ← max(α - 0.02, 0.90)   // Slow down cooling: outcomes unpredictable
Else:
    α ← min(α + 0.01, 0.99)   // Speed up cooling: outcomes converging

Use when:

  • Sufficient episode history exists to activate (minimum ~30 episodes)
  • The problem domain has variable predictability
  • You want the system to self-calibrate its exploration/exploitation balance

Activation requirement: Adaptive Cool requires enough historical variance data to be meaningful. Below 20 episodes, the system defaults to Standard Cool until the episode store is populated.

Gap velocity integration: Adaptive Cool also reads DRIFT trajectory:

  • Negative gap velocity (gap closing) → allow faster cooling
  • Positive gap velocity (gap widening) → force slower cooling, increase exploration

Choosing a Profile

ScenarioRecommended Profile
First deployment, domain unknownSlow Cool (0.99)
Domain partially mappedStandard Cool (0.95)
Well-known domain, speed mattersFast Cool (0.85)
Enough history, variable domainAdaptive Cool
Recovering from major context shiftReset to Slow Cool

Resetting Temperature

Temperature can be manually reset in two circumstances:

  1. Major context shift — A significant change in the underlying system (new team, new market, new product) that makes historical episodes less relevant
  2. Episode store cleared — If episodes are purged beyond the minimum threshold

Reset procedure: set T_current = T₀ = 100 and resume from the appropriate profile.

Resets should be rare. The episode decay mechanism (EpisodeWeight = BaseWeight × e^(-age/τ)) naturally down-weights stale episodes without requiring a full temperature reset.


The StratIQX Temperature Map

The four-tier depth system in StratIQX is a production temperature schedule with a pricing model:

Quick       → α = 0.85 equivalent (exploitation) → $1.25K–$2.5K
Standard    → α = 0.95 equivalent (balanced)     → $5K–$10K
Comprehensive → α ≥ 0.99 equivalent (exploration) → $15K–$37.5K
Enterprise  → Maximum exploration                 → $50K–$100K

Higher temperature = wider search space = higher value = higher price. This is the annealing schedule commercialised.


→ Integration Overview