researcher / rl agent metacognition

Hong Zhe

scroll

[01]

About

I study emergent metacognition in reinforcement learning agents — whether systems that learn models of their world can also learn to doubt those models. My path from linguistics (Zhejiang University) through data science (CityU Hong Kong) to precision health (NUS) taught me that the most interesting questions live between disciplines.

[02]

Research Vision

24 ongoing and planned projects exploring the computational boundaries of awareness, life, and self-monitoring in artificial systems.

Metacognition self-monitoring & doubt

Artificial Life emergence & boundaries

Consciousness qualia & geometry

Minimal Systems scaling & limits

Collective Mind networks & identity

Bmetacognition

The Boiling Frog Threshold published

When does an RL agent realize its reality has been corrupted?

#01metacognition

Emergent Self-Awareness

Under what conditions does an agent spontaneously begin to model itself?

#02artificial life

The Autonomy Spectrum

Can autonomy be measured continuously, from autocatalytic sets to goal-directed agents?

#03collective

The Dissolution of Individuals

In a multi-agent system, who is the cognitive subject? Are boundaries fluid?

#04metacognition

Geometric Signatures of Metacognition

Does the emergence of self-monitoring create a detectable fold in the representation manifold?

#05artificial life

Topological Phase Transitions in Emergence

Is emergence a continuous process or a topological phase transition?

#06artificial life

The Geometric Definition of Life

What is the topological difference between alive and dead in phase space?

#07consciousness

Fisher Information Geometry of Cognitive Boundaries

What does a unified cognitive agent look like on a statistical manifold?

#08minimal systems

The Fiber Bundle Structure of Agency

Can autonomy be defined as holonomy in a fiber bundle over behavior space?

#09artificial life

The Algorithmic Life Band

Do living entities occupy a specific band of Kolmogorov complexity?

#10minimal systems

The Emergence of Stupidity

When you systematically scale down, which capabilities anomalously persist or even strengthen?

#11metacognition

Prediction Error Geometry

Does the topology of internal representations limit the horizon of prediction?

#12metacognition

The Doomsday Probability Machine

Can a system detect the meta-signal that its own predictions are failing?

#13metacognition

A Computational Theory of Subjective Time

What is the relationship between information processing density and subjective time?

#14consciousness

Color Qualia as a Metacognitive Litmus Test

Can two agents with different internal color encodings discover that their experiences differ?

#15minimal systems

Undecidable Environments

How does an agent survive in a world whose rules are provably unknowable?

#17collective

The Thermodynamic Cost of Digital Immortality

Can consciousness be losslessly migrated? Is topological loss inevitable?

#18collective

Computational Theodicy

What is the computational complexity of saving the world? When is it impossible even for God?

#19collective

Maniac: Crossing Between Layers

What internal invariants let an agent maintain identity across nested simulations?

#20collective

Phase Transitions of the Network God

Under what conditions does distributed consciousness unify, and when does it fragment?

#21collective

Simulated Millennia

When time is compressed, what information must be lost? How many generations does culture need?

#22consciousness

Qualia Spectrography

Do different survival pressures induce different Riemannian geometries in color space?

#23consciousness

The Multiple Realizability Experiment

Does the same functional state produce the same internal geometry across radically different architectures?

#24artificial life

Geometric Lenia

How does the geometry of space shape the life that emerges within it?

[03]

Featured Work

published arXiv:2603.08455

The Boiling Frog Threshold

Criticality and Blindness in World Model-Based Anomaly Detection Under Gradual Drift

A PPO agent equipped with a learned world model monitors its own prediction error while its observations are slowly corrupted. A sharp phase transition emerges: below a critical drift rate, the agent permanently adapts to corrupted reality — it stays asleep. Above it, detection is abrupt — it wakes up. Sinusoidal drift is universally invisible across all detector families, revealing a fundamental limit of prediction-error-based self-monitoring.

Detection rate vs. drift intensity showing sharp sigmoid threshold across four MuJoCo environments

Detection rate vs. drift intensity across four MuJoCo environments. The sigmoid transition is universal.

Read on arXiv →

[04]

Background

Education

2024–26

M.Sc. Precision Health and Medicine — National University of Singapore Full Scholarship

2020–21

M.Sc. Data Science — City University of Hong Kong

2015–19

B.A. English Language — Zhejiang University Waseda Exchange 2018–19

Papers

2026

The Boiling Frog Threshold: Criticality and Blindness in World Model-Based Anomaly Detection Under Gradual Drift. Hong Zhe. arXiv:2603.08455. [pdf]

2025

Personalized Risk Evaluation and secondary Vascular EveNT prevention with A Genomics-AI Network (PREVENT-AGAIN).

2020

Environmental models for predicting habitat of the Indo-Pacific humpback dolphins in Fujian, China. Aquatic Conservation: Marine and Freshwater Ecosystems, 30(4).

2020

The density, ranging pattern and suitable habitat prediction of seabirds in the northern Beibu Gulf, China. Pakistan J. Zool.

2019

The bacterial diversity in infected tissue pus of an East Asian finless porpoise. African Journal of Microbiology Research.

Languages: Chinese (native), English (fluent), Japanese (basic)

[05]

Contact

I am actively seeking PhD opportunities in Tokyo, working at the intersection of reinforcement learning, artificial life, and consciousness. If our research interests overlap, I would welcome the chance to talk.

e1324318@u.nus.edu