Workspace Architecture

MolExp organizes experiments in a four-tier hierarchy:

Workspace
└── Project
    └── Experiment      (parameter-space container + replica config)
        └── Run          (one execution attempt; re-runs append ExecutionRecords)

Every persistent byproduct — imported data, task artifacts, logs, checkpoints, error traces, workflow execution state — is a typed Asset subclass recorded in a per-scope assets.json manifest and indexed by the workspace catalog at catalog/index.sqlite. Every metadata write is atomic (temp-file + os.rename), so a crash never leaves a half-written JSON file.
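
The temp-file-then-rename pattern mentioned above can be sketched in a few lines of standard-library Python. This is an illustration of the general technique, not molexp's actual implementation: the helper name `atomic_write_json` is hypothetical, and the sketch uses `os.replace` (which behaves like `os.rename` on POSIX and also overwrites an existing target on Windows).

```python
import json
import os
import tempfile
from pathlib import Path


def atomic_write_json(path, payload):
    """Write payload as JSON so readers see either the old or the new file."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    # The temp file lives next to the target: a rename is only atomic when
    # source and destination are on the same filesystem.
    fd, tmp_name = tempfile.mkstemp(
        dir=path.parent, prefix=path.name + ".", suffix=".tmp"
    )
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as handle:
            json.dump(payload, handle, indent=2, sort_keys=True)
            handle.flush()
            os.fsync(handle.fileno())  # push the bytes to disk before the swap
        os.replace(tmp_name, path)  # atomic swap of the directory entry
    except BaseException:
        # Remove the temp file if anything failed before the swap.
        if os.path.exists(tmp_name):
            os.unlink(tmp_name)
        raise
```

A crash before `os.replace` leaves at most a stray `.tmp` file; the target itself is never observed half-written.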

Workspace is the bottom of the molexp dependency DAG. It owns filesystem layout, atomic JSON, content-addressed assets, and generic per-kind subsystem storage, and it knows nothing about workflows, sessions, agents, or LLMs. Higher layers (workflow, agent) reach down into workspace's public surface; the reverse import direction is forbidden, and the import-guard test enforces it.
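
An import guard of the kind described above can be approximated by statically scanning a package for forbidden imports. The sketch below is an assumption about how such a test could work, not the project's real test: `forbidden_imports` is a hypothetical helper, `molexp.agent` is an assumed module name, and relative imports are deliberately ignored.

```python
import ast
import tempfile
from pathlib import Path


def forbidden_imports(package_dir, forbidden_prefixes):
    """Return (file name, module) pairs for imports that break the layering."""
    violations = []
    for source in sorted(Path(package_dir).rglob("*.py")):
        tree = ast.parse(source.read_text(encoding="utf-8"))
        for node in ast.walk(tree):
            if isinstance(node, ast.Import):
                modules = [alias.name for alias in node.names]
            elif isinstance(node, ast.ImportFrom) and node.module and node.level == 0:
                modules = [node.module]  # absolute "from x import y" only
            else:
                continue
            for module in modules:
                if any(module == p or module.startswith(p + ".")
                       for p in forbidden_prefixes):
                    violations.append((source.name, module))
    return violations


# Demo on a throwaway package: one clean module, one that leaks upward.
with tempfile.TemporaryDirectory() as tmp:
    pkg = Path(tmp)
    (pkg / "storage.py").write_text("import json\n")
    (pkg / "leaky.py").write_text("from molexp.workflow import WorkflowRuntime\n")
    violations = forbidden_imports(pkg, ["molexp.workflow", "molexp.agent"])

print(violations)  # [('leaky.py', 'molexp.workflow')]
```

A test suite would point the scan at the workspace package and assert that the returned list is empty.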

Hierarchy Levels

Level	Directory	Metadata file	Purpose
`Workspace`	`<root>/`	`workspace.json`	Top-level container. Materialized explicitly or when the first child is created.
`Project`	`<root>/projects/<project_id>/`	`project.json`	Research-area container (MD, ML training, data pipeline, …).
`Experiment`	`<project>/experiments/<exp_id>/`	`experiment.json`	Concrete parameter set + replica count. Workspace stores the parameter binding; pairing the experiment with a workflow is the caller's concern.
`Run`	`<exp>/runs/run-<run_id>/`	`run.json`	Single execution instance; re-runs append `ExecutionRecord` entries.

Children are not stored as lists in the parent metadata — parents discover children by scanning the filesystem. This keeps writes local and avoids lock contention.
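
Discovering children by scanning, as described above, amounts to listing a container directory and keeping the entries that carry a metadata file. The following is a minimal sketch under that assumption; `discover_children` is a hypothetical helper, not part of the molexp API.

```python
import json
import tempfile
from pathlib import Path


def discover_children(parent_dir, container, metadata_file):
    """Yield (child_id, metadata) for each child directory with a metadata file."""
    base = Path(parent_dir) / container
    if not base.is_dir():
        return  # no children have been created yet
    for child in sorted(base.iterdir()):
        meta_path = child / metadata_file
        if meta_path.is_file():
            yield child.name, json.loads(meta_path.read_text(encoding="utf-8"))


# Demo: two real projects plus a stray directory without metadata.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    for project_id in ("md", "qm9"):
        project_dir = root / "projects" / project_id
        project_dir.mkdir(parents=True)
        (project_dir / "project.json").write_text(json.dumps({"id": project_id}))
    (root / "projects" / "stray").mkdir()
    found = [child_id for child_id, _ in discover_children(root, "projects", "project.json")]

print(found)  # ['md', 'qm9']
```

Because the parent never records its children, creating a project touches only the new project's own directory.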

Why This Design

Scientific workflows tend to run the same pipeline many times with different parameters, then compare outcomes. The Experiment → Run split reflects that:

Experiment is the definition — parameters, replica count, seeds, optional advisory workflow_source / workflow_type strings used by the UI for grouping.
Run is a realization — one execution, one set of concrete parameter values, one outcome.

Each Run captures reproducibility metadata: an opaque workflow_snapshot payload (the canonical typed shape lives in molexp.workflow.WorkflowSnapshotRef; workspace stores it as a JSON dict), the resolved molcfg profile, a config_hash, execution history, error info, and produced artifacts.

Creating a Hierarchy

The hierarchy is created from the top down, but not every step has identical identity rules. ws.project(...) and project.experiment(...) are get-or-create operations keyed by slug or explicit id, so repeated calls can load existing objects from disk. exp.add_run(...) is different: it creates a fresh run unless you provide an explicit id, in which case it becomes a get-or-load operation for that concrete run directory. Runs seeded by exp.run(workflow, params=...) derive their ids from their parameters, so the sweep declaration is idempotent.

import molexp as me

ws = me.Workspace("./lab", name="lab")                    # lightweight object; no files yet
project = ws.project("MD Simulations")                    # materializes workspace.json and project.json
exp = project.experiment(
    "temperature-300K",
    params={"T": 300, "pressure": 1.0},
    n_replicas=3,
    seeds=[42, 43, 44],
)
run = exp.add_run(
    {"T": 300, "pressure": 1.0, "seed": 42},
    id="temperature-300K-seed-42",
)                                                         # materializes run.json

Re-calling the project or experiment factory with the same name or id returns the same in-memory object within the current process and loads from disk when needed. Runs behave that way only when their id is stable: either passed explicitly to exp.add_run(...) or derived from the parameters by exp.run(...).
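
The get-or-create behaviour described above (same object within a process, load from disk otherwise) can be sketched with a small cache in front of the filesystem. Everything here is illustrative: `ProjectStore` and `slugify` are hypothetical names, and the slug rule is an assumption rather than molexp's actual one.

```python
import json
import re
import tempfile
from pathlib import Path


def slugify(name):
    """Assumed slug rule: lowercase, runs of non-alphanumerics become '-'."""
    return re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")


class ProjectStore:
    def __init__(self, root):
        self.root = Path(root)
        self._cache = {}  # slug -> metadata, shared within this process

    def project(self, name):
        slug = slugify(name)
        if slug in self._cache:  # 1. same in-memory object
            return self._cache[slug]
        meta_path = self.root / "projects" / slug / "project.json"
        if meta_path.is_file():  # 2. load an existing project from disk
            meta = json.loads(meta_path.read_text(encoding="utf-8"))
        else:  # 3. create it on first use
            meta_path.parent.mkdir(parents=True, exist_ok=True)
            meta = {"id": slug, "name": name}
            meta_path.write_text(json.dumps(meta), encoding="utf-8")
        self._cache[slug] = meta
        return meta


with tempfile.TemporaryDirectory() as tmp:
    store = ProjectStore(tmp)
    first = store.project("MD Simulations")
    again = store.project("md simulations")  # same slug, same object
    reloaded = ProjectStore(tmp).project("MD Simulations")  # fresh process view

print(first is again, reloaded == first)  # True True
```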

Parameter Combinations

GridSpace and UniformSpace generate parameter combinations. Experiment.run(workflow, params=...) accepts a space (or a plain {axis: [values]} grid mapping) directly and materializes one content-addressed Run per cell:

from molexp import GridSpace

grid = GridSpace({"T": [300, 310, 320], "force_field": ["amber", "charmm"]})

# `compiled` is a CompiledWorkflow, built as shown under "Pairing an Experiment with a Workflow".
exp = project.experiment("md-sweep").run(compiled, params=grid)
print(len(exp.list_runs()))  # 6 — one per grid cell
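
To make the "one content-addressed Run per cell" idea concrete, the sketch below expands a grid with `itertools.product` and derives a deterministic id from each cell's parameters. It is a stand-in for illustration only: `expand_grid` and `params_id` are hypothetical helpers, and the hashing scheme (SHA-256 over canonical JSON, truncated to 12 hex characters) is an assumption, not molexp's documented id format.

```python
import hashlib
import itertools
import json


def expand_grid(axes):
    """Return one parameter dict per cell of the Cartesian product."""
    names = sorted(axes)
    return [
        dict(zip(names, combo))
        for combo in itertools.product(*(axes[name] for name in names))
    ]


def params_id(params, length=12):
    """Derive a stable id from parameters, independent of key order."""
    canonical = json.dumps(params, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:length]


cells = expand_grid({"T": [300, 310, 320], "force_field": ["amber", "charmm"]})
ids = {params_id(cell) for cell in cells}
print(len(cells))  # 6
```

Because the id depends only on the parameter values, declaring the same sweep twice maps every cell to the same id, which is what makes such a declaration idempotent.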

UniformSpace(param_values, n_samples, seed=None) samples n_samples combinations uniformly at random, which is useful when the full grid is too large to enumerate.
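
Uniform sampling over a discrete parameter space can be sketched as an independent uniform choice per axis. This is an assumption about the general technique, not UniformSpace's implementation: `sample_uniform` is a hypothetical helper, and it samples with replacement, so duplicate combinations are possible.

```python
import random


def sample_uniform(param_values, n_samples, seed=None):
    """Draw n_samples combinations, each axis chosen uniformly at random."""
    rng = random.Random(seed)  # private generator; a fixed seed repeats the draw
    names = sorted(param_values)
    return [
        {name: rng.choice(param_values[name]) for name in names}
        for _ in range(n_samples)
    ]


space = {"T": [300, 310, 320], "force_field": ["amber", "charmm"]}
samples = sample_uniform(space, n_samples=4, seed=0)
print(len(samples))  # 4
```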

Pairing an Experiment with a Workflow

Workspace itself stores no workflow-shaped types. The association is declared through Experiment.run(workflow, params=...), which records the workflow's graph IR on the experiment and binds the live CompiledWorkflow in the workflow layer's default_binding_registry:

from molexp.workflow import WorkflowCompiler, WorkflowRuntime

compiled = WorkflowCompiler(name="train").add(TrainTask()).compile()
exp = project.experiment("baseline").run(compiled, params={"lr": [1e-3]})

# Workspace just provides the Run the workflow executes within.
# The await below needs an async context (an async function, or a REPL with top-level await).
run = exp.list_runs()[0]
with run.start() as ctx:
    result = await WorkflowRuntime().execute(compiled, run_context=ctx)

This decoupling came out of the 2026-05-09 rectification: workspace stays a storage primitive, workflow stays a graph engine, and the cross-layer seam (molexp.entry) wires Experiment.run to the binding registry without workspace ever importing the workflow layer.

Executing a Run

with run.start() as ctx:
    result = await WorkflowRuntime().execute(compiled, run_context=ctx)

Entering run.start() opens a RunContext which:

Ensures artifacts/, logs/, and .ckpt/ subdirectories exist.
Records temporary ownership metadata for the active execution.
Appends a new ExecutionRecord to run.execution_history.
Sets run.status = "running" for the duration of the with block, inside which the caller executes the workflow.
On success, writes status="succeeded" plus the final timestamp.
On failure, writes status="failed", an ErrorInfo, and registers an ErrorTraceAsset pointing at executions/<exec_id>/error.txt.

Every attempt appears in run.metadata.execution_history, newest last — a run that was retried twice will have three records.
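
The lifecycle above (append a record, mark the run as running, then record success or failure on exit) maps naturally onto a context manager. The sketch below models a run as a plain dict; the field names, the `exec-N` id format, and the initial "pending" status are assumptions made for illustration and do not reflect the real RunContext.

```python
import traceback
from contextlib import contextmanager
from datetime import datetime, timezone


@contextmanager
def start(run):
    """Track one execution attempt on a dict-shaped run record."""
    record = {
        "exec_id": f"exec-{len(run['execution_history']) + 1}",
        "started_at": datetime.now(timezone.utc).isoformat(),
        "status": "running",
    }
    run["execution_history"].append(record)  # every attempt is kept, newest last
    run["status"] = "running"
    try:
        yield record
    except Exception as exc:
        record["status"] = run["status"] = "failed"
        record["error"] = {"type": type(exc).__name__, "message": str(exc)}
        record["traceback"] = traceback.format_exc()
        raise  # the caller still sees the original exception
    else:
        record["status"] = run["status"] = "succeeded"
    finally:
        record["finished_at"] = datetime.now(timezone.utc).isoformat()


run = {"status": "pending", "execution_history": []}
for attempt in range(3):
    try:
        with start(run):
            if attempt < 2:
                raise RuntimeError("simulated crash")
    except RuntimeError:
        pass  # a real driver would decide here whether to retry

statuses = [rec["status"] for rec in run["execution_history"]]
print(statuses)  # ['failed', 'failed', 'succeeded']
```

Two retries after the first failure leave three records in the history, matching the count stated above.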

Assets

Every scope exposes a typed assets view into the workspace catalog and a data_assets library for importing external data. Run-time writes (artifacts, logs, checkpoints) go through the RunContext accessors, which keep the on-disk manifest and the catalog in sync.

ws.data_assets.import_asset("bert_model", "/models/bert.pt")
project.data_assets.import_asset("dataset", "/data/qm9.tar.bz2")
exp.data_assets.import_asset("features", "/data/features.h5")

# Driver-side, around the workflow execution
with run.start() as ctx:
    dataset = ctx.find_asset("dataset")       # walks run → experiment → project → workspace
    features = ctx.find_asset("features")
    result = await WorkflowRuntime().execute(compiled, run_context=ctx)
    ctx.artifact.save("metrics.json", result.outputs["train"])
    ctx.log("train").append("epoch 1")
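
The scope walk noted in the comment above (run → experiment → project → workspace) can be pictured as a parent-pointer lookup. This is a minimal sketch with a hypothetical `Scope` class; that the nearest scope wins when names collide is an assumption.

```python
class Scope:
    def __init__(self, name, parent=None):
        self.name = name
        self.parent = parent
        self.assets = {}  # asset name -> location

    def find_asset(self, asset_name):
        """Walk from this scope up to the root and return the first match."""
        scope = self
        while scope is not None:
            if asset_name in scope.assets:
                return scope.name, scope.assets[asset_name]
            scope = scope.parent
        raise KeyError(asset_name)


workspace = Scope("workspace")
project = Scope("project", parent=workspace)
experiment = Scope("experiment", parent=project)
run = Scope("run", parent=experiment)

workspace.assets["bert_model"] = "/models/bert.pt"
project.assets["dataset"] = "/data/qm9.tar.bz2"
experiment.assets["features"] = "/data/features.h5"

print(run.find_asset("dataset"))     # ('project', '/data/qm9.tar.bz2')
print(run.find_asset("bert_model"))  # ('workspace', '/models/bert.pt')
```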

import_asset(name, src, action="copy", meta=None) supports "copy", "move", "symlink", and "hardlink" for ingestion. See Unified Asset Model for the full list of asset kinds (artifact, log, checkpoint, error trace, execution state, output, data) and how the catalog can be rebuilt from disk when needed.
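
The four ingestion actions correspond to familiar filesystem operations. The sketch below shows one plausible mapping onto the standard library; `ingest` is a hypothetical helper, and how import_asset actually lays out the payload on disk is not covered here.

```python
import os
import shutil
import tempfile
from pathlib import Path


def ingest(src, dest, action="copy"):
    """Bring src into dest using one of the four ingestion actions."""
    src, dest = Path(src), Path(dest)
    dest.parent.mkdir(parents=True, exist_ok=True)
    if action == "copy":
        shutil.copy2(src, dest)  # independent copy, metadata preserved
    elif action == "move":
        shutil.move(str(src), str(dest))  # source disappears
    elif action == "symlink":
        os.symlink(src.resolve(), dest)  # pointer; breaks if the source moves
    elif action == "hardlink":
        os.link(src, dest)  # same inode; requires the same filesystem
    else:
        raise ValueError(f"unknown action: {action!r}")
    return dest


with tempfile.TemporaryDirectory() as tmp:
    source = Path(tmp) / "features.h5"
    source.write_bytes(b"demo")
    copied = ingest(source, Path(tmp) / "library" / "features.h5", action="copy")
    copy_ok = copied.read_bytes() == b"demo" and source.exists()

print(copy_ok)  # True
```

Copy is the safest default because the library then owns its payload; the link-based actions save space at the cost of coupling the asset to the original file.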

CLI Surface

The same hierarchy is exposed through the CLI:

molexp project    create|list|info
molexp experiment create|list
molexp runs       create|list|info|cancel|prune
molexp asset      list
molexp info       # show workspace summary

molexp runs prune interactively walks the project → experiment → run → execution tree and lets you delete per-execution records (removes executions/<exec_id>/ and rewrites run.execution_history).
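
Pruning one execution boils down to two steps: delete the attempt's directory and drop its record from the run metadata. The sketch below assumes that run.json holds an `execution_history` list whose entries carry an `exec_id` key; that layout and the `prune_execution` helper are hypothetical, and a real implementation would write the metadata atomically.

```python
import json
import shutil
import tempfile
from pathlib import Path


def prune_execution(run_dir, exec_id):
    """Delete one attempt's directory and drop its record from run.json."""
    run_dir = Path(run_dir)
    meta_path = run_dir / "run.json"
    meta = json.loads(meta_path.read_text(encoding="utf-8"))
    # Assumed layout: a list of records, each carrying an "exec_id" key.
    meta["execution_history"] = [
        rec for rec in meta["execution_history"] if rec["exec_id"] != exec_id
    ]
    shutil.rmtree(run_dir / "executions" / exec_id, ignore_errors=True)
    meta_path.write_text(json.dumps(meta, indent=2), encoding="utf-8")
    return meta


with tempfile.TemporaryDirectory() as tmp:
    run_dir = Path(tmp) / "run-demo"
    for exec_id in ("exec-1", "exec-2"):
        (run_dir / "executions" / exec_id).mkdir(parents=True)
    history = [{"exec_id": "exec-1"}, {"exec_id": "exec-2"}]
    (run_dir / "run.json").write_text(json.dumps({"execution_history": history}))
    remaining = prune_execution(run_dir, "exec-1")
    left_on_disk = sorted(p.name for p in (run_dir / "executions").iterdir())

print(left_on_disk)  # ['exec-2']
```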

Directory Layout

./lab/
├── workspace.json
├── catalog/index.sqlite            # derived workspace-wide catalog
├── assets.json                     # workspace-scoped asset manifest
├── data_assets/<asset_id>/payload/ # imported DataAssets
└── projects/
    └── qm9/
        ├── project.json
        ├── assets.json             # project-scoped asset manifest
        ├── data_assets/            # project-scoped DataAssets
        └── experiments/
            └── baseline/
                ├── experiment.json
                ├── assets.json     # experiment-scoped asset manifest
                ├── data_assets/    # experiment-scoped DataAssets
                └── runs/
                    └── run-<id>/
                        ├── run.json
                        ├── assets.json     # run-scoped asset manifest
                        ├── artifacts/
                        ├── logs/
                        ├── .ckpt/
                        └── executions/<exec_id>/  # per-attempt workflow.json + error.txt

Runnable Example

examples/workspace/workspace_architecture.py seeds a tracked run and then prints the full on-disk tree with file sizes so you can see each layer for yourself.
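
For readers without the repository at hand, a tree printer of the kind that example describes can be sketched as follows. This is not the contents of the example script; `render_tree` is a hypothetical helper that lists directories first and annotates files with their size in bytes.

```python
import tempfile
from pathlib import Path


def render_tree(root, prefix=""):
    """Return the lines of a directory tree, with file sizes in bytes."""
    lines = []
    # Directories first, then files, each group in name order.
    entries = sorted(Path(root).iterdir(), key=lambda p: (p.is_file(), p.name))
    for index, entry in enumerate(entries):
        last = index == len(entries) - 1
        branch = "└── " if last else "├── "
        if entry.is_dir():
            lines.append(f"{prefix}{branch}{entry.name}/")
            lines.extend(render_tree(entry, prefix + ("    " if last else "│   ")))
        else:
            lines.append(f"{prefix}{branch}{entry.name} ({entry.stat().st_size} B)")
    return lines


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "workspace.json").write_text("{}")
    (root / "projects" / "qm9").mkdir(parents=True)
    (root / "projects" / "qm9" / "project.json").write_text("{}")
    tree_lines = render_tree(root)

print("\n".join(tree_lines))
```

Pointing `render_tree` at a real workspace root shows the same layers as the layout diagram above, one metadata file per level.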