Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.mirage.strukto.ai/llms.txt

Use this file to discover all available pages before exploring further.

The Disk resource mounts a local directory at some prefix such as /data/. All operations are backed by real files on disk. Path resolution validates against the root boundary to prevent directory traversal escapes.

Config

from mirage import MountMode, Workspace
from mirage.resource.disk import DiskResource

resource = DiskResource(root="/path/to/dir")
ws = Workspace({"/data": resource}, mode=MountMode.READ)
DiskResource(root=...) takes a single root path argument pointing to the directory to mount. Both READ and WRITE modes are supported.

Filesystem Layout

The Disk resource mirrors the structure of the root directory. For example, if root="/srv/files" contains:
/srv/files/
  notes.txt
  config.json
  reports/
    q1.csv
    q2.csv
Then mounting at /data/ exposes:
/data/
  notes.txt
  config.json
  reports/
    q1.csv
    q2.csv
Paths like ../../etc/passwd are rejected - resolution is always confined to the root boundary.

Cache

The Disk resource uses IndexCacheStore with _index_ttl = 60 (1 minute). Directory listings are cached for up to 60 seconds before being refreshed from disk.

Example

import asyncio
import shutil
import tempfile
from pathlib import Path

from mirage import MountMode, Workspace
from mirage.resource.disk import DiskResource

DATA_DIR = Path("/path/to/files")

tmp = tempfile.mkdtemp()
shutil.copytree(DATA_DIR, Path(tmp) / "files", dirs_exist_ok=True)

resource = DiskResource(root=tmp + "/files")


async def main() -> None:
    ws = Workspace({"/data/": resource}, mode=MountMode.READ)

    r = await ws.execute("ls /data/")
    print(await r.stdout_str())

    r = await ws.execute("cat /data/example.json")
    print(await r.stdout_str())

    r = await ws.execute("tree /data/")
    print(await r.stdout_str())

    r = await ws.execute("find /data/ -name '*.json'")
    print(await r.stdout_str())

    r = await ws.execute("grep example /data/example.json")
    print(await r.stdout_str())

    r = await ws.execute("stat /data/example.json")
    print(await r.stdout_str())


if __name__ == "__main__":
    asyncio.run(main())

Shell Commands

The Disk resource supports the full set of shell commands since it operates on real file content (text, binary, JSON, CSV, etc.):

Read Commands

CommandNotes
catRead file content
head / tailFirst/last N lines
grep / rgPattern search (file or directory level)
jqQuery JSON fields
wcLine/word/byte counts
statFile metadata (name, size, type, modified)
findRecursive search with -name, -maxdepth
treeDirectory tree view
nlNumber lines
duDisk usage summary
fileDetect file type
stringsExtract printable strings from binary
xxdHex dump
md5MD5 checksum
sha256sumSHA-256 checksum

Text Processing

CommandNotes
awkPattern scanning and processing
sedStream editor
trTranslate or delete characters
sortSort lines
uniqRemove duplicate lines
cutExtract fields/columns
joinJoin lines on a common field
pasteMerge lines side by side
columnColumnate output
foldWrap lines to a specified width
expandConvert tabs to spaces
unexpandConvert spaces to tabs
fmtSimple text formatter
revReverse lines
tacConcatenate and print in reverse
lookDisplay lines beginning with a given string
shufShuffle lines
tsortTopological sort
commCompare two sorted files
cmpCompare two files byte by byte
diffCompare files line by line
patchApply a diff patch
iconvCharacter encoding conversion

File Operations

CommandNotes
cpCopy files
mvMove/rename files
rmRemove files
mkdirCreate directories
touchCreate empty file or update timestamp
lnCreate symbolic links
teeWrite stdin to file and stdout
mktempCreate temporary file
splitSplit file into pieces
csplitSplit file by context

Path Utilities

CommandNotes
basenameStrip directory from path
dirnameStrip filename from path
realpathResolve path
readlinkPrint symbolic link target
lsList directory contents

Compression

CommandNotes
gzipCompress files
gunzipDecompress gzip files
zipCreate zip archives
unzipExtract zip archives
tarArchive files
zcatCat compressed files
zgrepGrep compressed files

Encoding

CommandNotes
base64Base64 encode/decode

Data Format Support

Commands with format-specific variants for structured data files:
FormatExtensionVariants
Parquet.parquetcat, head, tail, wc, stat, cut, grep, ls, file
Feather.feathercat, head, tail, wc, stat, cut, grep, ls, file
ORC.orccat, head, tail, wc, stat, cut, grep, ls, file
HDF5.hdf5cat, head, tail, wc, stat, cut, grep, ls, file
These variants auto-detect the format by extension and convert to tabular text (CSV) for processing.

Audio Support (Optional)

Audio commands are opt-in and require sherpa-onnx with a Whisper model. They transcribe audio to text, enabling cat, head, tail, grep, and stat on audio files.
FormatExtensionCommands
WAV.wavcat, head, tail, grep, stat
MP3.mp3cat, head, tail, grep, stat
OGG.oggcat, head, tail, grep, stat
To enable, register audio commands manually:
from mirage.commands.audio import AUDIO_COMMANDS
from mirage.commands.audio.utils import configure

configure(model_dir="path/to/sherpa-onnx-whisper-base")

for cmd in AUDIO_COMMANDS:
    ws.register(cmd)

Use Cases

  • Local directory access: Mount local directories for AI agents to read and process
  • Sandboxed file access: Restrict agent file operations to a specific directory tree
  • FUSE mounting: Expose disk files through a virtual FUSE mount for external tools
  • Data pipelines: Process local datasets with shell-like commands
  • Development: Test file operations against real data before deploying to cloud resources