Concepts

Async Processing

Why ingest returns before processing completes — and how to poll.

Memsy intentionally decouples ingestion from processing. This matters in a few places, so it's worth understanding the model.

What happens when you call ingest()

result = client.ingest(events)
# <-- returns here, usually within ~100ms

The server:

  1. Validates the payload.
  2. Writes the raw events to durable storage.
  3. Returns a list of event_ids.
  4. Enqueues the events for downstream processing (LLM extraction → embedding → indexing).

Steps 1–3 happen synchronously over HTTP. Step 4 happens on the server's worker pool, after your call has already returned.
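The split can be modeled in a few lines. This is a hypothetical sketch of the server's design, not Memsy's actual implementation: steps 1–3 run inline, step 4 is a hand-off to a worker that the caller never waits on.

```python
import queue
import threading
import uuid

work_queue = queue.Queue()

def ingest(events):
    # Steps 1-3: validate, persist, and return IDs synchronously.
    if not events:
        raise ValueError("empty payload")
    event_ids = [str(uuid.uuid4()) for _ in events]
    # (the durable write of raw events would happen here)
    # Step 4: enqueue for the worker pool; the caller does not wait on this.
    for event_id, event in zip(event_ids, events):
        work_queue.put((event_id, event))
    return event_ids

def worker():
    # Runs extraction -> embedding -> indexing after ingest() has returned.
    while True:
        event_id, event = work_queue.get()
        # ... LLM extraction, embedding, indexing ...
        work_queue.task_done()

threading.Thread(target=worker, daemon=True).start()
```

The caller gets its event_ids back as soon as the synchronous half finishes, regardless of how deep the processing queue is.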

Why it's designed this way

  • Applications in the hot path (chat loops, agent runtimes) log events without blocking on LLM latency.
  • Extraction can batch, retry, and prioritize without the client caring.
  • Costs and failures on the extraction side don't punish the caller.

When can I search for a newly ingested event?

Usually within a few seconds, but there's no guarantee. If your UX depends on memory being present immediately after ingest, poll for completion:

import time

result = client.ingest(events)

status = client.status(event_ids=result.event_ids)
while status.pending_ids:
    time.sleep(1.0)
    status = client.status(event_ids=result.event_ids)

print("done:", status.completed_ids)
print("failed:", status.failed_ids)

The status response partitions the IDs you sent into three buckets (Python / Node):

  • completed_ids / completedIds — extracted, embedded, indexed. Searchable now.
  • pending_ids / pendingIds — still in the queue or mid-processing.
  • failed_ids / failedIds — something went wrong during extraction. These won't become memories.

What causes an event to fail?

  • Content rejected by a content filter.
  • An extraction-time LLM call that still failed after exhausting retries.
  • An event that referenced a deleted or quarantined resource.

Failures are rare but not zero. Check failed_ids in your pollers if correctness matters.
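A correctness-minded poller, then, bounds its wait and surfaces failures instead of silently dropping them. The helper below is a sketch built on the status() response shape described above; wait_for_processing and its timeout/interval parameters are our own names, not part of the SDK.

```python
import time

def wait_for_processing(client, event_ids, timeout=30.0, interval=1.0):
    """Poll status() until nothing is pending, then surface failures.

    Hypothetical helper; assumes status() returns an object with
    pending_ids, completed_ids, and failed_ids, as documented above.
    """
    deadline = time.monotonic() + timeout
    while True:
        status = client.status(event_ids=event_ids)
        if not status.pending_ids:
            break
        if time.monotonic() >= deadline:
            raise TimeoutError(f"{len(status.pending_ids)} events still pending")
        time.sleep(interval)
    if status.failed_ids:
        # Failed events will never become memories; handle them explicitly.
        raise RuntimeError(f"extraction failed for: {status.failed_ids}")
    return status.completed_ids
```

Raising on failed_ids is a deliberate choice: a poller that only checks pending_ids will happily report success for events that were dropped.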

Design implications

Warning

Don't block your request pipeline on status(). Use it only when your next action requires the memory to be present.

  • Re-ingesting an event is safe within a 60-second window — Memsy de-duplicates by (actor_id, session_id, kind, content) and returns the original event_id so status() keeps working. Outside the window, identical payloads will create separate events; use status() to confirm processing instead of re-sending.
  • Do batch aggressively. ingest() accepts many events per call; fifty events in one batch is much cheaper than fifty single-event calls.
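The batching advice is easy to apply with a small chunking helper. This is a generic sketch, not an SDK utility; the batch size of 50 echoes the example above, and the commented client.ingest loop assumes the API shown earlier.

```python
def batches(events, size=50):
    """Split a list of events into ingest-sized batches."""
    return [events[i:i + size] for i in range(0, len(events), size)]

# One call per batch instead of one call per event:
# for batch in batches(all_events):
#     client.ingest(batch)
```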

Next