AI systems · Architecture

AI OS architecture for emerging markets: what actually changes

Context is not decoration. Context is the architecture. Here are the five dimensions that shift when you move from the Foundry context to a rural Philippine bank.

WritingMay 28, 202613 min read

Context is the architecture

Most people building enterprise AI in Southeast Asia copy what they see working in the US. They take Foundry's architecture, or ServiceNow's, or Salesforce's, and try to drop it into a Philippine bank or an Indonesian logistics company. Then they wonder why it breaks. It breaks because context is not decoration. Context is the architecture.

The architecture decisions Palantir made when designing Foundry were correct given their constraints: large US defense and enterprise clients, massive structured data lakes, dedicated engineers living on-site, and annual contracts that start at $500,000 USD. When those constraints change, some decisions that were correct become wrong. Not slightly suboptimal. Structurally wrong.

I have stood up enough AI OS deployments across Philippine microfinance, BPOs, rural banking, and government-adjacent operations to know exactly which five dimensions shift when you move from the Foundry context to ours.

1. Data quality and completeness

Foundry's ontology assumes you are working with structured, reasonably clean enterprise data. A Fortune 500 company that has run SAP for fifteen years has decades of structured records. The ontology layer can impose order on data that already has most of the shape right. A Philippine cooperative bank does not have this.

Take an illustrative but representative institution: 23 years old, around ₱2.1 billion in assets. Its loan book lives in an Excel file maintained by one person. Collections records are split between a Filipino-built legacy system from 2009 and handwritten ledger pages scanned into Google Drive. Its customer identity data has records where "Juan dela Cruz," "J. Dela Cruz," and "JUAN DELA CRUZ JR." are three different customers, or might be the same person, depending on which branch opened the account.

An ontology layer for this context cannot assume clean input. It has to do the hard work of tolerance, deduplication, gap-filling, and fallback handling at the ontology level, not as a preprocessing step before the real system starts. The customer object has to know it might be receiving five representations of the same person and resolve them. This changes the object model fundamentally. It gets heavier at the base layer and more permissive about input shape. Foundry assumes the data problem is solved before you arrive. In Philippine deployments, the data problem is 40 percent of the work.

2. Connectivity and uptime assumptions

Foundry assumes an always-on fibre-grade corporate network with a dedicated IT team managing uptime. A rural Philippine bank branch runs on Globe LTE with a router zip-tied to a window. The connection drops three times a day, sometimes for forty minutes during a typhoon or a generator issue. During those forty minutes, the branch still has customers walking in to make payments.

An offline-capable agent for this context needs three things Foundry does not prioritize. Every agent runs in one of two modes: connected, with real-time API calls, or cached, against the last-good-state from the local sync layer, valid for up to 4 hours offline. Local state caching preserves what the agent was doing when connectivity dropped. And a sync model handles partial uploads gracefully when the connection returns. You cannot send a "failed: connection lost" error to a collections agent on the phone with a borrower. The system has to degrade gracefully and recover without data loss or duplicate records on reconnect. Most US platforms have never had to solve this because US enterprise networks do not fail this way.

3. Multi-language identity resolution

A customer record in a Philippine financial institution might have "Maria Santos" at one branch, "MARIA SANTOS" at another, "María Santos" with an accent at a third, and "Maria L. Santos-Reyes" at a fourth if she married. Standard deduplication handles that. Now add Cebuano. Now add Ilocano. Add that "Cruz" in one dialect might be transliterated the same as "Kruz" in informal usage. Add code-switched names: people whose official records are in English but whose community knows them by a Tagalog nickname that appears nowhere in the formal record.

The ontology needs a resolution layer that normalizes name representations across regional languages, transliterations, and spelling variations before attempting to match records. This is not a translation layer. It is a fuzzy-match engine with Philippine cultural knowledge baked in, mapping N representations of a name to a canonical identity. US enterprise ontologies do not have this. Nobody at Palantir has spent time thinking about whether "Buenaventura" and "Buenavntura," a common data-entry typo, represent the same customer. In Philippine deployments we have to.

4. Per-seat cost ceiling

A Foundry enterprise seat costs north of $10,000 USD per year fully loaded. That works in Singapore or Hong Kong where a senior analyst earns $120,000 USD and the seat is a small fraction of labor. A Philippine rural bank branch head earns roughly ₱35,000 per month, about $600 USD. A $10,000 seat is seventeen months of that person's salary. The unit economics collapse. The moment you accept that your cost-per-active-user must be measured in pesos, a cascade of decisions changes.

You use different LLM models, with a routing layer that directs simpler interactions to cheaper models and reserves the expensive one for complex cases.
You use different hosting, routing through Singapore or regional availability zones where compute costs differ from US-region pricing.
You design agents differently. A continuous background agent costs differently than one that fires on explicit invocation; at a peso-denominated cost ceiling, background agents must earn their compute budget.
The UI surface changes. Richer client-side interfaces require more JavaScript, more API calls, more real-time transfers, all of which carry cost. Thin client is not just a bandwidth choice, it is a cost-per-interaction choice.

5. Regulatory surface

"We'll add compliance later" is the single most expensive mistake you can make in a Philippine financial AI deployment. I have watched it happen twice in the last eighteen months. The primary surfaces are the SEC, whose MC 18 (s.2019) governs debt-collection conduct for lending and financing companies, reinforced by RA 11765; BSP, whose Circular 1133 (s.2021) caps interest rates and fees on small short-term consumer loans; the National Privacy Commission under RA 10173, the Data Privacy Act; and the DICT. These are their own bodies with their own audit requirements, reporting formats, and enforcement patterns.

RA 11765 and the SEC MC 18 conduct rules require covered institutions to be able to demonstrate compliance for decisions affecting customer accounts or credit assessments, which in practice means an audit trail. An AI agent that recommends a collections approach, routes a credit decision, or flags a customer for review is making exactly the kind of decision you may later have to defend. The audit log for this is not a log file. It is a structured, queryable record linking the agent action to the input state, the model version, the permission context, and the timestamp, in a format an SEC or BSP examiner can review. You cannot bolt this on afterward. The data model for an auditable decision is different from the data model for an unaudited one.

What Nova AIOS looks like in response

The ontology layer is designed around imperfect input. Every business object has a defined tolerance model specifying which fields are required, which are optional, and which have fallback resolution logic. The customer object includes a name-resolution component that applies fuzzy matching with Philippine language rules before matching or creating records. We also version the ontology, so agents are pinned to the version they were designed for until they are updated. This prevents the "my agent is doing something weird" problem that happens when you mutate the object model while agents are running.

Agents run against the ontology, not raw data. An agent never touches a database directly; it asks the ontology for objects, acts on them, and writes back. Connected mode queries live data. Cached mode operates against a local snapshot from the last sync, good for up to 4 hours offline, with mutating actions queued. Permissions are role-scoped, time-bounded, and audit-logged at the point of grant; the grant history is immutable. Every agent action that could later need to be defended under RA 11765 or the SEC MC 18 conduct rules generates an append-only audit record, stored separately from the operational database, exportable in the format examiners prefer.

The sync layer uses a conflict-resolution model from distributed systems: every ontology write gets a vector clock. When a branch agent reconnects, its local state is compared to the server state to identify conflicts. Resolvable ones merge automatically. Conflicts that need judgment surface to the branch manager as a resolution queue without blocking normal operations. Foundry assumes connectivity. We designed for its absence.

Thin client, fat ontology

There is a unifying principle behind most of these decisions. Thin client, fat ontology. Most US SaaS is the opposite: rich client-side applications handle significant business logic in the browser, with the server as a data store. In a low-bandwidth branch environment, that fails. Pushing 800KB of JavaScript over intermittent LTE is slow, and keeping complex client-side state coherent across interruptions is fragile.

Flip it. Put the business logic server-side, in the ontology and the agents. The client requests a rendered state and displays it, nothing more. This is fast on low-bandwidth connections because round-trips carry rendered state, not raw data. It is resilient because there is no complex client-side state to corrupt. The flip side: your ontology design has to be excellent. A fat ontology resolves ambiguous input, enforces business rules, manages permissions, generates audit records, and caches for offline mode. A poorly designed one breaks every agent that touches it. The investment in ontology design at the start of a deployment is non-negotiable. It is the most important week of the project.

Why constraints make you more robust

Most people reading this will think these are limitations. They are a moat. A system designed for spotty connectivity and multi-language input handles clean corporate environments trivially. Give it always-on fiber and English-only records and it runs without friction. The reverse is not true. A system designed assuming always-on connectivity and clean English data breaks the moment it meets a rural Philippine branch.

Building under tight constraints produces durable systems. It forces you to solve state management under failure, imperfect data handling, multilingual identity resolution, and sub-cent-per-interaction unit economics, problems comfortable architectures never face. Once solved, you have a system that runs anywhere on the spectrum from tight-constraint to no-constraint. The Philippine and Southeast Asian market is not a stepping stone to "real" enterprise AI. It is the engineering school that produces the most durable architecture in the world.

The AI integration playbook: from brief to production in 90 daysMay 29, 2026 · 14 min read Hire a Filipino AI developer or a US agency? An honest breakdownMay 26, 2026 · 11 min read Why mid-size SEA lenders need an AI OS, not another point vendorMay 25, 2026 · 12 min read

Want this working for your business?

We build the automation your team keeps meaning to build, then hand it over running. Book a call and we will map the first working slice.

All writing