Why it matters: For global-scale perimeter services, traditional sequential rollbacks are too slow. This architecture demonstrates how to achieve 10-minute global recovery through warm-standby blue-green deployments and synchronized autoscaling, ensuring high availability for trillions of requests.

  • Salesforce Edge manages a global perimeter platform handling 1.5 trillion monthly requests across 21+ points of presence.
  • Transitioned from sequential regional rollbacks taking up to 12 hours to a global blue-green model that recovers in 10 minutes.
  • Implemented parallel blue and green Kubernetes deployments to maintain a warm standby fleet capable of immediate full-load handling.
  • Customized Horizontal Pod Autoscalers (HPA) to ensure the inactive fleet scales identically to the active fleet, preventing capacity mismatches.
  • Automated traffic redirection using native Kubernetes labels and selectors instead of external L7 routing tools like Argo.
  • Integrated TCP connection draining and controlled traffic cutover to preserve four-nines availability during global rollback events.
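The label-and-selector cutover described above can be sketched in a few lines. This is an illustrative simulation, not the actual Salesforce implementation: the Service is modeled as a plain dict, and the `fleet` label, service name, and drain window are invented for the example. The key idea is that with both fleets pre-scaled, rollback is just a selector rewrite rather than a redeploy.

```python
# Illustrative sketch: blue-green cutover by flipping a Kubernetes-style
# Service selector, assuming both fleets are already warm (pre-scaled).
# The "fleet" label, service shape, and drain window are hypothetical.

DRAIN_SECONDS = 30  # TCP connection-draining window before retiring old pods


def cutover(service: dict, target_color: str) -> dict:
    """Point the Service at the target fleet by rewriting its label selector."""
    if target_color not in ("blue", "green"):
        raise ValueError(f"unknown fleet color: {target_color}")
    updated = dict(service)
    updated["selector"] = {**service["selector"], "fleet": target_color}
    return updated


svc = {"name": "edge-proxy", "selector": {"app": "edge-proxy", "fleet": "blue"}}
svc = cutover(svc, "green")  # global rollback: new connections now hit green
print(svc["selector"]["fleet"])
```

Because the inactive fleet is kept at identical scale by the synchronized HPAs, this flip is safe to perform globally and in parallel, which is what collapses a 12-hour sequential rollback into minutes.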

Why it matters: This article details the architectural shift from fragmented point solutions to a unified AI stack. It provides a blueprint for solving data consistency and metadata scaling challenges, essential for engineers building reliable, real-time agentic systems at enterprise scale.

  • Salesforce unified its data, agent, and application layers into the Agentforce 360 stack to ensure consistent context and reasoning across all surfaces.
  • The platform uses Data 360 as a universal semantic model, harmonizing signals from streaming, batch, and zero-copy sources into a single pane of glass.
  • Engineers addressed metadata scaling by treating metadata as data, enabling efficient indexing and retrieval for massive entity volumes.
  • A harmonization metamodel defines mappings and transformations to generate canonical customer profiles from heterogeneous data sources.
  • The architecture centralizes freshness and ingest control to maintain identical answers across different AI agents and applications.
  • Real-time event correlation is optimized to update unified context immediately while balancing storage costs for large-scale personalization.
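The harmonization metamodel in the bullets above can be pictured as declarative field mappings applied per source, with the mapped fragments merged into one canonical profile. The sketch below is a minimal assumption-laden stand-in: the source names, field names, and merge policy are invented for illustration, not taken from Data 360.

```python
# Minimal sketch of a harmonization metamodel: declarative per-source field
# mappings that project heterogeneous records onto one canonical profile.
# Source names, fields, and the last-write-wins merge policy are hypothetical.

METAMODEL = {
    "crm":    {"email_addr": "email", "full_name": "name"},
    "stream": {"userEmail": "email", "displayName": "name"},
}


def harmonize(source: str, record: dict) -> dict:
    """Rename a raw record's fields into the canonical vocabulary."""
    mapping = METAMODEL[source]
    return {canon: record[raw] for raw, canon in mapping.items() if raw in record}


def merge_profiles(*partials: dict) -> dict:
    """Fold harmonized fragments into a single customer profile."""
    profile: dict = {}
    for partial in partials:
        profile.update({k: v for k, v in partial.items() if v})
    return profile


a = harmonize("crm", {"email_addr": "ada@example.com", "full_name": "Ada"})
b = harmonize("stream", {"userEmail": "ada@example.com"})
print(merge_profiles(a, b))
```

Keeping the mappings as data (rather than code per source) is what lets the same metamodel drive indexing and retrieval at large entity volumes.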

Why it matters: Securing AI agents at scale requires balancing rapid innovation with enterprise-grade protection. This architecture demonstrates how to manage 11M+ daily calls by decoupling security layers, ensuring multi-tenant reliability, and maintaining request integrity across distributed systems.

  • Salesforce's Developer Access team manages a secure access plane for Agentforce, handling over 11 million daily agent calls across production environments.
  • The architecture utilizes a layered access-control plane that separates authentication at the edge from authorization within the core platform to reduce latency and operational risk.
  • A middle-layer API service acts as a technical control point, ensuring all agentic traffic follows consistent security protocols and cannot bypass protection boundaries.
  • Security invariants include edge-level authentication validation, core-platform-enforced authorization, and end-to-end request integrity using Salesforce-minted tokens.
  • The system is designed to contain multi-tenant blast radius risks, preventing runaway agents or malformed requests from impacting other customers in a shared environment.
  • Strict egress traffic filtering and cross-boundary revalidation are employed to maintain the principle of least privilege across the distributed compute layer.
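The separation of edge authentication from core authorization can be sketched as two independent checks over a signed token. Everything below is invented for illustration: the token format, signing key, and scope names are not the real Salesforce-minted token scheme, only a stand-in showing why the two layers verify different things.

```python
# Hedged sketch: edge layer verifies *who* is calling (token integrity);
# core platform separately enforces *what* they may do (least privilege).
# Token format, key, and scopes are hypothetical stand-ins.

import base64
import hashlib
import hmac
import json

SECRET = b"demo-signing-key"  # stand-in for a platform-minted token key


def mint_token(payload: dict) -> str:
    body = base64.urlsafe_b64encode(json.dumps(payload).encode()).decode()
    sig = hmac.new(SECRET, body.encode(), hashlib.sha256).hexdigest()
    return f"{body}.{sig}"


def edge_authenticate(token: str) -> dict:
    """Edge: validate token integrity before traffic enters the platform."""
    body, sig = token.rsplit(".", 1)
    expected = hmac.new(SECRET, body.encode(), hashlib.sha256).hexdigest()
    if not hmac.compare_digest(sig, expected):
        raise PermissionError("invalid token")
    return json.loads(base64.urlsafe_b64decode(body))


def core_authorize(claims: dict, action: str) -> bool:
    """Core: re-check the claims against the requested action."""
    return action in claims.get("scopes", [])


tok = mint_token({"tenant": "acme", "scopes": ["agent:invoke"]})
claims = edge_authenticate(tok)
print(core_authorize(claims, "agent:invoke"))
```

Because authorization is re-evaluated inside the core rather than trusted from the edge, a compromised or misconfigured edge node cannot widen an agent's privileges, which is the cross-boundary revalidation the bullets describe.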

Why it matters: Benchmarking AI systems against live providers is expensive and noisy. This mock service provides a deterministic, cost-effective way to validate performance and reliability at scale, allowing engineers to iterate faster without financial friction or external latency fluctuations.

  • Salesforce developed an internal LLM mock service to simulate AI provider behavior, supporting benchmarks of over 24,000 requests per minute.
  • The service reduced annual token-based costs by over $500,000 by replacing live LLM dependencies during performance and regression testing.
  • Deterministic latency controls allow engineers to isolate internal code performance from external provider variability, ensuring repeatable results.
  • The mock layer enables rapid scale and failover benchmarking by simulating high-volume traffic and controlled outages without external infrastructure.
  • By providing a shared platform capability, the service accelerates development loops and improves confidence in performance signals.
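The deterministic-latency idea is simple enough to sketch: derive both the reply and the simulated delay from a hash of the prompt, so identical inputs always produce identical outputs and timings. The function name and latency figures below are assumptions for illustration, not the internal service's API.

```python
# Sketch of a deterministic LLM mock: the same prompt always yields the same
# reply and the same simulated latency, so benchmark variance comes only
# from the system under test. Names and latency numbers are hypothetical.

import hashlib
import time


def mock_completion(prompt: str, base_latency_ms: float = 50.0) -> dict:
    digest = hashlib.sha256(prompt.encode()).hexdigest()
    latency_ms = base_latency_ms + int(digest[:4], 16) % 20  # deterministic jitter
    time.sleep(latency_ms / 1000)  # reproduce the delay a live provider would add
    return {"text": f"mock-{digest[:8]}", "latency_ms": latency_ms}


r1 = mock_completion("summarize this case")
r2 = mock_completion("summarize this case")
assert r1 == r2  # repeatable across runs, unlike a live provider
```

Dialing `base_latency_ms` up or down (or raising exceptions for chosen prompts) is how such a mock can also drive failover and outage drills without touching external infrastructure.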

Why it matters: Engineers must evolve recommendation engines from passive click-based tracking to active intent extraction. This shift enables autonomous agents to provide contextually relevant responses in real-time, solving the cold-start problem and handling unstructured data at enterprise scale.

  • Developed 'Understand User Intent' Agentforce action to transform unstructured conversational history into structured JSON intent signals using LLMs.
  • Re-architected personalization systems to prioritize real-time conversational intent over long-term behavioral history for higher relevance.
  • Implemented semantic catalog modeling to solve cold-start problems where historical engagement data is missing.
  • Integrated intent signals into existing Data 360 real-time ingestion pipelines to maintain low latency during agentic interactions.
  • Bridged language gaps by mapping user-specific terminology to standardized catalog metadata across multiple languages.
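The shape of the intent contract above can be illustrated with a tiny stub. In the real action an LLM produces the structured signal; here a keyword lookup stands in, and the synonym table, intent labels, and JSON fields are all invented, so only the unstructured-text-to-JSON shape should be read as representative.

```python
# Illustrative stub: turn conversational history into a structured JSON
# intent signal. An LLM does this in the real action; a keyword lookup
# stands in here. Synonyms, labels, and fields are hypothetical.

import json

# maps user-specific terminology onto standardized catalog metadata
CATALOG_SYNONYMS = {"laptop": "portable-computer", "notebook": "portable-computer"}


def understand_user_intent(messages: list[str]) -> str:
    text = " ".join(messages).lower()
    products = sorted({CATALOG_SYNONYMS[w] for w in text.split() if w in CATALOG_SYNONYMS})
    intent = {
        "intent": "purchase" if "buy" in text else "browse",
        "products": products,
    }
    return json.dumps(intent)


print(understand_user_intent(["I want to buy a notebook"]))
```

Emitting a fixed JSON schema is what lets downstream Data 360 ingestion treat conversational intent like any other real-time signal, and the synonym mapping is where the language-gap bridging happens.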

Why it matters: This migration provides a blueprint for modernizing stateful infrastructure at massive scale. It demonstrates how to achieve engine-level transitions without downtime or application changes while maintaining sub-millisecond performance and high availability.

  • Successfully migrated Marketing Cloud's caching layer from Memcached to Redis Cluster at 1.5M RPS with zero downtime.
  • Implemented a Dynamic Cache Router to enable percentage-based traffic shifts and double-writes for cache warm-up without application code changes.
  • Addressed functional parity risks by standardizing TTL semantics and key-handling behaviors across more than 50 distinct services.
  • Utilized service grouping by key ownership to prevent split-brain scenarios and data inconsistencies during the transition.
  • Maintained strict performance SLAs throughout the migration, sustaining P50 latency near 1ms and P99 latency around 20ms.
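The Dynamic Cache Router pattern can be sketched as a thin shim over two backends: every write goes to both clusters (warming the new one), while reads shift gradually by percentage. The class below is a minimal assumption-based sketch with dicts standing in for Memcached and Redis clients; the real router also has to reconcile TTL semantics and key handling, which this omits.

```python
# Hedged sketch of a percentage-based cache router: double-writes warm the
# new cluster while reads shift gradually. Plain dicts stand in for the
# Memcached and Redis clients; TTL handling is omitted for brevity.

import random


class CacheRouter:
    def __init__(self, old, new, read_pct: float):
        self.old, self.new, self.read_pct = old, new, read_pct

    def set(self, key, value):
        # Double-write keeps the new cluster warm during migration.
        self.old[key] = value
        self.new[key] = value

    def get(self, key):
        # Route a configurable share of reads to the new cluster,
        # falling back to the old one on a miss.
        if random.random() < self.read_pct:
            return self.new.get(key, self.old.get(key))
        return self.old.get(key)


router = CacheRouter(old={}, new={}, read_pct=0.10)
router.set("user:42", "profile-blob")
assert router.get("user:42") == "profile-blob"  # correct from either backend
```

Because the shim lives below the application API, raising `read_pct` from 0 to 100 migrates traffic without any application code changes, and grouping services by key ownership ensures no two routers disagree about where a key's writes land.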

Why it matters: Scaling AI agents to enterprise levels requires moving beyond simple task assignment to robust orchestration. This architecture shows how to manage LLM rate limits and provider constraints using queues and dispatchers, ensuring reliability for high-volume, time-sensitive workflows.

  • Transitioned from a single-agent MVP to a dispatcher-orchestrated multi-agent architecture to support over 1 million monthly outreach actions.
  • Implemented persistent queuing to decouple task arrival from processing, creating a natural buffer for workload spikes and preventing retry storms.
  • Developed a constraint engine to enforce provider-specific quotas and LLM rate limits, ensuring compliance with Gmail and O365 delivery caps.
  • Utilized fairness algorithms like Round-Robin and priority-aware polling to prevent resource monopolization and ensure timely processing of urgent tasks.
  • Adopted a phased scaling strategy to evolve throughput from 15,000 to over 1 million messages monthly through parallel execution across 20 agents.
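The queue-dispatcher-constraint interplay above can be sketched in one loop: drain per-tenant queues round-robin, and let a quota check gate each send. The quota numbers, tenant names, and message tuples below are invented for illustration; real provider caps are per-day and per-mailbox, not per-cycle.

```python
# Sketch of a fair dispatcher: round-robin across tenant queues, with a
# constraint check enforcing per-provider quotas before each send.
# Quotas, tenants, and messages are hypothetical (real caps are per-day).

from collections import deque

PROVIDER_QUOTA = {"gmail": 2, "o365": 3}  # max sends per dispatch cycle


def dispatch_cycle(queues: dict[str, deque]) -> list[tuple[str, str]]:
    sent, used = [], {p: 0 for p in PROVIDER_QUOTA}
    progressed = True
    while progressed:
        progressed = False
        for tenant, q in queues.items():  # round-robin across tenants
            if not q:
                continue
            provider, msg = q[0]
            if used[provider] < PROVIDER_QUOTA[provider]:
                q.popleft()
                used[provider] += 1
                sent.append((tenant, msg))
                progressed = True
    return sent


queues = {"t1": deque([("gmail", "a"), ("gmail", "b"), ("gmail", "c")]),
          "t2": deque([("gmail", "d"), ("o365", "e")])}
print(dispatch_cycle(queues))
```

In this run the gmail quota is exhausted after two sends, so t1's remaining gmail messages stay queued for the next cycle instead of triggering retries: the persistent queue is the buffer, and the interleaving prevents any one tenant from monopolizing a provider's quota.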

Why it matters: Automating incident response at hyperscale reduces human error and cognitive load during high-pressure events. By using AI agents to correlate billions of signals, teams can cut resolution times by up to 80%, shifting from reactive manual triage to proactive, explainable mitigation.

  • Salesforce developed the Incident Command Deputy (ICD) platform, a multi-agent system powered by Agentforce to automate incident response.
  • The system utilizes AI-based anomaly detection across metrics, logs, and traces to replace static thresholds and manual monitoring at hyperscale.
  • ICD unifies fragmented data from observability, CI/CD, and change management systems into a single reasoning surface for AI agents.
  • Agentforce-powered agents automate evidence collection and hypothesis generation, significantly reducing cognitive load for engineers during 3:00 AM incidents.
  • The platform has successfully reduced resolution time for common Severity 2 incidents by 70-80%, with many detected and resolved within ten minutes.

Why it matters: Scaling to 100,000+ tenants requires overcoming cloud provider networking limits. This migration demonstrates how to bypass AWS IP ceilings using prefix delegation and custom observability without downtime, ensuring infrastructure doesn't bottleneck hyperscale data growth.

  • Overcame the AWS Network Address Usage (NAU) hard limit of 250,000 IPs per VPC to support 1 million IPs for Data 360.
  • Implemented AWS prefix delegation, which assigns IP addresses in contiguous 16-address blocks to significantly increase network efficiency.
  • Navigated Hyperforce architectural constraints, including immutable subnet structures and strict security group rules, without altering VPC boundaries.
  • Developed custom observability tools to monitor IP fragmentation and contiguous block availability, filling gaps in native AWS and Hyperforce metrics.
  • Utilized AI-driven validation and phased rollouts to ensure zero-downtime migration for massive Spark-driven data processing workloads.
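The arithmetic behind the prefix-delegation gain can be sketched directly, under the assumption (consistent with the bullets above) that a delegated /28 prefix of 16 contiguous addresses consumes a single network-address unit, whereas individually assigned IPs consume one unit each. The exact NAU accounting rules are an assumption here; consult the AWS documentation before relying on these numbers.

```python
# Back-of-envelope sketch of the prefix-delegation gain, assuming one
# delegated /28 prefix (16 contiguous addresses) costs a single NAU unit
# while individually assigned IPs cost one unit each. Accounting details
# are an assumption for illustration, not authoritative AWS figures.

NAU_LIMIT = 250_000   # the per-VPC ceiling cited in the article
PREFIX_BLOCK = 16     # addresses covered by one delegated /28 prefix

ips_without_delegation = NAU_LIMIT * 1
ips_with_delegation = NAU_LIMIT * PREFIX_BLOCK

print(ips_without_delegation)  # capped at the NAU limit itself
print(ips_with_delegation)     # theoretical ceiling; the 1M IP target fits
```

The catch, and the reason for the custom observability tooling, is that the gain depends on contiguous 16-address blocks actually being available: fragmented subnets erode the multiplier, so block availability has to be monitored as a first-class metric.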

Why it matters: AI tools can boost code output by 30%, but this creates downstream bottlenecks in testing and review. This article shows how to scale quality gates and deployment safety alongside velocity, ensuring that increased speed doesn't compromise system reliability or engineer well-being.

  • Unified fragmented tooling across Java, .NET, and Python using a portfolio approach including Cursor, Windsurf, and Claude Code.
  • Achieved a 30% increase in code production with 85% weekly adoption of AI-assisted development tools among eligible engineers.
  • Mitigated senior engineer bottlenecks by implementing AI-assisted code reviews to handle routine checks and initial analysis.
  • Scaled quality gates by automating test coverage and validation workflows to keep pace with accelerated development cycles.
  • Integrated AIOps and telemetry analysis to maintain high availability and improve incident response across 25 Hyperforce regions.