Posts tagged with mlp
Why it matters: Engineers face increasing data fragmentation across SaaS silos. This post details how to build a unified context engine using knowledge graphs, multimodal processing, and prompt optimization (DSPy) to enable effective RAG and agentic workflows over proprietary enterprise data.
- Dropbox Dash functions as a universal context engine, integrating disparate SaaS applications and proprietary content into a unified searchable index.
- The system utilizes custom crawlers to navigate complex API rate limits, diverse authentication schemes, and granular permission systems (ACLs).
- Content enrichment involves normalizing files into markdown and using multimodal models for scene extraction in video and transcription of audio.
- Knowledge graphs are employed to map relationships between entities across platforms, providing deeper context for agentic queries.
- The engineering team leverages DSPy for programmatic prompt optimization and 'LLM as a judge' frameworks for automated evaluation (a minimal DSPy sketch follows this list).
- The architecture explores the Model Context Protocol (MCP) to standardize how LLMs interact with external data sources and tools.
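As a concrete illustration of the DSPy point above, the sketch below declares a grounded-QA signature, uses a second model call as an 'LLM as a judge' metric, and compiles the program with a few-shot optimizer. The signature, metric, and training example are assumptions for illustration, not Dropbox's actual pipeline.

```python
import dspy
from dspy.teleprompt import BootstrapFewShot

# Placeholder model; any LiteLLM-style identifier works here.
dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))

class AnswerFromContext(dspy.Signature):
    """Answer a question using context retrieved from the unified index."""
    context = dspy.InputField(desc="snippets retrieved from connected SaaS sources")
    question = dspy.InputField()
    answer = dspy.OutputField(desc="concise answer grounded in the context")

qa = dspy.ChainOfThought(AnswerFromContext)

# 'LLM as a judge': a second model call scores whether the answer is grounded.
judge = dspy.Predict("question, answer, context -> is_grounded: bool")

def grounded_metric(example, prediction, trace=None):
    verdict = judge(question=example.question,
                    answer=prediction.answer,
                    context=example.context)
    return bool(verdict.is_grounded)

# Tiny illustrative training set; a real one would come from labeled traffic.
trainset = [
    dspy.Example(context="Q3 revenue was $12M per the board deck.",
                 question="What was Q3 revenue?",
                 answer="$12M").with_inputs("context", "question"),
]

# Programmatic prompt optimization: bootstrap few-shot demos that pass the judge.
compiled_qa = BootstrapFewShot(metric=grounded_metric).compile(qa, trainset=trainset)
```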
Why it matters: Translating natural language to complex DSLs reduces friction for subject matter experts interacting with massive, federated datasets. This approach bridges the gap between intuitive human intent and rigid technical schemas, improving productivity across hundreds of enterprise applications.
- Netflix is evolving its Graph Search platform to support natural language queries using Large Language Models (LLMs).
- The system translates ambiguous user input into a structured Filter Domain-Specific Language (DSL) for federated GraphQL data.
- Accuracy is maintained by ensuring syntactic, semantic, and pragmatic correctness through schema validation and controlled vocabularies (sketched after this list).
- The architecture utilizes Retrieval-Augmented Generation (RAG) to provide domain-specific data processing without replacing existing UIs.
- Pre-processing and context engineering are critical to prevent LLM hallucinations and to ensure fields match the underlying index.
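To make the validation step concrete, here is a minimal sketch assuming a toy filter DSL and controlled vocabulary; the field names are hypothetical and stand in for whatever the federated GraphQL schema actually exposes. The point is that an LLM-produced filter is checked for syntactic and semantic correctness before it ever reaches the index.

```python
# Hypothetical schema and controlled vocabulary; the real fields and DSL grammar
# are defined by the federated GraphQL schema, not by this sketch.
SCHEMA = {
    "title_type": {"MOVIE", "SHOW"},
    "launch_country": {"US", "BR", "JP"},
}

def validate_filter(filter_dsl: dict) -> list[str]:
    """Return a list of problems; an empty list means the filter is safe to run."""
    errors = []
    for clause in filter_dsl.get("clauses", []):
        field, value = clause.get("field"), clause.get("value")
        if field not in SCHEMA:                   # syntactic: field must exist in the index
            errors.append(f"unknown field: {field}")
        elif value not in SCHEMA[field]:          # semantic: value must be in the controlled vocabulary
            errors.append(f"invalid value {value!r} for {field}")
    return errors

# Example: an LLM-produced filter containing one hallucinated field.
candidate = {"clauses": [
    {"field": "title_type", "value": "MOVIE"},
    {"field": "release_decade", "value": "1990s"},
]}
print(validate_filter(candidate))  # ['unknown field: release_decade']
```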
Why it matters: GitHub Copilot CLI brings agentic AI to the terminal, bridging the gap between IDEs and system-level tasks. By automating environment setup, debugging, and GitHub interactions via MCP, it significantly boosts developer velocity and reduces the cognitive load of manual CLI operations.
- GitHub Copilot CLI enables agentic AI workflows directly within the terminal, reducing context switching between IDEs and command-line environments.
- The tool automates complex terminal tasks such as repository cloning, dependency management, and process troubleshooting like identifying and killing PIDs.
- It supports multimodal capabilities, allowing users to upload screenshots of UI bugs for automated analysis and suggested code fixes.
- Integration with the Model Context Protocol (MCP) allows the CLI to interact with custom agents for specialized tasks like accessibility reviews or security audits.
- Developers can query GitHub-specific data, such as open issues or PRs, and delegate multi-step tasks to coding agents without leaving the command line.
Why it matters: Maia 200 represents a shift toward custom first-party silicon optimized for LLM inference. It offers engineers high-performance FP4/FP8 compute and a flexible software stack, significantly reducing the cost and latency of deploying massive models like GPT-5.2 at scale.
- Maia 200 is built on a TSMC 3nm process, featuring 140 billion transistors and delivering 10 petaFLOPS of FP4 and 5 petaFLOPS of FP8 performance.
- The memory architecture pairs 216GB of HBM3e at 7 TB/s with 272MB of on-chip SRAM to maximize token generation throughput (see the back-of-envelope sketch after this list).
- It employs a custom Ethernet-based scale-up network providing 2.8 TB/s of bidirectional bandwidth for clusters of up to 6,144 accelerators.
- The software ecosystem includes the Maia SDK with a Triton compiler, PyTorch integration, and a low-level programming language (NPL).
- Engineered for efficiency, it achieves 30% better performance per dollar than existing hardware for models like GPT-5.2 and synthetic data generation.
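A rough back-of-envelope calculation shows why the 7 TB/s HBM3e figure drives token generation throughput. The model size and weight precision below are assumptions for illustration, not Maia 200 benchmarks; single-stream decode on a memory-bandwidth-bound model is limited by how quickly the weights can be streamed per token.

```python
# Back-of-envelope decode throughput for a memory-bandwidth-bound model.
# Model size and weight precision are illustrative assumptions, not measured figures.
hbm_bandwidth = 7e12          # bytes/s, the 7 TB/s HBM3e figure from the post
active_params = 200e9         # assumed 200B active parameters
bytes_per_param = 0.5         # FP4 weights are roughly 0.5 bytes per parameter

bytes_streamed_per_token = active_params * bytes_per_param
tokens_per_second = hbm_bandwidth / bytes_streamed_per_token
print(f"~{tokens_per_second:.0f} tokens/s single-stream upper bound")  # ~70
```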
Why it matters: This article details the architectural shift from fragmented point solutions to a unified AI stack. It provides a blueprint for solving data consistency and metadata scaling challenges, essential for engineers building reliable, real-time agentic systems at enterprise scale.
- Salesforce unified its data, agent, and application layers into the Agentforce 360 stack to ensure consistent context and reasoning across all surfaces.
- The platform uses Data 360 as a universal semantic model, harmonizing signals from streaming, batch, and zero-copy sources into a single pane of glass.
- Engineers addressed metadata scaling by treating metadata as data, enabling efficient indexing and retrieval for massive entity volumes.
- A harmonization metamodel defines mappings and transformations to generate canonical customer profiles from heterogeneous data sources (illustrated after this list).
- The architecture centralizes freshness and ingest control to maintain identical answers across different AI agents and applications.
- Real-time event correlation is optimized to update unified context immediately while balancing storage costs for large-scale personalization.
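The harmonization metamodel can be pictured with a toy mapping: each source system declares how its raw fields project onto a canonical customer profile. The source names and fields below are hypothetical, not Data 360's actual metamodel.

```python
from datetime import datetime, timezone

# Hypothetical mapping metamodel: each source system declares how its raw fields
# project onto the canonical customer profile (all names here are illustrative).
MAPPINGS = {
    "crm":       {"email_addr": "email", "full_name": "name"},
    "ecommerce": {"customer_email": "email", "display_name": "name"},
}

def harmonize(source: str, record: dict) -> dict:
    """Project a raw source record onto the canonical profile schema."""
    mapping = MAPPINGS[source]
    profile = {canonical: record[raw] for raw, canonical in mapping.items() if raw in record}
    profile["source"] = source
    profile["ingested_at"] = datetime.now(timezone.utc).isoformat()
    return profile

print(harmonize("crm", {"email_addr": "a@example.com", "full_name": "Ada Lovelace"}))
```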
Why it matters: Azure Storage is shifting from passive storage to an active, AI-optimized platform. Engineers must understand these scale and performance improvements to architect systems capable of handling the high-concurrency, high-throughput demands of autonomous agents and LLM lifecycles.
- Azure Storage is evolving into a unified platform supporting the full AI lifecycle, from frontier model training to large-scale inferencing and agentic applications.
- Blob scaled accounts now support millions of objects across hundreds of scale units, enabling massive datasets for training and tuning.
- Azure Managed Lustre (AMLFS) has expanded to support 25 PiB namespaces and 512 GBps throughput to maximize GPU utilization in high-performance computing.
- Deep integration with frameworks like Microsoft Foundry, Ray, and LangChain facilitates seamless data grounding and low-latency context serving for RAG architectures (a minimal loader sketch follows this list).
- Elastic SAN and Azure Container Storage (ACStor) are being optimized for 'agentic scale' to handle the high concurrency and query volume of autonomous agents.
- New storage tiers and performance updates, such as Premium SSD v2 and Cold/Archive tiers for Azure Files, focus on reducing TCO for mission-critical workloads.
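On the framework side, a minimal grounding sketch using the community AzureBlobStorageContainerLoader is shown below; the connection string, container name, and embedding model are placeholders, and the load-split-embed pattern here is generic RAG plumbing rather than an Azure reference architecture.

```python
from langchain_community.document_loaders import AzureBlobStorageContainerLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_openai import OpenAIEmbeddings
from langchain_community.vectorstores import FAISS

# Placeholder connection details; real deployments would use managed identity or Key Vault.
loader = AzureBlobStorageContainerLoader(
    conn_str="<AZURE_STORAGE_CONNECTION_STRING>",
    container="training-docs",
)
docs = loader.load()

# Chunk documents so retrieved context fits the model's window.
chunks = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100).split_documents(docs)

# Index chunks for low-latency retrieval during RAG.
index = FAISS.from_documents(chunks, OpenAIEmbeddings())
retriever = index.as_retriever(search_kwargs={"k": 4})
```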
Why it matters: Building agentic workflows is difficult due to the complexity of context management and tool orchestration. This SDK abstracts those infrastructure hurdles, allowing engineers to focus on product logic while leveraging a production-tested agentic loop.
- GitHub released the Copilot SDK in technical preview, enabling developers to embed the Copilot agentic core into custom applications.
- The SDK provides programmatic access to the same execution loop used by Copilot CLI, including planning, tool orchestration, and multi-turn context management (a conceptual sketch follows this list).
- It supports major programming environments including Node.js, Python, Go, and .NET, with built-in support for GitHub authentication.
- Key features include Model Context Protocol (MCP) server integration, custom tool definitions, and real-time streaming capabilities.
- Developers can leverage existing Copilot subscriptions or provide their own API keys to power agentic workflows.
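The post does not reproduce the SDK's classes, so the sketch below is a generic agentic loop in Python that illustrates the planning, tool-orchestration, and multi-turn-context pattern the SDK exposes; every name in it is hypothetical and should not be read as the Copilot SDK's actual API.

```python
from dataclasses import dataclass
from typing import Callable

# Conceptual sketch only: these names are hypothetical and are NOT the Copilot SDK's API.
@dataclass
class Step:
    kind: str                 # "tool_call" or "final"
    content: str = ""
    tool: str = ""
    arguments: dict = None

def run_agent(next_step: Callable, tools: dict, goal: str, max_turns: int = 8) -> str:
    """Plan, call tools, and fold results back into context until the model finishes."""
    context = [{"role": "user", "content": goal}]
    for _ in range(max_turns):
        step = next_step(context)                           # planning: decide the next action
        if step.kind == "final":
            return step.content
        result = tools[step.tool](**(step.arguments or {})) # tool orchestration
        context.append({"role": "tool", "name": step.tool, "content": str(result)})
    raise RuntimeError("agent did not converge within max_turns")

# Toy demo: a scripted "model" that calls one tool, then finishes.
if __name__ == "__main__":
    script = iter([Step(kind="tool_call", tool="list_issues", arguments={"repo": "octo/demo"}),
                   Step(kind="final", content="Found 2 open issues.")])
    tools = {"list_issues": lambda repo: ["#12 flaky test", "#15 docs typo"]}
    print(run_agent(lambda ctx: next(script), tools, goal="Summarize open issues"))
```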
Why it matters: Slash commands transform the Copilot CLI from a chat interface into a precise developer tool. By providing predictable, keyboard-driven shortcuts for context management and model selection, they minimize context switching and improve the reliability of AI-assisted terminal workflows.
- Slash commands provide explicit, repeatable instructions in the GitHub Copilot CLI, reducing the need for complex natural language prompting.
- Commands like /clear and /cwd allow developers to manage conversation history and directory scoping to prevent context bleed.
- The /model command enables switching between different AI models to optimize for speed or reasoning depth based on the task.
- Security and compliance are enhanced through commands like /add-dir and /list-dirs, which define clear boundaries for file access.
- Advanced features include /mcp for connecting Model Context Protocol servers and /delegate for offloading tasks to specialized agents.
- The CLI supports session management and usage tracking via /session and /usage commands to monitor resource consumption.
Why it matters: Triaging security alerts is often manual and repetitive. This framework allows engineers to automate human-like reasoning to filter false positives at scale, combining the precision of CodeQL with the pattern-matching flexibility of LLMs to find real vulnerabilities faster.
- GitHub Security Lab introduced the Taskflow Agent, an open-source framework for automating security research and vulnerability triage using LLMs.
- Taskflows are defined in YAML files, breaking complex audits into smaller, sequential tasks to overcome LLM context window limitations and improve accuracy (a hypothetical taskflow is sketched after this list).
- The framework utilizes Model Context Protocol (MCP) servers to perform conventional programming tasks like file fetching and searching alongside AI reasoning.
- It supports asynchronous batch processing, allowing engineers to apply templated audit logic across numerous CodeQL alerts simultaneously.
- In real-world use, the tool identified roughly 30 vulnerabilities by filtering out the false positives that traditional static analysis struggles to eliminate.
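The post's YAML schema is not reproduced here, so the sketch below shows a hypothetical taskflow as the structure such a file might parse into, plus a stub runner; the task breakdown and field names are assumptions that only illustrate how an audit is split into small sequential steps mixing MCP-style tool calls with model reasoning.

```python
# Hypothetical taskflow shown as the structure a YAML definition might parse into;
# field names and task breakdown are assumptions, not the Taskflow Agent's schema.
taskflow = {
    "name": "sql-injection-triage",
    "tasks": [
        {"id": "fetch_source", "tool": "fetch_file",
         "args": {"path": "src/db/query_builder.py"}},
        {"id": "trace_taint",
         "prompt": "Does user input reach the query without sanitization? Answer yes/no."},
        {"id": "verdict",
         "prompt": "Based on the taint trace, classify the alert as true- or false-positive."},
    ],
}

def call_llm(prompt: str) -> str:
    """Stub standing in for the model call; each task sees a small, focused context."""
    return f"[model response to: {prompt}]"

def call_tool(name: str, args: dict) -> str:
    """Stub standing in for an MCP server doing conventional work (fetching, searching)."""
    return f"[contents of {args.get('path', '?')}]"

def run(flow: dict) -> dict:
    state = {}
    for task in flow["tasks"]:                 # sequential tasks keep context windows small
        if "prompt" in task:
            state[task["id"]] = call_llm(task["prompt"])
        else:
            state[task["id"]] = call_tool(task["tool"], task["args"])
    return state

print(run(taskflow)["verdict"])
```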
Why it matters: Benchmarking AI systems against live providers is expensive and noisy. This mock service provides a deterministic, cost-effective way to validate performance and reliability at scale, allowing engineers to iterate faster without financial friction or external latency fluctuations.
- Salesforce developed an internal LLM mock service to simulate AI provider behavior, supporting benchmarks of over 24,000 requests per minute.
- The service cut annual token-based costs by over $500,000 by replacing live LLM dependencies during performance and regression testing.
- Deterministic latency controls let engineers isolate internal code performance from external provider variability, ensuring repeatable results (a minimal sketch follows this list).
- The mock layer enables rapid scale and failover benchmarking by simulating high-volume traffic and controlled outages without external infrastructure.
- By providing a shared platform capability, the service accelerates development loops and improves confidence in performance signals.
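Salesforce's service is internal, so the snippet below is a minimal sketch of the idea: a mock endpoint that returns a canned completion with deterministic, configurable latency so load tests exercise internal code paths instead of a live provider. The route shape and response fields are assumptions.

```python
import json
import time
from http.server import BaseHTTPRequestHandler, HTTPServer

MOCK_LATENCY_S = 0.25   # deterministic latency, configurable per test scenario
CANNED_COMPLETION = "This is a deterministic mock completion."

class MockLLMHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read the request body (the prompt) just as a real provider would receive it.
        body = self.rfile.read(int(self.headers.get("Content-Length", 0)))
        time.sleep(MOCK_LATENCY_S)              # simulate provider latency deterministically
        payload = json.dumps({"completion": CANNED_COMPLETION,
                              "prompt_bytes": len(body)}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)

if __name__ == "__main__":
    # Point load tests at http://localhost:8080 instead of a live LLM provider.
    HTTPServer(("0.0.0.0", 8080), MockLLMHandler).serve_forever()
```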