Curated topic

mlp

Posts tagged with mlp

Cloudflare BlogJun 9, 2026

Defend against frontier cyber models: Cloudflare's architecture as customer zero

Why it matters: AI models now automate exploit generation at scale, making the speed of patching insufficient. Engineers must shift toward resilient architectures that prioritize behavioral scoring and Zero Trust containment over reactive signature-based defenses.

Frontier models like Mythos accelerate vulnerability discovery and exploit chain construction, allowing attackers to move faster than traditional patching cycles.
Cloudflare utilizes its own infrastructure as 'customer zero,' applying its security stack to its own code and employees before external release.
Defense is shifting from static signatures to ML-based scoring, using WAF Attack Score to identify and block mutated, AI-generated payloads in real-time.
Zero Trust principles, including mTLS and microsegmentation, are critical for containing the blast radius when a vulnerability is inevitably exploited.
Visibility across 20% of global web traffic allows Cloudforce One to convert network-wide insights into automated threat intelligence and rapid WAF rule deployment.
The architecture around a vulnerability is more important than the speed of the patch, as AI can bypass signature-based detections through rapid adaptation.

#security #mlp

Read original

Salesforce EngineeringJun 8, 2026

Scaling Zero Copy from 1 Trillion to 120 Trillion Rows with File Federation

Why it matters: Scaling distributed systems to 120 trillion rows requires moving beyond query federation. Adopting a file-based approach with Apache Iceberg eliminates bottlenecks between compute and storage, enabling high-performance AI at petabyte scale without data duplication.

Salesforce evolved its Zero Copy architecture to scale from 1 trillion to 120 trillion rows monthly to support massive AI workloads.
The team transitioned from Query Federation to File Federation to overcome throughput limitations and high compute costs of dual-compute models.
By standardizing on Apache Iceberg, the system enables a single-compute model to operate directly against shared storage across different platforms.
This architectural shift eliminates the need for data movement or duplication while maintaining governance and reducing network latency.
The new approach allows Data 360 and Agentforce to reason across distributed datasets at petabyte scale as if they were centralized.

#data #dist #mlp

Read original

Cloudflare BlogJun 5, 2026

Your AI bill is out of control. Cloudflare can fix it now.

Why it matters: Uncontrolled AI spend is a major challenge for organizations. These tools provide the observability and governance needed to scale AI usage sustainably by offering granular cost attribution and automated guardrails to prevent unexpected bill shock.

Cloudflare AI Gateway now features dollar-based spend limits to prevent budget overages across multiple AI providers.
A new closed beta introduces identity-driven budgets, integrating with Cloudflare Access to attribute costs to specific users or teams.
Dynamic routing allows for automatic fallback to cheaper models once a primary budget threshold is reached, ensuring service continuity.
The gateway provides unified logging and real-time analytics for token counts and costs across OpenAI, Anthropic, Google, and others.
Administrators can define granular policies based on custom attributes, model types, or identity provider (IdP) groups.

#finops #mlp #security

Read original

GitHub EngineeringJun 4, 2026

GitHub Universe is back: All together now, in the agentic era

Why it matters: GitHub Universe 2026 highlights the shift toward agentic workflows, where AI agents become core collaborators in software development. For engineers, it's a chance to move from AI demos to practical, integrated workflows while networking with peers solving similar scale problems.

GitHub Universe 2026 focuses on the 'agentic era,' exploring how AI agents integrate into unified developer workflows.
The event introduces 'Ship & Tell' lightning talks for community members to showcase real-world builds and technical solutions.
Enhanced peer-to-peer learning via the Discussions Lounge allows for small-group deep dives into specific engineering challenges.
The Source expands the event's open-source footprint, facilitating direct interaction with project maintainers.
Speaker After Parties provide a dedicated space for technical follow-up questions and behind-the-scenes insights from keynote sessions.

#culture #mlp #security

Read original

Spotify EngineeringJun 3, 2026

Coding Is No Longer the Constraint: Scaling Developer Experience to Teams and Agents at Spotify

Why it matters: As AI agents become integral to software development, platform engineering must shift from manual coding efficiency to building systems that support hybrid human-AI collaboration, ensuring scalability in complex environments.

Spotify's chief architect argues that writing code is no longer the primary bottleneck in software development.
The focus is shifting toward scaling Developer Experience (DevEx) to support both human teams and AI agents.
Optimizing internal platforms is essential for enabling AI agents like Claude to operate effectively within the codebase.
The strategy involves reducing friction across the entire software lifecycle rather than just individual productivity.
Treating AI agents as first-class citizens requires a rethink of how developer tools and infrastructure are designed.

#culture #mlp #sre

Read original

Salesforce EngineeringJun 3, 2026

How Agentforce Conversation Client Accelerated Accessibility Remediation by 5x Using AI-Driven Workflows

Why it matters: Scaling accessibility across complex UI platforms is traditionally slow and manual. By integrating AI-driven MCP workflows, engineers can automate WCAG remediation, ensuring consistent, framework-aware fixes at 5x speed while maintaining feature delivery velocity.

ACC is a conversational UI platform supporting 2.1 million monthly actions across Salesforce products.
Manual accessibility remediation was unsustainable due to high audit volumes and strict delivery constraints.
The team built an MCP-based platform to automate WCAG analysis and remediation workflows.
The system uses axe-core and structured scoring to provide deterministic context for AI-assisted fixes.
Automated workflows translate accessibility requirements into framework-aware code modifications.
This engineering approach achieved a 5x acceleration in resolving accessibility issues while ensuring consistency.

#frontend #mlp

Read original

GitHub EngineeringJun 2, 2026

GitHub Copilot app: The agent-native desktop experience

Why it matters: This app shifts AI from simple chat prompts to autonomous agents handling complex workflows. By providing isolated environments and visual collaboration tools, it reduces the cognitive load of managing multiple AI-driven tasks while maintaining human oversight and code quality.

GitHub launched the Copilot app, a desktop control center designed to manage multiple AI agents in parallel across connected repositories.
The app utilizes isolated git worktrees for each agent session, allowing concurrent tasks without branch conflicts or manual environment setup.
Agent Merge automates the pull request lifecycle by monitoring CI, addressing failing checks, and responding to reviewer feedback.
Canvases introduce bidirectional work surfaces where developers and agents collaborate on plans, terminal outputs, and dashboards.
Cloud and local sandboxes provide bounded environments for agents to safely execute code, inspect results, and iterate before merging.

#mlp #culture

Read original

Airbnb EngineeringJun 2, 2026

When history fails you, borrow from geography

Why it matters: Traditional forecasting fails during unprecedented shocks. This approach demonstrates how to maintain model accuracy in data-scarce environments by using Bayesian prior propagation and cross-geographic signals, providing a blueprint for handling asynchronous global disruptions.

Airbnb addressed the failure of historical forecasting models during COVID-19 by leveraging sequential geographic recovery patterns instead of relying solely on past local data.
The team identified 'booking lead time'—the ratio of advance booking days relative to 2019—as a high-fidelity signal for tracking market recovery phases.
A Bayesian hierarchical model was implemented where parameters for 'late-reopening' corridors were informed by priors derived from 'early-reopening' corridors.
The system uses similarity weights to propagate information across geographies, allowing the model to 'borrow' data from markets that have already experienced similar recovery shocks.
This approach reduced forecasting errors in data-scarce environments by treating geography as a proxy for time, effectively using one region's present to predict another's future.

#data #mlp

Read original

Dropbox Tech BlogMay 28, 2026

Beyond code generation: rethinking engineering productivity in the age of AI agents

Why it matters: AI tools accelerate coding but can overwhelm CI/CD and review pipelines. This shift from writing code to orchestrating agents requires new platforms and metrics to ensure that increased output actually translates into customer value without breaking engineering systems.

Dropbox is transitioning from AI copilots to autonomous agents that can edit files, run tests, and iterate on failures independently.
Increased code generation speed has shifted engineering bottlenecks downstream to code reviews, CI systems, and release coordination.
Nova, an internal platform, allows engineers to execute scoped tasks via agents and now accounts for approximately 8% of all pull requests.
AI agents are being prioritized for high-toil engineering work such as migrations, flaky test remediation, and dependency updates.
Productivity measurement is evolving from simple output metrics like PR count to holistic signals like review burden and customer impact.
The engineering role is shifting toward intent definition, architectural oversight, and final quality validation rather than manual implementation.

#mlp #culture #sre

Read original

Slack EngineeringMay 28, 2026

Slack AI: The Path to Multi-Cloud

Why it matters: This article provides a blueprint for scaling enterprise LLM infrastructure. It details the transition from manual GPU management to managed services, highlighting how to balance security, cost-efficiency, and reliability through strategic multi-cloud orchestration and capacity forecasting.

Slack transitioned from AWS SageMaker to Amazon Bedrock to reduce operational overhead and address GPU scarcity.
The architecture uses an escrow VPC strategy to maintain a zero-knowledge environment, ensuring data privacy and model security.
Infrastructure evolved from managing raw GPU instances to utilizing Model Units for deterministic throughput and easier scaling.
Slack optimized costs by using Provisioned Throughput for interactive features and On Demand capacity for bursty, scheduled tasks.
A zero-incident migration was achieved through extensive load testing, shadow requests, and gradual feature flag rollouts.
The shift enabled faster adoption of new LLM models, reducing the feature lag previously experienced with custom serving solutions.

#mlp #sre #security

Read original

Page 7 of 38

Prev 1...5 6 7 8 9...38 Next