mlp

Posts tagged with mlp

Why it matters: This approach enables faster, more cost-effective evaluation of search ranking models in A/B tests. Engineers can detect smaller, more nuanced effects, accelerating product iteration and improving user experience by deploying features with higher confidence.

  • Pinterest uses fine-tuned open-source LLMs to automate search relevance assessment, overcoming the limitations of costly and slow human annotations.
  • The LLMs are trained on a 5-level relevance guideline using a cross-encoder architecture and comprehensive Pin textual features, supporting multilingual search.
  • This approach significantly reduces labeling costs and time, enabling much larger and more sophisticated stratified query sampling designs.
  • Stratified sampling, based on query interest and popularity, ensures sample representativeness and drastically reduces measurement variance.
  • The implementation led to a significant reduction in Minimum Detectable Effects (MDEs) from 1.3-1.5% to ≤ 0.25%, accelerating A/B experiment velocity and feature deployment.
  • Paired sampling and sDCG@K are used to measure the relevance impact of A/B experiments on search ranking.
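The measurement step above can be sketched in a few lines. This is a generic illustration, not Pinterest's code: `dcg_at_k` computes plain DCG@K over graded labels (e.g. 0-4 on a 5-level scale), and `paired_delta_dcg` takes the per-query difference between treatment and control, which is what pairing buys you in variance reduction. The article's sDCG@K presumably adds stratum weighting on top of this building block.

```python
import math

def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the top-k results.

    `relevances` are graded labels (e.g. 0-4 on a 5-level scale),
    in ranked order for one query."""
    return sum(rel / math.log2(rank + 2)
               for rank, rel in enumerate(relevances[:k]))

def paired_delta_dcg(control_runs, treatment_runs, k=10):
    """Mean per-query DCG@k difference over the same query set.

    Pairing the same queries across both arms cancels query-level
    variance, which is what shrinks the detectable effect size."""
    deltas = [dcg_at_k(t, k) - dcg_at_k(c, k)
              for c, t in zip(control_runs, treatment_runs)]
    return sum(deltas) / len(deltas)
```

The per-query deltas (rather than two independent means) are what a paired significance test would then be run on.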

Why it matters: This article details significant AI platform advancements from Microsoft Ignite, offering developers more model choices and improved semantic understanding for building robust, secure, and flexible AI applications and agents.

  • Microsoft Ignite 2025 showcased significant advancements in agentic AI and cloud solutions, emphasizing rapid developer adoption.
  • Microsoft Foundry now integrates Claude models (Sonnet, Opus) alongside OpenAI's GPT, providing developers with diverse model choices for AI application and agent development.
  • This model diversity in Microsoft Foundry offers flexibility, enterprise-grade security, compliance, and governance for building AI solutions.
  • New Microsoft IQ offerings aim to enhance semantic understanding, connecting productivity apps, analytics platforms, and AI development environments.

Why it matters: This move provides a stable, open-source foundation for AI agent development, standardizing how LLMs securely interact with external systems. It resolves critical integration challenges, accelerating the creation of robust, production-ready AI tools across industries.

  • The Model Context Protocol (MCP), an open-source standard for connecting LLMs to external tools, has been donated by Anthropic to the Agentic AI Foundation under the Linux Foundation.
  • MCP addresses the "N×M integration problem" by providing a vendor-neutral protocol, standardizing how AI models communicate with diverse services like databases and CI pipelines.
  • Before MCP, developers faced fragmented APIs and brittle, platform-specific integrations, hindering secure and consistent AI agent development.
  • This transition ensures long-term stewardship and a stable foundation for developers building production AI agents and enterprise systems.
  • MCP's rapid adoption highlights its critical role in enabling secure, auditable, and cross-platform communication for AI in various industries.
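Concretely, MCP messages are JSON-RPC 2.0, and a client's core interaction is discovering tools and invoking them by name. The sketch below builds those two requests; the transport (stdio or HTTP) is out of scope, and the tool name and arguments shown are hypothetical.

```python
import json

def mcp_request(req_id, method, params=None):
    """Build a JSON-RPC 2.0 message of the kind MCP clients send.
    Transport (stdio, streamable HTTP) is omitted in this sketch."""
    msg = {"jsonrpc": "2.0", "id": req_id, "method": method}
    if params is not None:
        msg["params"] = params
    return json.dumps(msg)

# A client discovers tools, then invokes one by name. The same two
# calls work against any MCP server -- that uniformity is what
# collapses the N×M integration problem.
list_tools = mcp_request(1, "tools/list")
call_tool = mcp_request(2, "tools/call", {
    "name": "query_database",          # hypothetical tool name
    "arguments": {"sql": "SELECT 1"},  # hypothetical arguments
})
```

Because every server speaks the same two methods, adding a new tool source means writing one MCP server, not one integration per AI client.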

Why it matters: Engineers can leverage AI for rapid development while maintaining high code quality. This article introduces tools and strategies, like GitHub Code Quality and effective prompting, to prevent "AI slop" and ensure reliable, maintainable code in an accelerated workflow.

  • AI significantly accelerates development but risks generating "AI slop" and technical debt without proper quality control.
  • GitHub Code Quality, leveraging AI and CodeQL, ensures high standards by automatically detecting and suggesting fixes for maintainability and reliability issues in pull requests.
  • Key features include one-click enablement, automated fixes for common errors, enforcing quality bars with rulesets, and surfacing legacy technical debt.
  • Engineers must "drive" AI by providing clear, constrained prompts, focusing on goals, context, and desired output formats to maximize quality.
  • This approach allows teams to achieve both speed and control, preventing trade-offs between velocity and code reliability in the AI era.

Why it matters: This expansion provides engineers with more Azure regions and Availability Zones, enabling highly resilient, performant, and geographically diverse cloud architectures for critical applications and AI workloads.

  • Microsoft is significantly expanding its cloud infrastructure in the US, including a new East US 3 region in Atlanta by early 2027.
  • The East US 3 region will incorporate Availability Zones for enhanced resiliency and support advanced Azure workloads, including AI.
  • Five existing US Azure regions (North Central US, West Central US, US Gov Arizona, East US 2, South Central US) will also gain Availability Zones by 2026-2027.
  • These expansions aim to meet growing customer demand for cloud and AI services, offering greater capacity, resiliency, and agility.
  • The new infrastructure emphasizes sustainability, with the East US 3 region designed for LEED Gold Certification and water conservation.
  • Leveraging Availability Zones and multi-region architectures is highlighted for improving application performance, latency, and overall resilience.

Why it matters: As AI agents become integrated into development, ensuring their output is safe and predictable is critical. This system provides a blueprint for building trust in automated code generation through rigorous feedback loops and validation.

  • Spotify's system focuses on making AI coding agents predictable and trustworthy through structured feedback loops.
  • The architecture ensures that agent-generated code is validated against existing engineering standards and tests.
  • Background agents operate asynchronously to improve code quality without disrupting the primary developer workflow.
  • The framework addresses the challenge of moving from experimental AI generation to production-ready software engineering.
  • Automated verification steps are integrated to prevent the introduction of bugs or technical debt by autonomous agents.
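The validate-and-feed-back loop described above can be sketched generically. This is not Spotify's implementation; it only shows the control flow: generate a candidate, run it through the verification gate, and feed failure reports back into the next attempt so unvalidated code never ships.

```python
def run_agent_with_feedback(generate, validate, max_attempts=3):
    """Feedback loop for an AI coding agent (generic sketch).

    `generate(feedback)` returns a candidate change; `validate`
    returns (ok, report) from tests, linters, or standards checks.
    Only a candidate that passes validation is ever returned."""
    feedback = None
    for _ in range(max_attempts):
        candidate = generate(feedback)
        ok, report = validate(candidate)
        if ok:
            return candidate
        feedback = report  # failures steer the next attempt
    return None  # refuse to ship unvalidated code

# Toy usage: an "agent" that succeeds on its second attempt after
# receiving a failure report.
attempts = iter(["bad code", "good code"])
result = run_agent_with_feedback(
    generate=lambda fb: next(attempts),
    validate=lambda c: (c == "good code", "tests failed"),
)
```

The key property is that the agent's output is gated, not trusted: validation failures become structured feedback rather than merged bugs.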

Why it matters: Achieving sub-second latency in voice AI requires rethinking performance metrics and optimizing every microservice. This article shows how semantic end-pointing and synthetic testing are critical for building responsive, human-like voice agents at scale.

  • Developed the Flash Reasoning Engine to achieve sub-second Time to First Audio (TTFA) for natural, human-fast voice interactions.
  • Optimized the real-time voice pipeline by shaving hundreds of milliseconds from microservices, synchronous calls, and serialization paths.
  • Implemented semantic end-pointing algorithms that use confidence thresholds to distinguish between meaningful pauses and true utterance completion.
  • Created AI-driven synthetic customer testing frameworks to generate repeatable data sets and eliminate noise in performance metrics.
  • Resolved measurement inaccuracies where initial tests incorrectly reported 70-second latencies by focusing on TTFA instead of total output duration.
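The semantic end-pointing idea can be sketched as a two-threshold decision. All parameter names and values here are illustrative, not from the article: a plain voice-activity detector waits a fixed stretch of silence, while a model's confidence that the utterance is semantically complete lets the agent commit after a much shorter pause.

```python
def utterance_complete(silence_ms, completeness_score,
                       base_silence_ms=700, fast_silence_ms=200,
                       confidence_threshold=0.9):
    """Semantic end-pointing sketch (values are assumptions).

    If the model is confident the utterance is complete (e.g. "What's
    my balance?"), commit after a short pause; otherwise fall back to
    the longer VAD-style timeout so mid-sentence pauses aren't clipped.
    The saved silence goes straight into Time to First Audio."""
    if completeness_score >= confidence_threshold:
        return silence_ms >= fast_silence_ms
    return silence_ms >= base_silence_ms
```

In a real pipeline the completeness score would come from a lightweight model running on streaming ASR output, so the check is cheap enough to run on every frame of silence.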

Why it matters: This system provides real-time, statistically robust insights into content safety, enabling platforms to proactively identify and mitigate harms. It's crucial for maintaining user trust and scaling content moderation efficiently with AI.

  • Pinterest developed an AI-assisted system to measure "prevalence" of policy-violating content, focusing on the percentage of total views.
  • This system addresses the shortcomings of report-only metrics, which often miss under-reported harms and lack statistical power.
  • It utilizes ML-assisted sampling from daily user impressions, leveraging production risk scores for efficiency while ensuring unbiased prevalence estimates.
  • A multimodal LLM (vision + text) enables bulk labeling of sampled content, significantly reducing latency and cost compared to human review.
  • Inverse-probability weighting ensures unbiased, design-consistent prevalence metrics, decoupling measurement from enforcement model thresholds.
  • Continuous calibration, human validation, and periodic checks against SME-labeled gold sets maintain LLM accuracy and detect model drift.
  • The system provides daily, statistically powered insights for faster interventions and effective content safety tracking.
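The core estimator behind the unbiasedness claim can be shown in a few lines. This is a generic Hájek (ratio-form Horvitz-Thompson) sketch, not Pinterest's code: each labeled impression is weighted by the inverse of its known inclusion probability, so oversampling high-risk content for labeling efficiency does not bias the prevalence estimate.

```python
def prevalence_ipw(samples):
    """Inverse-probability-weighted prevalence estimate.

    `samples` is a list of (is_violating, inclusion_prob) pairs for
    impressions drawn with known, possibly risk-skewed probabilities.
    Weighting each label by 1/inclusion_prob undoes the sampling
    skew, decoupling measurement from the risk-score thresholds
    used to pick what gets labeled."""
    weighted_hits = sum(y / p for y, p in samples)
    total_weight = sum(1 / p for _, p in samples)
    return weighted_hits / total_weight
```

For example, a violating item sampled with probability 0.8 counts for 1.25 impressions, while a benign item sampled at 0.2 counts for 5, restoring the population proportions.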

Why it matters: This article demonstrates a practical approach to de-biasing recommendation systems by integrating direct user feedback via surveys into ML model training. Engineers can learn how to move beyond pure engagement metrics to build more user-centric and high-quality content platforms.

  • Pinterest implemented in-app Pinner surveys to gather direct user feedback on content visual quality, moving beyond traditional engagement metrics.
  • The survey design collected at least 10 ratings per image for 5k Pins across diverse interest verticals, averaging scores to ensure data reliability and reduce subjectivity.
  • A machine learning model was trained using this aggregated survey data, mapping image embedding features to a single score (0-1) indicating perceived visual quality.
  • This ML model is integrated into Pinterest's core recommendation systems, including Homefeed, Related Pins, and Search, to promote higher quality content.
  • The approach aims to de-bias recommendation systems, prevent the promotion of low-quality "clickbait," and align content delivery with user well-being and satisfaction.
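The labeling step above can be sketched simply. The 10-rating floor mirrors the article; the 1-5 rating scale and min-max normalization are assumptions for illustration. Averaging many raters dampens individual subjectivity before the score becomes a 0-1 regression target over image embeddings.

```python
def quality_label(ratings, scale_max=5, min_ratings=10):
    """Aggregate per-image survey ratings into a 0-1 training target.

    Assumes a 1..scale_max rating scale (illustrative). Images with
    too few raters are dropped rather than trusted, keeping the
    label set reliable."""
    if len(ratings) < min_ratings:
        return None  # too few raters to trust the label
    mean = sum(ratings) / len(ratings)
    return (mean - 1) / (scale_max - 1)
```

A downstream model then learns to map image embeddings to this score, which is what gets blended into Homefeed, Related Pins, and Search ranking.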

Why it matters: GitHub Copilot Spaces significantly reduces the time engineers spend hunting for context during debugging by providing AI with project-specific knowledge. This leads to faster, more accurate solutions and streamlined development workflows.

  • GitHub Copilot Spaces enhances AI debugging by providing project-specific context like files, pull requests, and issues, leading to more accurate suggestions.
  • Spaces act as dynamic knowledge bundles, automatically syncing with linked content to ensure Copilot always has up-to-date information.
  • Users create a space, add relevant project assets (e.g., security docs, architecture overviews, specific issues), and define custom instructions for Copilot's behavior.
  • Copilot leverages this curated context to generate detailed debugging plans and propose code changes, citing its sources for transparency and auditability.
  • The integrated coding agent can then create pull requests with before/after versions, explanations, and references to the guiding instructions and files.