Curated topic

mlp

Posts tagged with mlp

Salesforce EngineeringDec 3, 2025

How Agentforce Achieved 3–5x Faster Response Times While Solving Enterprise-Scale Architectural Complexity

Why it matters: This article demonstrates how to scale agentic AI in complex enterprise environments by balancing LLM reasoning with deterministic logic. It provides a blueprint for reducing latency and ensuring architectural consistency across multi-brand deployments while maintaining high accuracy.

Restructured architecture by offloading deterministic tasks like JSON parsing and hierarchical decisioning from the LLM to Apex code to ensure consistency.
Reduced multi-stage reasoning latency by approximately 20 seconds by consolidating sequential model calls into a single execution step.
Optimized data retrieval by combining Data 360 lookups and order API calls into single, efficient pulls rather than incremental passes.
Developed a multi-brand architecture using a shared core logic layer while allowing brand-specific prompt overrides for unique tone and voice.
Improved response times by 3–5x through the elimination of redundant reasoning loops and the stabilization of data-flow boundaries.

#mlp #data #dist

Read original

Microsoft Azure BlogDec 2, 2025

Introducing Mistral Large 3 in Microsoft Foundry: Open, capable, and ready for production workloads

Why it matters: This article matters because it introduces a powerful, open-source, Apache-licensed frontier model (Mistral Large 3) into Azure Foundry, providing enterprises with a flexible, reliable, and production-ready AI solution for complex, multimodal, and long-context applications.

Mistral Large 3, an Apache-licensed open-weight frontier model, is now available in Microsoft Azure Foundry for enterprise production.
It offers reliable instruction following, long-context comprehension, and strong multimodal reasoning, optimized for real-world applications.
The model demonstrates low hallucination rates and consistent performance in complex, multi-turn interactions and extended inputs.
Exceptional long-context handling supports RAG, document understanding, and long-form summarization.
Its multimodal capabilities enable cross-modal understanding for text, images, and structured data.
Fully open and Apache 2.0 licensed, it allows flexible deployment, fine-tuning, and commercial use without restrictions.
Azure Foundry provides unified access, governance, and agent-ready tooling for seamless integration.

#mlp

Read original

Microsoft Azure BlogDec 2, 2025

A decade of open innovation: Celebrating 10 years of Microsoft and Red Hat partnership

Why it matters: This article highlights how a decade-long partnership between Microsoft and Red Hat has driven significant advancements in hybrid cloud, open source, and AI. Engineers can learn about integrated platforms like ARO, cost-saving benefits, and tools for modernizing applications and scaling AI.

Microsoft and Red Hat mark a decade of partnership, advancing open source and enterprise cloud innovation, particularly for hybrid cloud transformation.
Key offerings include Red Hat Enterprise Linux (RHEL) on Azure and Azure Red Hat OpenShift (ARO), a jointly engineered, fully managed application platform.
The collaboration has enabled digital transformation, cost savings, and accelerated AI initiatives for global enterprises across various industries.
Technical accomplishments include deep integration of Red Hat solutions on Azure, OpenShift Virtualization, Confidential Containers, and contributions to Kubernetes.
The partnership provides a secure, governable foundation for scalable AI adoption, leveraging ARO with Azure OpenAI Service and Microsoft Foundry.
Flexible pricing through Azure Hybrid Benefit for RHEL helps optimize costs for organizations running workloads on Azure.

#dist #mlp #finops

Read original

GitHub EngineeringDec 1, 2025

How to orchestrate agents using mission control

Why it matters: This tool enhances developer productivity by enabling parallel execution and orchestration of AI coding agents, centralizing task management and review. It shifts the mental model from sequential to concurrent work, optimizing development workflows.

GitHub's new Agent HQ mission control provides a unified interface for managing Copilot coding agent tasks across multiple repositories.
The tool facilitates a shift from sequential to parallel task execution, allowing engineers to assign and orchestrate multiple agent tasks concurrently.
Effective orchestration involves crafting clear, contextual prompts and leveraging custom agents for consistent results.
Engineers must actively monitor agents for signals like failing tests, scope creep, or misinterpretation, intervening with specific guidance when necessary.
While parallel processing is ideal for research, analysis, documentation, and security reviews, sequential workflows remain suitable for dependent or complex tasks.
Mission control centralizes assignment, oversight, and review, streamlining the development workflow and enhancing productivity.

#mlp #culture

Read original

Slack EngineeringDec 1, 2025

Streamlining Security Investigations with Agents

Why it matters: This article details how Slack built robust AI agent systems for security investigations by moving from single prompts to chained, structured model invocations, offering a blueprint for reliable AI application development.

Slack's Security Engineering team implemented AI agents to streamline security investigations, processing billions of events daily.
Initial prototypes, relying on a single large prompt, exhibited inconsistent performance despite prompt refinement attempts.
The team's solution involved breaking down complex investigations into a sequence of chained, single-purpose model invocations.
Utilizing structured output, defined by JSON schema, was key to achieving fine-grained control and predictable behavior at each step.
The production system employs a team of 'personas' (agents) for specific tasks, with the application orchestrating their interactions and context propagation.
This method significantly improves consistency and reliability in AI-driven security analysis, moving beyond simple prompt engineering.

#security #mlp

Read original

Cloudflare BlogDec 1, 2025

Why Replicate is joining Cloudflare

Why it matters: Replicate's acquisition by Cloudflare signifies a major step towards building a comprehensive, integrated AI infrastructure. It promises to simplify the deployment and scaling of complex AI applications by combining model serving with a global network and full-stack primitives.

Replicate, founded in 2019, aimed to democratize access to research-grade ML models by abstracting away infrastructure complexities.
They developed Cog for model packaging and the Replicate platform for running models as cloud API endpoints, successfully scaling with models like Stable Diffusion.
The modern AI stack has evolved beyond just model inference, requiring a full suite of services like microservices, storage, and databases.
Replicate is joining Cloudflare to leverage Cloudflare's extensive network, Workers, R2, and other primitives to build a complete, integrated AI infrastructure layer.
This acquisition will enable faster edge models, model pipelines on Workers, and streaming model I/O, realizing a vision where "the network is the computer" for AI.

#mlp #dist #data

Read original

Dropbox Tech BlogNov 26, 2025

Building the future: highlights from Dropbox’s 2025 summer intern class

Why it matters: This article showcases how intern-led projects drive critical production improvements in ML observability, storage latency, and developer productivity, highlighting the practical application of AI in enterprise-scale infrastructure.

Dropbox's 2025 intern program integrated 28 engineering interns into high-impact projects supporting Dropbox Dash, an AI-powered universal search tool.
Interns refactored the file history tracking system within the metadata infrastructure, significantly reducing operational costs and simplifying legacy systems.
The ML Platform team developed 'AI Sentinel,' a monitoring system providing real-time operational visibility into the health of machine learning model deployments.
Storage Core improvements included implementing health-aware routing in Magic Pocket to mitigate PUT latencies during scheduled disk restarts.
The Web Developer Experience team built an AI-powered automation tool for code migrations that automatically generates pull requests for developers.

#culture #mlp #dist

Read original

GitHub EngineeringNov 25, 2025

Why developers still flock to Python: Guido van Rossum on readability, AI, and the future of programming

Why it matters: This article highlights Python's enduring appeal, its foundational design principles emphasizing readability and accessibility, and its continued dominance in AI and data science, offering insights into language evolution and developer preferences.

Python, created by Guido van Rossum, emerged to simplify programming by offering a safer, more expressive alternative to C and shell scripting.
Despite TypeScript's recent lead on GitHub, Python grew 49% in 2025, maintaining its status as the default language for AI, science, and education.
Its core design emphasizes readability, intuitive syntax, friendly error messages, and a rich standard library, fostering accessibility.
Python's open-source nature, cross-platform support, and strong community are key to its versatility and widespread adoption.
The language's "irreverent" name reflects a deliberate choice to make programming less intimidating and more welcoming.

#data #mlp #culture

Read original

GitHub EngineeringNov 25, 2025

How GitHub’s agentic security principles make our AI agents as secure as possible

Why it matters: This article provides essential security principles for developing and deploying AI agents, addressing critical risks like data exfiltration and prompt injection. It offers practical guidelines for ensuring human oversight and accountability in agentic systems.

GitHub employs agentic security principles for AI agents like Copilot, balancing usability with security through a human-in-the-loop design.
Key risks for agentic AI include data exfiltration, impersonation/action attribution, and prompt injection.
Security controls ensure all context is visible, agents are firewalled, and access to sensitive data is limited.
Agents are prevented from making irreversible state changes without human approval, such as creating pull requests instead of direct commits.
Actions are clearly attributed to both the initiating user and the agent, ensuring accountability.
Context gathering is restricted to authorized users with appropriate repository permissions.

#security #mlp

Read original

Cloudflare BlogNov 25, 2025

Partnering with Black Forest Labs to bring FLUX.2 [dev] to Workers AI

Why it matters: Engineers can leverage FLUX.2 on Workers AI for highly consistent, photorealistic image generation, solving challenges like stochastic drift. Its advanced controls and multi-reference editing enable robust AI-powered applications for marketing, e-commerce, and creative content.

FLUX.2 [dev], a new open-weight image generation model from Black Forest Labs, is now available on Cloudflare Workers AI.
It offers enhanced photorealism, physical world grounding, and supports advanced customization like JSON prompting and multipart form data for multiple image inputs.
A key feature is its ability to maintain character and product consistency across multiple generations, addressing "stochastic drift" through multi-reference editing.
FLUX.2 is designed for functional business use cases, enabling consistent ad variations, reliable product shots, and dynamic editorial content.
It supports granular controls including JSON prompting, HEX codes, and multi-language input for highly specific image generation.

#mlp

Read original

Page 31 of 38

Prev 1...29 30 31 32 33...38 Next