Curated topic

data

Posts tagged with data

Pinterest EngineeringDec 3, 2025

Autonomous Observability at Pinterest (Part 1 of 2)

Why it matters: This article demonstrates how to overcome legacy observability challenges by pragmatically integrating AI agents and context engineering, offering a blueprint for unifying fragmented data without costly overhauls.

Pinterest faced fragmented observability data (logs, traces, metrics) due to legacy infrastructure predating OpenTelemetry, hindering efficient root-cause analysis.
They adopted a pragmatic solution using AI agents and a Model Context Protocol (MCP) server to unify disparate observability signals without a full infrastructure overhaul.
The MCP server allows AI agents to interact simultaneously with various data pillars (metrics, logs, traces, change events) to find correlations and build hypotheses.
This "context engineering" approach aims to provide intelligent agents with comprehensive data, leading to faster, clearer root-cause analysis and actionable insights.
The initiative represents a "shift-left" (proactive integration) and "shift-right" (production visibility) strategy, leveraging AI to overcome existing observability limitations.

#sre #dist #data

Read original

Microsoft Azure BlogDec 3, 2025

New options for AI-powered innovation, resiliency, and control with Microsoft Azure

Why it matters: This article highlights how Azure Local provides engineers with flexible, sovereign, and resilient cloud capabilities on-premises or at the edge. It enables deploying AI and critical workloads while meeting strict compliance and operational autonomy requirements, even in disconnected environments.

Azure Local extends Azure public cloud infrastructure to customer datacenters and distributed locations, ensuring control, resilience, and operational autonomy for mission-critical workloads.
It addresses data sovereignty and compliance needs, enabling AI, scalable compute, and advanced analytics to run locally or at the edge.
Key advancements include General Availability for Microsoft 365 Local, NVIDIA RTX GPUs for on-premises AI, and Azure Migrate support.
Preview features like AD-less deployments, Rack-Aware Clustering, multi-rack deployments, and fully disconnected operations enhance flexibility and autonomy.
Leveraging Azure Arc, Azure Local provides a unified platform for hybrid and disconnected environments, supporting diverse industries like manufacturing and public sector.
Integration with Azure IoT and Microsoft Fabric facilitates intelligent physical operations and real-time insights from operational data.

#dist #data #mlp

Read original

Salesforce EngineeringDec 3, 2025

How Agentforce Achieved 3–5x Faster Response Times While Solving Enterprise-Scale Architectural Complexity

Why it matters: This article demonstrates how to scale agentic AI in complex enterprise environments by balancing LLM reasoning with deterministic logic. It provides a blueprint for reducing latency and ensuring architectural consistency across multi-brand deployments while maintaining high accuracy.

Restructured architecture by offloading deterministic tasks like JSON parsing and hierarchical decisioning from the LLM to Apex code to ensure consistency.
Reduced multi-stage reasoning latency by approximately 20 seconds by consolidating sequential model calls into a single execution step.
Optimized data retrieval by combining Data 360 lookups and order API calls into single, efficient pulls rather than incremental passes.
Developed a multi-brand architecture using a shared core logic layer while allowing brand-specific prompt overrides for unique tone and voice.
Improved response times by 3–5x through the elimination of redundant reasoning loops and the stabilization of data-flow boundaries.

#mlp #data #dist

Read original

Cloudflare BlogDec 1, 2025

Why Replicate is joining Cloudflare

Why it matters: Replicate's acquisition by Cloudflare signifies a major step towards building a comprehensive, integrated AI infrastructure. It promises to simplify the deployment and scaling of complex AI applications by combining model serving with a global network and full-stack primitives.

Replicate, founded in 2019, aimed to democratize access to research-grade ML models by abstracting away infrastructure complexities.
They developed Cog for model packaging and the Replicate platform for running models as cloud API endpoints, successfully scaling with models like Stable Diffusion.
The modern AI stack has evolved beyond just model inference, requiring a full suite of services like microservices, storage, and databases.
Replicate is joining Cloudflare to leverage Cloudflare's extensive network, Workers, R2, and other primitives to build a complete, integrated AI infrastructure layer.
This acquisition will enable faster edge models, model pipelines on Workers, and streaming model I/O, realizing a vision where "the network is the computer" for AI.

#mlp #dist #data

Read original

GitHub EngineeringNov 25, 2025

Why developers still flock to Python: Guido van Rossum on readability, AI, and the future of programming

Why it matters: This article highlights Python's enduring appeal, its foundational design principles emphasizing readability and accessibility, and its continued dominance in AI and data science, offering insights into language evolution and developer preferences.

Python, created by Guido van Rossum, emerged to simplify programming by offering a safer, more expressive alternative to C and shell scripting.
Despite TypeScript's recent lead on GitHub, Python grew 49% in 2025, maintaining its status as the default language for AI, science, and education.
Its core design emphasizes readability, intuitive syntax, friendly error messages, and a rich standard library, fostering accessibility.
Python's open-source nature, cross-platform support, and strong community are key to its versatility and widespread adoption.
The language's "irreverent" name reflects a deliberate choice to make programming less intimidating and more welcoming.

#data #mlp #culture

Read original

PlanetScale Tech BlogNov 21, 2025

AI-Powered Postgres index suggestions

Why it matters: Automating index optimization reduces the manual burden of database tuning. By combining LLMs with rigorous validation via HypoPG, engineers receive reliable, data-driven recommendations that improve query speed without the risk of hallucinated or ineffective indexes.

PlanetScale Insights now provides AI-powered index suggestions for PostgreSQL by analyzing workload and schema data.
To ensure relevance, the system targets queries with high rows-read to rows-returned ratios and significant resource consumption.
LLMs generate candidate indexes, which undergo strict syntactic validation and performance impact analysis.
The HypoPG extension is used to create hypothetical indexes, allowing the Postgres planner to estimate cost savings without actual overhead.
Only suggestions that provide a substantial, measurable improvement in query cost are surfaced to the user.

#data #mlp #sre

Read original

GitHub EngineeringNov 20, 2025

Evolving GitHub Copilot’s next edit suggestions through custom model training

Why it matters: This article details advanced techniques in training AI for developer tools, showcasing how custom data collection, SFT, and RL overcome challenges in real-time code prediction. It's crucial for engineers building AI-powered developer experiences and understanding practical LLM deployment.

GitHub Copilot's Next Edit Suggestions (NES) uses a custom, low-latency model designed to predict developers' next code edits in real time.
Initial attempts with general LLMs and pull request data failed; a custom, high-quality dataset derived from real-time editing sessions was crucial for training.
The foundational NES model was developed using Supervised Fine-Tuning (SFT) on this specialized dataset.
Reinforcement Learning (RL), incorporating a custom 'grader' model, further refined the NES model, addressing SFT limitations by leveraging unlabeled data and explicitly defining criteria for 'bad' suggestions.
This 'AI-native' approach emphasizes end-to-end co-design of model training, prompt engineering, and user experience for seamless IDE integration.
Recent improvements focus on prompt optimization to reduce latency and enhance the relevance and quality of suggestions.

#mlp #data

Read original

Engineering at MetaNov 18, 2025

Efficient Optimization With Ax, an Open Platform for Adaptive Experimentation

Why it matters: Engineers can leverage Ax, an open-source ML-driven platform, to efficiently optimize complex systems like AI models and infrastructure. It streamlines experimentation, reduces resource costs, and provides deep insights into system behavior, accelerating development and deployment.

Ax 1.0 is an open-source adaptive experimentation platform leveraging machine learning for efficient optimization of complex systems.
It's widely used at Meta to improve AI models, tune production infrastructure, and accelerate advances in ML and hardware design.
The platform employs Bayesian optimization to guide resource-intensive experiments, identifying optimal configurations efficiently.
Ax provides advanced analytical tools, including Pareto frontiers and sensitivity analysis, for deeper system understanding beyond just finding optimal settings.
An accompanying paper details Ax's core architecture, methodology, and performance comparison against other black-box optimization libraries.

#mlp #sre #data

Read original

Microsoft Azure BlogNov 18, 2025

Azure at Microsoft Ignite 2025: All the intelligent cloud news explained

Why it matters: This article highlights Azure's comprehensive AI-first platform, offering engineers new tools for building, securing, and scaling intelligent applications and data solutions, enhancing productivity and innovation across various domains.

Azure at Ignite 2025 unifies AI, data, apps, and infrastructure to deliver intelligence at scale, addressing key business questions on AI adoption and data readiness.
New AI agent capabilities include Microsoft Fabric IQ, Foundry IQ, and Microsoft Agent Factory, simplifying the creation and scaling of intelligent applications.
Significant data modernization updates feature SAP BDC Connect for Fabric, Azure HorizonDB (PostgreSQL), Azure DocumentDB, and SQL Server 2025 for enhanced data management.
Operations and security are boosted with AI-powered tools like Foundry Control Plane, Azure Copilot with built-in agents, and native DevSecOps integration for Defender for Cloud and GitHub Advanced Security.
AI-ready infrastructure is introduced with Azure Boost for speed and security, and Azure Cobalt 200, redefining performance for the agentic era.
Microsoft Foundry expands its model choice by adding Anthropic Claude (Sonnet 4.5, Opus 4.1, Haiku 4.5) and Cohere models, making Azure the only cloud offering both OpenAI and Anthropic models.

#mlp #data #dist

Read original

Microsoft Azure BlogNov 18, 2025

Microsoft Databases and Microsoft Fabric: Your unified and AI-powered data estate

Why it matters: This article highlights Microsoft's push for a unified, AI-powered data estate. Engineers gain access to new, integrated database solutions like SQL Server 2025 and Azure DocumentDB, simplifying data management and accelerating AI development across hybrid and multi-cloud environments.

Microsoft announced the general availability of SQL Server 2025, Azure DocumentDB, and SQL/Cosmos DB in Fabric, alongside a preview of Azure HorizonDB (PostgreSQL).
Microsoft Fabric serves as a unified hub, integrating these new database offerings for a cohesive, AI-ready data estate.
SQL Server 2025 introduces developer-first AI capabilities like smarter search and AI model management, enhanced reliability, and security features.
SQL Server 2025 data is instantly accessible in Microsoft OneLake through mirroring for Fabric, supporting AI and analytics workloads.
Azure DocumentDB is a new MongoDB-compatible, AI-ready service designed for hybrid and multi-cloud environments.

#data #mlp

Read original

Page 24 of 33

Prev 1...22 23 24 25 26...33 Next