Curated topic

dist

Posts tagged with dist

Cloudflare Blog, Feb 5, 2026

2025 Q4 DDoS threat report: A record-setting 31.4 Tbps attack caps a year of massive DDoS assaults

Why it matters: DDoS attacks are reaching unprecedented scale, with botnets built on IoT devices peaking at 31.4 Tbps. Engineers must prioritize automated, multi-vector mitigation, because manual intervention is no longer viable against hyper-volumetric attacks.

DDoS attacks surged by 121% in 2025, totaling 47.1 million incidents with an average of 5,376 mitigations per hour.
A record-breaking 31.4 Tbps attack occurred in late 2025, highlighting a massive increase in attack volume.
The Aisuru-Kimwolf botnet, composed of 1 to 4 million infected Android TVs, launched hyper-volumetric HTTP attacks exceeding 200 million requests per second.
Network-layer DDoS attacks more than tripled year-over-year, accounting for 78% of all attacks in Q4 2025.
Multi-vector campaigns utilized SYN floods, Mirai-based attacks, and SSDP amplification to target global internet infrastructure.
Telecommunications emerged as the most-attacked industry, while Hong Kong and the UK saw the highest growth in attack frequency.
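
The yearly total and the hourly average quoted above can be cross-checked with simple arithmetic. The sketch below assumes a 365-day year and uses the rounded total from the summary, so it lands within a mitigation or two of the reported 5,376 per hour rather than matching it exactly. The function and constant names are illustrative, not taken from the report.

```python
# Sanity check of the hourly rate, assuming a 365-day year.
TOTAL_ATTACKS_2025 = 47_100_000   # "47.1 million incidents" (rounded in the summary)
HOURS_PER_YEAR = 365 * 24         # 8,760 hours


def hourly_rate(total: int, hours: int = HOURS_PER_YEAR) -> float:
    """Average number of events per hour over the period."""
    return total / hours


rate = hourly_rate(TOTAL_ATTACKS_2025)
print(f"{rate:,.0f} mitigations per hour")
```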

#security #sre #dist

Microsoft Azure Blog, Feb 4, 2026

Enhanced storage resiliency with Azure NetApp Files Elastic zone-redundant service

Why it matters: Azure NetApp Files Elastic ZRS is a managed, high-availability storage service designed for zero data loss and seamless failover across availability zones. This simplifies disaster recovery for mission-critical workloads like SAP HANA and SQL Server while optimizing costs and metadata performance.

Azure NetApp Files Elastic ZRS provides synchronous data replication across three or more availability zones within a single region.
The service features automated, service-managed failover that maintains the same mount targets and endpoints during zone-level outages.
It supports NFS and SMB protocols with enterprise-grade management capabilities including snapshots, clones, and storage tiering.
The architecture is cost-optimized, allowing for volumes as small as 1 GiB and reducing costs compared to manual cross-zone replication.
Future updates will introduce simultaneous multi-protocol access (NFS, SMB, and Object REST API) and custom region pairs for disaster recovery.
Optimized for metadata-heavy workloads, the service uses a shared QoS architecture to maintain low-latency operations during file enumeration.

#sre #dist #data

Cloudflare Blog, Feb 3, 2026

Improve global upload performance with R2 Local Uploads

Why it matters: Engineers can significantly reduce upload latency for global users without managing complex multi-region replication logic. It provides the performance of a local edge cache with the reliability and strong consistency of centralized object storage.

Cloudflare R2 launched Local Uploads in open beta to improve global write performance by up to 75%.
Data is initially written to a storage location near the client and then asynchronously replicated to the bucket's home region.
The system maintains strong consistency, ensuring objects are immediately accessible for reads after the initial write.
Architecture utilizes R2 Gateway Workers for routing and Durable Objects for distributed metadata management.
Synthetic benchmarks show Time to Last Byte (TTLB) dropping from 2s to 500ms for cross-region 5MB uploads.
The feature is specifically designed for globally distributed workloads like media uploads and telemetry collection.
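
The write path summarized above (write near the client, replicate asynchronously, keep reads consistent) can be illustrated with a toy in-memory model. This is a minimal sketch under stated assumptions, not Cloudflare's implementation: `ToyBucket`, its fields, and the region names are all hypothetical, and a plain dictionary stands in for the metadata layer that the post attributes to Durable Objects. The idea it shows is that reads stay consistent if they always consult metadata recording where the current copy lives.

```python
from dataclasses import dataclass, field


@dataclass
class ToyBucket:
    """Hypothetical model: local write, metadata-tracked location, later replication."""
    home_region: str
    stores: dict = field(default_factory=dict)    # region -> {key: data}
    location: dict = field(default_factory=dict)  # key -> region holding the current copy
    pending: list = field(default_factory=list)   # keys waiting to move to the home region

    def put(self, key: str, data: bytes, client_region: str) -> None:
        """Write to storage near the client and record the location in metadata."""
        previous = self.location.get(key)
        if previous is not None and previous != client_region:
            self.stores[previous].pop(key, None)  # drop the superseded copy
        self.stores.setdefault(client_region, {})[key] = data
        self.location[key] = client_region
        if client_region != self.home_region:
            self.pending.append(key)

    def get(self, key: str) -> bytes:
        """Read via metadata, so the object is visible right after put()."""
        return self.stores[self.location[key]][key]

    def replicate_pending(self) -> None:
        """Stand-in for asynchronous replication to the bucket's home region."""
        while self.pending:
            key = self.pending.pop(0)
            source = self.location[key]
            if source == self.home_region:
                continue  # already moved by an earlier pass
            data = self.stores[source].pop(key)
            self.stores.setdefault(self.home_region, {})[key] = data
            self.location[key] = self.home_region


bucket = ToyBucket(home_region="home")
bucket.put("video.mp4", b"frames", client_region="near-client")
before = bucket.get("video.mp4")   # readable before replication has run
bucket.replicate_pending()
after = bucket.get("video.mp4")    # same bytes, now served from the home region
```

In a real system replication would run in the background and the metadata update would need to be atomic; the sketch runs it synchronously only to keep the example deterministic.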

#dist #data #sre

Pinterest Engineering, Feb 2, 2026

Beyond Two Towers: Re-architecting the Serving Stack for Next-Gen Ads Lightweight Ranking Models…

Why it matters: Moving beyond Two-Tower models allows for more expressive ranking but adds substantial inference latency. This architecture demonstrates how to integrate heavy GPU inference into real-time stacks by optimizing feature fetching and moving business logic onto the GPU.

Transitioned from Two-Tower architectures to complex neural networks to enable interaction features and target attention.
Implemented an Inventory Segmentation Strategy, bundling high-value document features directly into PyTorch model registered buffers to eliminate network I/O.
Moved business logic, including utility calculations and top-k sorting, into the PyTorch model to minimize data transfer between GPU and CPU.
Reduced GPU inference latency from 4,000 ms to 20 ms using Multi-Stream CUDA to overlap compute and data transfer.
Leveraged in-house model inference engines supporting PyTorch traced models and CUDAGraphs for high-performance serving.

#mlp #dist

Microsoft Azure Blog, Feb 2, 2026

PostgreSQL on Azure supercharged for AI

Why it matters: PostgreSQL is evolving into a central hub for AI development. By integrating vector search, LLM orchestration, and seamless IDE workflows directly into the managed database service, Microsoft reduces the friction of building and scaling intelligent, data-driven applications.

Azure HorizonDB introduced as a PostgreSQL-compatible cloud service optimized for scale-out and ultra-low latency.
Integrated VS Code extension enables provisioning and managing PostgreSQL instances directly within the IDE.
In-database AI features allow developers to invoke LLMs via SQL for text classification and embedding generation.
DiskANN vector indexing and semantic ranking support high-performance similarity searches for AI agents.
Native Model Context Protocol (MCP) server support connects PostgreSQL directly to Microsoft Foundry's agent framework.
Zero-ETL mirroring to Microsoft Fabric and Parquet file support streamline real-time analytics and data movement.

#data #mlp #dist

Salesforce Engineering, Feb 2, 2026

How Agentforce Enhanced Chat Built an Agent-first Chat Experience While Ensuring Easy Migration for 3,000+ Customers

Why it matters: This article demonstrates how to re-architect a legacy multi-tenant system for AI-driven features without breaking existing integrations. It highlights the importance of backward compatibility, performance optimization via CDNs, and using AI tools to accelerate developer velocity.

Re-architected the chat experience using Lightning Web Components and Lightning Types to support agent-first interactions and high-performance web experiences.
Implemented a centralized state management system and CDN-based loading to achieve sub-second load times and responsive UI performance.
Prioritized backward compatibility by ensuring the embedding script on customer pages remained unchanged, allowing for a one-click migration path for 3,000+ tenants.
Leveraged AI-driven development tools and local developer scaffolds to accelerate iteration cycles and build diagnostic panels without waiting for platform deploys.
Integrated context-aware intelligence to enable agents to understand user intent and journeys in real-time through deeper site integration.

#frontend #mlp #dist

Cloudflare Blog, Jan 30, 2026

Building vertical microfrontends on Cloudflare’s platform

Why it matters: Vertical microfrontends solve the monolith bottleneck by giving teams full autonomy over their tech stack and deployment cycles. By routing paths to independent Workers, engineers can ship faster with less risk, while CSS View Transitions maintain a unified, high-performance user experience.

Cloudflare introduced a new Worker template for Vertical Microfrontends (VMFE) to map independent Workers to specific URL paths on a single domain.
Unlike horizontal microfrontends that split components on a page, VMFE allows teams to own the entire stack—including framework choice and CI/CD—for specific routes.
This architecture enables teams to use different technologies, such as Astro for marketing and React for dashboards, within the same user-facing application.
VMFE improves deployment safety by isolating regressions to specific paths, preventing a single team's error from rolling back the entire platform.
CSS View Transitions and document preloading are used to create a seamless, SPA-like experience across different Workers by eliminating white-screen flashes.
Cloudflare uses this pattern internally to integrate products like ZeroTrust into the main dashboard while maintaining separate codebases.
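
The core idea above, mapping URL paths on one domain to independently owned Workers, amounts to longest-prefix routing. The sketch below illustrates that idea only: the route table, the Worker names, and `resolve_worker` are hypothetical and do not reflect the actual Cloudflare template or its configuration format.

```python
# Hypothetical route table: path prefix -> name of the Worker that owns it.
ROUTES = {
    "/": "marketing-worker",
    "/dashboard": "dashboard-worker",
    "/docs": "docs-worker",
}


def resolve_worker(path: str, routes: dict = ROUTES) -> str:
    """Return the Worker owning the longest prefix that matches `path`.

    A prefix matches only on whole path segments, so "/dashboardx"
    does not fall under "/dashboard".
    """
    best = None
    for prefix in routes:
        matches = (
            prefix == "/"
            or path == prefix
            or path.startswith(prefix.rstrip("/") + "/")
        )
        if matches and (best is None or len(prefix) > len(best)):
            best = prefix
    if best is None:
        raise LookupError(f"no route for {path!r}")
    return routes[best]


owner = resolve_worker("/dashboard/settings")
fallback = resolve_worker("/pricing")
```

Because each prefix resolves to exactly one owner, a regression in one Worker affects only the paths under its prefix, which is the isolation property the summary describes.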

#frontend #dist #culture

Cloudflare Blog, Jan 29, 2026

Introducing Moltworker: a self-hosted personal AI agent, minus the minis

Why it matters: Moltworker demonstrates the maturity of Cloudflare's serverless platform for hosting complex AI agents. It shows how improved Node.js compatibility and sandboxing allow engineers to deploy secure, stateful tools globally without the overhead of managing physical hardware.

Moltworker is a Cloudflare-hosted adaptation of Moltbot, allowing users to run a personal AI agent on serverless infrastructure instead of dedicated local hardware.
The project leverages Cloudflare Workers' improved Node.js compatibility, which now supports 98.5% of the top 1,000 NPM packages natively, including node:fs.
The architecture utilizes Cloudflare Sandboxes for isolated code execution and Browser Rendering for headless automation and web interaction.
Persistent data is managed via R2 storage, while the AI Gateway provides centralized logging, cost tracking, and unified billing for LLM providers.
Security is enforced through Cloudflare Access, protecting the entrypoint Worker that acts as a proxy between the user and the isolated agent environment.

#mlp #dist #security

Dropbox Tech Blog, Jan 28, 2026

Engineering VP Josh Clemm on how we use knowledge graphs, MCP, and DSPy in Dash

Why it matters: Engineers face increasing data fragmentation across SaaS silos. This post details how to build a unified context engine using knowledge graphs, multimodal processing, and prompt optimization (DSPy) to enable effective RAG and agentic workflows over proprietary enterprise data.

Dropbox Dash functions as a universal context engine, integrating disparate SaaS applications and proprietary content into a unified searchable index.
The system utilizes custom crawlers to navigate complex API rate limits, diverse authentication schemes, and granular permission systems (ACLs).
Content enrichment involves normalizing files into markdown and using multimodal models for scene extraction in video and transcription in audio.
Knowledge graphs are employed to map relationships between entities across platforms, providing deeper context for agentic queries.
The engineering team leverages DSPy for programmatic prompt optimization and 'LLM as a judge' frameworks for automated evaluation.
The architecture explores the Model Context Protocol (MCP) to standardize how LLMs interact with external data sources and tools.

#mlp #data #dist

Engineering at Meta, Jan 27, 2026

Rust at Scale: An Added Layer of Security for WhatsApp

Why it matters: WhatsApp's migration demonstrates that Rust is production-ready for massive-scale, cross-platform applications. It proves memory-safe languages can replace legacy C++ to eliminate vulnerabilities while improving performance and maintainability.

WhatsApp replaced its wamedia C++ library with a Rust implementation to mitigate memory-related vulnerabilities in media file processing.
The migration reduced the codebase from 160,000 lines of C++ to 90,000 lines of Rust while improving performance and memory efficiency.
The Kaleidoscope system performs structural checks on media, detects masquerading file types, and flags high-risk elements like embedded scripts.
WhatsApp utilized differential fuzzing and extensive integration testing to ensure compatibility between the legacy C++ and new Rust versions.
This deployment represents one of the largest global rollouts of Rust, spanning billions of devices across Android, iOS, Web, and wearables.
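
Differential fuzzing, mentioned above as the compatibility check between the C++ and Rust versions, feeds the same generated inputs to two implementations and reports any input on which they disagree. The sketch below is a generic illustration under stated assumptions: both parsers are toy stand-ins for a length-prefixed format invented for this example, and none of the names come from WhatsApp's code.

```python
import random
import struct


def legacy_parse(buf: bytes):
    """Toy 'legacy' parser: 4-byte big-endian length, then the payload."""
    if len(buf) < 4:
        return None
    (length,) = struct.unpack(">I", buf[:4])
    if length > len(buf) - 4:
        return None  # declared length exceeds the available bytes
    return buf[4:4 + length]


def new_parse(buf: bytes):
    """Toy 'rewritten' parser that should behave identically."""
    if len(buf) < 4:
        return None
    length = int.from_bytes(buf[:4], "big")
    payload = buf[4:4 + length]
    return payload if len(payload) == length else None


def differential_fuzz(reference, candidate, iterations: int = 2000, seed: int = 0):
    """Return every generated input on which the two parsers disagree."""
    rng = random.Random(seed)  # seeded so failures are reproducible
    mismatches = []
    for _ in range(iterations):
        body = bytes(rng.randrange(256) for _ in range(rng.randrange(16)))
        if rng.random() < 0.5:
            # Half the inputs get a plausible length prefix to reach the payload path.
            body = struct.pack(">I", rng.randrange(20)) + body
        if reference(body) != candidate(body):
            mismatches.append(body)
    return mismatches


disagreements = differential_fuzz(legacy_parse, new_parse)
print(f"{len(disagreements)} disagreements found")
```

A production fuzzer would typically be coverage-guided and would compare crashes and error codes as well as return values; the seeded random generator here only keeps the example short and repeatable.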

#security #mobile #dist
