Curated topic

data

Posts tagged with data

Cloudflare BlogApr 16, 2026

Artifacts: versioned storage that speaks Git

Why it matters: Artifacts provides a scalable, programmable Git-compatible storage layer. It solves state persistence for AI agents and serverless apps by treating Git's data model as a primitive for time-travel, forking, and versioning any data at massive scale.

Artifacts is a distributed, versioned filesystem supporting the Git protocol, optimized for the high-volume storage needs of AI agents.
It enables programmatic repository creation via REST and native Workers APIs, facilitating repo-per-session or repo-per-sandbox architectures.
The platform supports standard Git operations including cloning, forking, and importing from external remotes like GitHub within serverless workflows.
Built on Cloudflare Durable Objects, the system scales to millions of isolated, stateful repository instances.
The core Git engine is written in Zig and compiled to a 100KB Wasm binary, ensuring high performance and low memory overhead.

#dist #data #mlp

Read original

Cloudflare BlogApr 16, 2026

AI Search: the search primitive for your agents

Why it matters: Building RAG pipelines is complex, requiring manual chunking, indexing, and hybrid search logic. This tool abstracts that infrastructure, allowing engineers to deploy isolated, searchable context for agents at scale without managing separate database clusters or complex pipelines.

Cloudflare AI Search (formerly AutoRAG) is a plug-and-play search primitive designed for agentic AI workflows.
Supports hybrid search by running semantic vector matching and BM25 keyword search in parallel with result fusion.
Features built-in storage and indexing, eliminating the need to manually configure R2 buckets or Vectorize indexes.
The new ai_search_namespaces binding allows developers to dynamically create, delete, and manage search instances at runtime.
Enables multi-instance querying, allowing agents to search across shared knowledge bases and private per-customer history in one call.
Integrates directly with the Cloudflare Agents SDK and Workers AI to provide a streamlined RAG implementation.

#mlp #data #dist

Read original

Cloudflare BlogApr 16, 2026

Artifacts: versioned storage that speaks Git

Why it matters: Artifacts provides a Git-compatible versioned filesystem designed for the scale of AI agents. By leveraging Durable Objects and a custom Zig-based Git engine, it enables programmatic, high-performance state management, allowing developers to treat versioning as a first-class primitive.

Artifacts is a distributed, versioned filesystem that implements the Git protocol for programmatic use by AI agents and serverless functions.
The system is built on Cloudflare Durable Objects, allowing for the creation of millions of isolated, stateful repository instances.
The core Git engine was written in Zig and compiled to a 100KB WebAssembly binary to ensure high performance and manual memory management.
It supports native Workers and REST APIs, enabling repository creation, branching, and forking without requiring a traditional Git client.
Beyond source control, it is designed for persisting sandbox states, session history, and time-traveling through application data.
Developers can bootstrap repositories from existing sources like GitHub using an import API for isolated agent workflows.

#dist #data #mlp

Read original

Cloudflare BlogApr 16, 2026

Deploy Postgres and MySQL databases with PlanetScale + Workers

Why it matters: This integration simplifies full-stack development by combining edge computing with managed relational databases. Unified billing and Hyperdrive-powered performance optimization reduce operational overhead and latency, making it easier to build scalable, data-intensive applications.

Cloudflare Workers now supports direct creation and management of PlanetScale Postgres and MySQL databases via the dashboard and API.
Unified billing allows users to pay for PlanetScale databases directly through their Cloudflare account, utilizing existing credits or committed spend.
Integration with Hyperdrive provides automatic database connection pooling and query caching to optimize performance and reliability.
Developers can use explicit placement hints to run Workers in data centers closest to their centralized PlanetScale databases, minimizing network latency.
Support for Postgres extensions like pgvector enables building AI-driven applications with vector search capabilities directly on the edge.

#data #dist #finops

Read original

Airbnb EngineeringApr 14, 2026

Privacy-first connections: Empowering social experiences at Airbnb

Why it matters: This architecture demonstrates how to build social features without compromising privacy. By decoupling internal identities from public profiles, engineers can provide granular user control and prevent unintended data leakage across different product contexts.

Airbnb decoupled internal 'User' records from public-facing 'Profiles' to implement privacy by design for new social features.
The system maps a single User ID to multiple context-specific Profile IDs, preventing unauthorized cross-context user tracking.
Experience-specific Guest Profiles allow users to toggle visibility (full profile vs. first name only) for each individual booking.
The architecture ensures co-guests in one Experience cannot link a user to their participation in other unrelated activities.
This granular identity management allows Airbnb to foster community connections while maintaining strict data privacy and user autonomy.

#security #data

Read original

Salesforce EngineeringApr 13, 2026

Reducing Agentforce AI Debugging from Two Weeks to Same-Day with Query-Driven Observability

Why it matters: Traditional logs fail to capture the data context of AI responses. This query-driven approach allows engineers to inspect the exact document chunks and embeddings used in production, slashing debugging time from weeks to hours while maintaining strict data isolation.

Salesforce's Einstein Notebooking platform enables secure, production-grade AI debugging for Agentforce and Data 360 systems.
The system addresses the 'black box' nature of AI agents by providing visibility into ingestion, chunking, retrieval, and response generation.
Engineers use Spark-based workflows to query production indexes, including vector, keyword, and hybrid search results directly.
Query-driven observability allows for inspection of document chunks, embeddings, and session-level feedback within a unified notebook environment.
Tenant-scoped access patterns ensure strict data isolation and compliance while investigating real production scenarios.
The approach reduced investigation times from two weeks to a single day for over 60 Agentforce features and 400 million records.

#mlp #data #sre

Read original

Pinterest EngineeringApr 13, 2026

Scaling Recommendation Systems with Request-Level Deduplication

Why it matters: Scaling ML models often leads to exponential costs. This approach demonstrates how architectural changes like request-level deduplication and SyncBatchNorm can decouple model complexity from infrastructure overhead, enabling massive scale-ups without proportional cost increases.

Pinterest implemented request-level deduplication to manage infrastructure costs as recommendation models scaled 100x in parameter count.
By sorting data by request ID in Apache Iceberg, the team achieved 10-50x storage compression for user-heavy feature columns.
Request-sorted training data initially disrupted the IID assumption, causing performance regressions in ranking models due to Batch Normalization instability.
The team resolved training regressions by implementing Synchronized Batch Normalization (SyncBatchNorm) to aggregate statistics across all devices.
Deduplication allows processing massive user sequences (16K tokens) once per request rather than redundantly for every candidate item scored.

#mlp #data #finops

Read original

Cloudflare BlogApr 13, 2026

Durable Objects in Dynamic Workers: Give each AI-generated app its own database

Why it matters: This feature allows AI-generated or user-provided code to have its own persistent, low-latency database without manual provisioning. It bridges the gap between ephemeral serverless execution and stateful application needs in a secure, sandboxed environment.

Cloudflare introduced Durable Object Facets to provide persistent storage for code loaded via Dynamic Workers.
Dynamic Workers use isolates instead of containers, offering 100x faster load times and 1/10 the memory usage.
Durable Object Facets allow dynamic code to extend the DurableObject class and access a dedicated SQLite database.
A supervisor Durable Object acts as a controller, managing the lifecycle and requests for dynamic facets.
This architecture enables AI-generated applications to maintain long-lived state with zero-latency local disk access.

#dist #data #security

Read original

Netflix Tech BlogApr 10, 2026

Evaluating Netflix Show Synopses with LLM-as-a-Judge

Why it matters: This framework shows how to automate subjective quality control at scale. By aligning LLMs with expert rubrics and business metrics, engineers can proactively optimize user engagement and content discovery before titles even launch.

Netflix developed an LLM-as-a-Judge framework to automate the quality evaluation of show synopses, achieving over 85% agreement with expert creative writers.
The system uses dedicated LLM judges for specific criteria like clarity and precision, rather than a single multi-purpose prompt, to improve scoring accuracy.
A golden dataset of 600 synopses was created using a model-in-the-loop consensus process to resolve subjectivity and align AI with human editorial standards.
Technical optimizations included Automatic Prompt Optimization (APO) and inference-time scaling techniques such as longer rationales and consensus scoring.
LLM-derived quality scores were found to correlate with key streaming metrics like Take Fraction and Abandonment Rate, enabling proactive content fixes.

#mlp #data

Read original

PlanetScale Tech BlogApr 10, 2026

Keeping a Postgres queue healthy

Why it matters: Using Postgres for queues is convenient but risky. High-churn tables generate dead tuples that can bloat indexes. If long-running transactions block autovacuum, I/O overhead can degrade the entire database's performance, potentially bringing down the application.

Postgres queues offer transactional consistency but are prone to performance degradation from high row churn.
MVCC creates dead tuples upon deletion, which persist in heap pages and indexes until vacuumed.
Accumulated dead tuples force index scans to traverse unnecessary entries, increasing I/O and latency.
Autovacuum efficiency is globally constrained; a single long-running transaction can block cleanup for all tables.
Implementing FOR UPDATE SKIP LOCKED enables high-concurrency job processing without worker interference.
Database health requires monitoring transaction age to prevent runaway bloat in transient tables.

#data #sre #dist

Read original

Page 11 of 33

Prev 1...9 10 11 12 13...33 Next