Why it matters: Scaling ML models often drives infrastructure costs that grow much faster than model size. This approach demonstrates how architectural changes like request-level deduplication and SyncBatchNorm can decouple model complexity from infrastructure overhead, enabling massive scale-ups without proportional cost increases.
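The deduplication idea can be sketched in a few lines: concurrent requests with identical payloads share a single model invocation instead of each paying for their own. This is a minimal illustration, not the article's implementation; the `DedupCache` class and the `compute` callback are assumed names.

```python
import hashlib
import threading

class DedupCache:
    """Request-level deduplication: identical concurrent requests
    share one expensive computation (e.g. a model forward pass)."""

    def __init__(self, compute):
        self._compute = compute        # the expensive call to deduplicate
        self._lock = threading.Lock()
        self._inflight = {}            # payload hash -> (done event, result slot)

    def get(self, payload: bytes):
        key = hashlib.sha256(payload).hexdigest()
        with self._lock:
            entry = self._inflight.get(key)
            leader = entry is None
            if leader:
                entry = (threading.Event(), [None])
                self._inflight[key] = entry
        event, slot = entry
        if leader:
            slot[0] = self._compute(payload)   # only one caller pays the cost
            event.set()
            with self._lock:
                del self._inflight[key]
        else:
            event.wait()                       # followers reuse the result
        return slot[0]
```

The same shape generalizes to async frameworks (a future per in-flight key) and is sometimes called request coalescing or single-flight.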
Why it matters: This feature allows AI-generated or user-provided code to have its own persistent, low-latency database without manual provisioning. It bridges the gap between ephemeral serverless execution and stateful application needs in a secure, sandboxed environment.
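The shape of the feature can be illustrated with Python's built-in `sqlite3` standing in for whatever managed database the platform actually provisions; the one-file-per-sandbox layout and the `sandbox_db` helper are assumptions for the sketch, not the product's API.

```python
import os
import sqlite3

def sandbox_db(sandbox_id: str, root: str) -> sqlite3.Connection:
    """Open (or create) the persistent store for one sandbox.
    State survives across executions because it lives outside
    the ephemeral runtime."""
    path = os.path.join(root, f"{sandbox_id}.db")   # one file per sandbox
    conn = sqlite3.connect(path)
    conn.execute("CREATE TABLE IF NOT EXISTS kv (k TEXT PRIMARY KEY, v TEXT)")
    return conn

def put(conn: sqlite3.Connection, k: str, v: str) -> None:
    conn.execute(
        "INSERT INTO kv (k, v) VALUES (?, ?) "
        "ON CONFLICT(k) DO UPDATE SET v = excluded.v",
        (k, v),
    )
    conn.commit()

def get(conn: sqlite3.Connection, k: str):
    row = conn.execute("SELECT v FROM kv WHERE k = ?", (k,)).fetchone()
    return row[0] if row else None
```

The point of the pattern is the reconnect: generated code can exit, be re-invoked later, and find its earlier writes still in place.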
Why it matters: This framework shows how to automate subjective quality control at scale. By aligning LLMs with expert rubrics and business metrics, engineers can proactively optimize user engagement and content discovery before titles even launch.
Why it matters: Using Postgres for queues is convenient but risky. High-churn queue tables generate dead tuples that bloat both the table and its indexes, and if long-running transactions prevent autovacuum from reclaiming them, the resulting I/O overhead can degrade the entire database's performance, potentially bringing down the application.
Why it matters: Scaling AI agents for enterprise datasets requires balancing throughput with strict governance. This architecture shows how to overcome rate limits and latency issues while maintaining the explainability and security essential for autonomous CRM systems.
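The rate-limit side of that balance is commonly handled with a token bucket: each agent's outbound API calls draw tokens that refill at the provider's allowed rate. A minimal sketch, with illustrative numbers and a `TokenBucket` name that is not from the article:

```python
import time

class TokenBucket:
    """Paces API calls so a fleet of agents stays under a shared
    provider rate limit, while allowing short bursts."""

    def __init__(self, rate_per_sec: float, burst: int):
        self.rate = rate_per_sec        # steady-state refill rate
        self.capacity = burst           # maximum burst size
        self.tokens = float(burst)
        self.last = time.monotonic()

    def try_acquire(self, n: int = 1) -> bool:
        now = time.monotonic()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= n:
            self.tokens -= n
            return True
        return False                    # caller should back off or queue
```

A rejected acquire is the natural hook for the governance side: the call can be queued, logged, or escalated rather than silently dropped.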
Why it matters: This article demonstrates how moving from heuristic-heavy re-ranking to sophisticated algorithms like SSD improves both system performance and long-term user retention. It highlights the importance of balancing immediate clicks with content diversity in large-scale recommendation engines.
Why it matters: Standard caches fail for rolling-window queries because time intervals shift constantly. This interval-aware approach drastically reduces redundant database load and hardware costs by reusing stable historical data and only querying the newest increments.
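The interval-aware idea can be sketched as follows: split the rolling window into fixed buckets, cache aggregates for buckets that have fully closed, and query the database only for the still-open newest slice plus any partial tail. The `query_db` callback, the bucket size, and the class name are illustrative assumptions.

```python
class RollingWindowCache:
    """Rolling-window aggregation that reuses stable historical
    buckets and only queries the newest increments."""

    def __init__(self, query_db, bucket=60):
        self._query = query_db   # (lo, hi) -> aggregate over [lo, hi)
        self._bucket = bucket    # seconds per bucket (assumed granularity)
        self._cache = {}         # closed-bucket start -> cached aggregate

    def window_sum(self, now, window):
        b = self._bucket
        start = now - window
        head = (now // b) * b                        # start of the open bucket
        total = self._query(max(head, start), now)   # newest slice: always fresh
        cur = head - b
        while cur >= start:                          # closed buckets: reuse cache
            if cur not in self._cache:
                self._cache[cur] = self._query(cur, cur + b)
            total += self._cache[cur]
            cur -= b
        if start < cur + b <= head:                  # partial oldest bucket
            total += self._query(start, cur + b)     # straddles the window edge
        return total
```

Because closed buckets never change, their cached aggregates are valid forever, which is exactly the property that a key-per-interval cache fails to exploit.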
Why it matters: Large-scale codebases often contain 'tribal knowledge' that isn't explicitly documented, making AI agents ineffective. Meta's approach shows how to use AI to systematically document this knowledge, significantly improving agent performance and developer productivity in complex systems.
Why it matters: Managing massive video archives requires sophisticated multimodal data fusion. This architecture demonstrates how to synchronize high-dimensional vector embeddings with symbolic metadata at scale, enabling low-latency, context-aware search that significantly accelerates creative workflows.
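The fusion can be reduced to a toy form: filter candidates on symbolic metadata first, then rank the survivors by embedding similarity. Real systems push both stages into an index; the flat scan, record schema, and `search` signature here are assumptions for illustration only.

```python
import math

def cosine(a, b):
    """Cosine similarity between two dense vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def search(query_vec, filters, index, top_k=5):
    """Hybrid retrieval: exact metadata match, then vector ranking.
    `index` is a list of {"id", "embedding", "meta"} records."""
    hits = [r for r in index
            if all(r["meta"].get(k) == v for k, v in filters.items())]
    hits.sort(key=lambda r: cosine(query_vec, r["embedding"]), reverse=True)
    return [r["id"] for r in hits[:top_k]]
```

The hard engineering problem the article addresses is keeping the two sides consistent as the archive mutates, not the scoring itself.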
Why it matters: Managing storage overhead at exabyte scale is critical for cost efficiency. This article provides a blueprint for handling fragmentation in immutable systems, ensuring infrastructure growth is driven by actual data needs rather than system-induced waste.