Why it matters: As AI workloads push GPU power consumption beyond the limits of traditional air cooling, liquid cooling becomes essential. This project demonstrates a viable path for maintaining hardware reliability and efficiency in high-density data centers.
- Dropbox engineers developed a custom liquid cooling system for GPU servers during Hack Week 2025 to address the thermal demands of AI workloads.
- The team built a prototype from scratch using radiators, pumps, reservoirs, and manifolds when pre-assembled units were unavailable.
- Stress tests revealed that liquid cooling reduced operating temperatures by 20–30°C compared to standard air-cooled production systems (a minimal measurement sketch follows this list).
- The project enabled reduced fan speeds for secondary components, leading to quieter operation and potential power savings.
- The initiative serves as a proof of concept for future-proofing data center infrastructure against the rising power consumption of next-gen GPUs.
- Future plans include expanding testing with dedicated liquid cooling labs across multiple Dropbox data centers.
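The post doesn't include the team's measurement harness, so the following is only a rough sketch of how a stress-test temperature delta could be recorded, polling per-GPU temperatures with nvidia-smi (the polling interval and duration are arbitrary):

```python
import subprocess
import time

def gpu_temperatures():
    """Read the current temperature (Celsius) of each GPU via nvidia-smi."""
    out = subprocess.check_output(
        ["nvidia-smi", "--query-gpu=temperature.gpu",
         "--format=csv,noheader,nounits"],
        text=True,
    )
    return [int(line) for line in out.strip().splitlines()]

def monitor(duration_s=600, interval_s=5):
    """Poll temperatures during a stress run and report the peak per GPU."""
    peaks = {}
    end = time.time() + duration_s
    while time.time() < end:
        for gpu, temp in enumerate(gpu_temperatures()):
            peaks[gpu] = max(peaks.get(gpu, 0), temp)
        time.sleep(interval_s)
    for gpu, peak in sorted(peaks.items()):
        print(f"GPU {gpu}: peak {peak} C")

if __name__ == "__main__":
    monitor()
```

Running the same monitor against an air-cooled and a liquid-cooled chassis under identical load yields the kind of 20–30°C comparison reported above.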
Why it matters: This article details Pinterest's journey in building PinConsole, an Internal Developer Platform based on Backstage, to enhance developer experience and scale engineering velocity by abstracting complexity and unifying tools.
- Pinterest adopted an Internal Developer Platform (IDP) strategy to counter the degradation of engineering velocity caused by increasing complexity and tool fragmentation.
- They chose Backstage as the open-source foundation for their IDP, PinConsole, due to its community adoption, extensible plugin architecture, and active development.
- PinConsole aims to provide consistent abstractions and self-service capabilities, reducing cognitive overhead for engineers by unifying disparate tools and workflows.
- The architecture includes custom integrations with Pinterest's internal OAuth and LDAP systems for secure, seamless authentication within the platform.
- The IDP addresses challenges such as inconsistent workflows, tool discovery issues, and fragmented documentation, significantly improving the overall developer experience (a small catalog-query sketch follows this list).
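PinConsole itself is built as Backstage TypeScript plugins, which the article doesn't reproduce; as a hedged illustration of the unified service catalog an IDP exposes, here is a small Python sketch against Backstage's standard catalog REST endpoint (the host, token handling, and owner value are invented):

```python
import requests

BACKSTAGE_URL = "https://pinconsole.example.com"  # hypothetical host
TOKEN = "..."  # bearer token, issued per deployment's auth setup

def list_components(owner: str):
    """Fetch catalog components owned by a team via the Backstage catalog API."""
    resp = requests.get(
        f"{BACKSTAGE_URL}/api/catalog/entities",
        params={"filter": f"kind=component,spec.owner={owner}"},
        headers={"Authorization": f"Bearer {TOKEN}"},
        timeout=10,
    )
    resp.raise_for_status()
    return [entity["metadata"]["name"] for entity in resp.json()]

print(list_components("team-search"))  # hypothetical team name
```

A single queryable catalog like this is what replaces ad hoc tool discovery: ownership, services, and docs resolve through one API instead of scattered wikis.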
Why it matters: This article details how Netflix is innovating data engineering to tackle the unique challenges of media data for advanced ML. It offers insights into building specialized data platforms and roles for multi-modal content, crucial for any company dealing with large-scale unstructured media.
- Netflix is evolving its data engineering function into "Media ML Data Engineering" to handle complex, multi-modal media data at scale.
- This new specialization focuses on centralizing, standardizing, and managing media assets and their metadata for machine learning applications.
- The "Media Data Lake" is introduced as a platform for storing and serving media assets, leveraging vector storage solutions like LanceDB (see the sketch after this list).
- Its architecture includes a Media Table for metadata, a robust data model, a Pythonic Data API, and distributed compute for ML training and inference.
- The initiative aims to bridge creative media workflows with cutting-edge ML demands, enabling applications like content embedding and quality measures.
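Netflix doesn't publish the Media Table schema, so the following is a minimal sketch of the vector-storage idea using LanceDB's Python API, with invented fields and toy four-dimensional vectors standing in for real content embeddings:

```python
import lancedb

# Connect to a local (or object-store-backed) LanceDB database.
db = lancedb.connect("./media_data_lake")

# Hypothetical media-asset rows: metadata alongside a content embedding
# (LanceDB treats a column named "vector" as the default vector column).
assets = [
    {"asset_id": "clip-001", "title_id": "show-42", "modality": "video",
     "vector": [0.12, 0.48, 0.33, 0.91]},
    {"asset_id": "clip-002", "title_id": "show-42", "modality": "audio",
     "vector": [0.05, 0.61, 0.27, 0.88]},
]
table = db.create_table("media_assets", data=assets)

# Nearest-neighbor search over embeddings, e.g. for similar-content lookup.
query = [0.10, 0.50, 0.30, 0.90]
for match in table.search(query).limit(2).to_list():
    print(match["asset_id"], match["modality"])
```

In the real platform, the Pythonic Data API described above would sit between users and this storage layer rather than exposing raw table handles.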
Why it matters: Dropbox's jump to 90% AI adoption provides a blueprint for scaling developer productivity. It shows how combining leadership alignment with a mix of third-party and internal tools can transform the SDLC and overcome developer skepticism toward AI-assisted workflows.
- Dropbox achieved over 90% AI tool adoption among engineers by 2025 through strong leadership alignment and a structured change management plan.
- The engineering organization utilizes AI across the entire software development lifecycle, including code generation, testing, debugging, and incident resolution.
- A three-pronged strategy was employed: evaluating external tools like GitHub Copilot, developing custom internal AI solutions, and fostering a culture of knowledge sharing.
- Initial adoption challenges, such as distrust of output quality and workflow friction, were addressed through peer-to-peer training and clear performance metrics.
- The company balances third-party integrations with in-house development to solve specific organizational problems while building internal machine learning expertise.
Why it matters: This article highlights how robust ML observability is critical for maintaining reliable, high-performing ML systems in production, especially for sensitive areas like payment processing. It provides a practical framework for implementing effective monitoring and explainability.
- ML observability is essential for monitoring, understanding, and gaining insight into production ML models, ensuring reliability and continuous improvement.
- At Netflix, it's crucial for optimizing payment processing, reducing friction, and ensuring seamless subscriptions and renewals.
- An effective framework detects issues automatically, supports root cause analysis, and builds stakeholder trust by explaining system behavior.
- Netflix's approach focuses on stakeholder-facing outcomes, structured into logging, monitoring, and explaining modules.
- Logging requires a comprehensive data schema that captures model inputs, outputs, and metadata for effective analysis.
- Monitoring emphasizes online, outcome-focused metrics to understand real-world model behavior and health.
- Explainability, using tools like SHAP, clarifies the "why" behind ML decisions, both in aggregate and for individual instances (a minimal sketch follows this list).
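Netflix's internal explainability tooling isn't shown in the article; as a minimal sketch of the SHAP technique it names, here is how a tree model's decisions can be explained both in aggregate and per instance (the payment-style features and labels are invented for illustration):

```python
import pandas as pd
import shap
from sklearn.ensemble import GradientBoostingClassifier

# Invented payment-style features; real inputs would come from the logging schema.
X = pd.DataFrame({
    "retry_count": [0, 2, 1, 5, 0, 3],
    "days_since_signup": [400, 12, 90, 3, 800, 30],
    "amount_usd": [9.99, 15.49, 9.99, 19.99, 9.99, 15.49],
})
y = [1, 0, 1, 0, 1, 0]  # 1 = payment succeeded

model = GradientBoostingClassifier().fit(X, y)

# TreeExplainer computes exact SHAP values for tree ensembles.
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)

# Aggregate view: mean |SHAP| per feature approximates global importance.
print(dict(zip(X.columns, abs(shap_values).mean(axis=0))))

# Per-instance view: why did the model score row 3 the way it did?
print(dict(zip(X.columns, shap_values[3])))
```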
Why it matters: This article demonstrates how a large-scale monorepo build system migration can dramatically improve developer productivity and build reliability. It provides valuable insights into leveraging Bazel's features like remote execution and hermeticity for complex JVM environments.
- Airbnb migrated its JVM monorepo (Java, Kotlin, Scala) to Bazel over a 4.5-year effort, achieving 3–5x faster local builds and tests and 2–3x faster deploys.
- The move to Bazel was driven by the need for faster builds via remote execution, greater reliability through hermeticity, and uniform build infrastructure across all language repos.
- Bazel's remote build execution (RBE) and "Build without the Bytes" boosted performance by parallelizing actions and reducing data transfer.
- Hermetic builds, enforced by sandboxing, ensure consistent, repeatable results by isolating build actions from external environment dependencies.
- The migration strategy began with a proof of concept on a critical service with co-existing Gradle and Bazel builds, followed by a breadth-first rollout.
Why it matters: This article details how to perform large-scale, zero-downtime Istio upgrades across diverse environments. It offers a blueprint for managing complex service mesh updates, ensuring high availability and minimizing operational overhead for thousands of workloads.
- Airbnb developed a robust process for seamless Istio upgrades across tens of thousands of pods and VMs on dozens of Kubernetes clusters.
- The strategy uses Istio's canary upgrade model, running multiple Istiod revisions concurrently within a single logical service mesh.
- Each workload's upgrade is atomic: the new istio-proxy version rolls out and connects to the corresponding new Istiod revision in a single step.
- A rollouts.yml file dictates the gradual rollout, defining namespace patterns and per-version percentage distributions assigned via consistent hashing (sketched after this list).
- For Kubernetes, MutatingAdmissionWebhooks inject the correct istio-proxy and configure its connection to the specific Istiod revision based on an istio.io/rev label.
- The process prioritizes zero downtime, gradual rollouts, easy rollbacks, and independent upgrades for thousands of diverse workloads.
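The article doesn't reproduce rollouts.yml or the assignment code; the sketch below shows one straightforward reading of the consistent-hashing idea, hashing each workload's identity into a stable bucket and mapping buckets onto the configured percentage split (the revision names and field shapes are assumptions):

```python
import hashlib

def assign_revision(workload_key: str, distribution: dict[str, int]) -> str:
    """Deterministically map a workload to an Istio revision.

    Hashing the workload identity (rather than drawing randomly) means
    the same workload always lands on the same revision, so repeated
    reconciles and partial rollouts stay stable.
    """
    digest = hashlib.sha256(workload_key.encode()).digest()
    bucket = int.from_bytes(digest[:8], "big") % 100  # stable bucket in 0..99
    cumulative = 0
    for revision, percent in distribution.items():
        cumulative += percent
        if bucket < cumulative:
            return revision
    raise ValueError("distribution percentages must sum to 100")

# Hypothetical stanza mirroring a rollouts.yml entry: 10% of matching
# workloads on the new revision, the rest on the old one.
dist = {"1-20-2": 10, "1-19-5": 90}
print(assign_revision("payments-api/deployment-a", dist))
```

As long as the revision order in the file stays stable, raising a revision's percentage only moves the marginal buckets; workloads already placed on it are never reshuffled.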
Why it matters: This article provides a detailed blueprint for achieving high availability and fault tolerance for distributed databases on Kubernetes in a multi-cloud environment. Engineers can learn best practices for managing stateful services, mitigating risks, and designing resilient systems at scale.
- Airbnb achieved high availability for a distributed SQL database by deploying it across multiple Kubernetes clusters, each in a different AWS Availability Zone, a complex but effective strategy.
- They addressed the challenges of running stateful databases on Kubernetes, particularly node replacements and upgrades, using custom Kubernetes operators and admission hooks.
- A custom Kubernetes operator coordinates node replacements, ensuring data consistency and preventing service disruption across various event types (a simplified drain-gate sketch follows this list).
- Deploying across three independent Kubernetes clusters in different AWS AZs significantly limits the blast radius of infrastructure or deployment issues.
- AWS EBS provides rapid volume reattachment and durability, with tail latency spikes mitigated by read timeouts, transparent retries, and stale reads.
- Overprovisioning the database clusters ensures sufficient capacity even if an entire AZ or Kubernetes cluster fails.
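The operator's internals aren't published; the following self-contained Python sketch shows only the coordination idea, a gate that blocks a node replacement until enough healthy replicas live elsewhere (the database admin API is invented and stubbed with a fake client):

```python
import time
from dataclasses import dataclass

@dataclass
class Replica:
    node: str
    healthy: bool

class FakeDBClient:
    """Stand-in for the database's admin API (hypothetical)."""
    def __init__(self, replicas):
        self.replicas = replicas
    def replication_status(self):
        return self.replicas
    def decommission_replicas(self, node):
        self.replicas = [r for r in self.replicas if r.node != node]

def safe_to_drain(db, node, required_healthy):
    """Draining `node` must still leave enough healthy replicas behind."""
    healthy_elsewhere = sum(
        1 for r in db.replication_status() if r.healthy and r.node != node
    )
    return healthy_elsewhere >= required_healthy

def drain(db, node, required_healthy=2, poll_s=1):
    """Block the node replacement until the cluster tolerates losing it."""
    while not safe_to_drain(db, node, required_healthy):
        time.sleep(poll_s)  # wait for rebalancing/recovery to catch up
    db.decommission_replicas(node)

db = FakeDBClient([Replica("n1", True), Replica("n2", True), Replica("n3", True)])
drain(db, "n1")
print([r.node for r in db.replication_status()])  # ['n2', 'n3']
```

A real operator would drive this check from Kubernetes node and pod events; the sketch isolates just the safety condition that prevents overlapping disruptions.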
Why it matters: This article highlights the extreme difficulty of debugging elusive, high-impact performance issues in complex distributed systems during migration. It showcases the systematic troubleshooting required to uncover subtle interactions between applications and their underlying infrastructure.
- Pinterest encountered a rare, severe latency issue (100x slower) when migrating its memory-intensive Manas search infrastructure to Kubernetes.
- The in-house Manas search system, critical for recommendations, uses a two-tier root-leaf architecture, with leaf nodes handling query processing, retrieval, and ranking.
- Debugging revealed sharp P100 latency spikes every few minutes on individual leaf nodes during the index retrieval and ranking phases, affecting roughly one in a million requests.
- Extensive initial troubleshooting, including dedicated nodes, removing cgroups, and OS-level profiling, failed to isolate the root cause of the performance degradation.
- The problem persisted even when Manas ran outside its container directly on the host, suggesting a subtle interaction unique to the Kubernetes provisioning on the AMI.
Why it matters: This article details Pinterest's strategic move from Hadoop to Kubernetes for data processing at scale. It offers valuable insights into the challenges and benefits of modernizing big data infrastructure, providing a blueprint for other organizations facing similar migration decisions.
- Pinterest is migrating from its aging Hadoop 2.x platform (Monarch) to a new Kubernetes-based system, Moka, for massive-scale data processing.
- The shift to Kubernetes is driven by the need for stronger container isolation and security, better Spark performance, lower operational costs, and improved developer velocity (a generic Spark-on-Kubernetes sketch follows this list).
- Kubernetes offers built-in container support, streamlined deployment via Terraform and Helm, and a rich ecosystem of monitoring, logging, and scheduling frameworks.
- Performance optimizations include newer JDKs, GPU support, ARM/Graviton instances, and Kubernetes' native autoscaling capabilities.
- Key design challenges involve integrating EKS into Pinterest's existing infrastructure and replacing core Hadoop functionality such as the YARN UI, job submission, resource management, log aggregation, and security.
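Moka's job-submission path isn't detailed in the article; as a generic illustration of Spark running on Kubernetes instead of YARN, here is a standard PySpark session configured against a Kubernetes API server (the endpoint, image, namespace, and service account are placeholders, and a real client-mode setup needs additional driver networking config):

```python
from pyspark.sql import SparkSession

# Executors are scheduled as pods by Kubernetes rather than YARN containers.
spark = (
    SparkSession.builder
    .master("k8s://https://eks-cluster.example.com:443")
    .appName("moka-style-batch-job")
    .config("spark.kubernetes.container.image", "registry.example.com/spark:3.5")
    .config("spark.kubernetes.namespace", "batch")
    .config("spark.kubernetes.authenticate.driver.serviceAccountName", "spark")
    .config("spark.executor.instances", "4")
    .getOrCreate()
)

# Trivial job to confirm the session: sum the first million integers.
df = spark.range(1_000_000)
print(df.selectExpr("sum(id)").collect())
spark.stop()
```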