Discover Top Posts Tagged with #ebpf

AI-Assisted VoidLink Rootkit Hides in Linux Systems

VoidLink, a Linux rootkit, combines Loadable Kernel Modules and eBPF programs to stealthily hide processes and network connections, with AI-assisted development observed in its iterative coding and deployment.

Source: Elastic Security Labs

Read more: CyberSecBrief

#linux #malware #rootkit #aws #ebpf #cybersecurity

ICYMI: eBPF Web Servers vs Nginx: What Developers Need in 2026 http://dlvr.it/TTMwL1

#eBPF #WebServers #Nginx #Development #Performance

eBPF Web Servers vs Nginx: What Developers Need in 2026 http://dlvr.it/TTMJx6

#eBPF #WebServers #Nginx #Developers #Performance

Build your own scrubbing center. Understand the mitigation pipeline, handle per-CPU race conditions, and drop malicious traffic at wire-spee

Your iptables firewall is already dead.

When a 100Gbps DDoS flood hits your infrastructure, relying on iptables, ufw, or fail2ban is just an exercise in futility.

Here is the hard truth: By the time a packet reaches netfilter, the Linux kernel has already executed context switches and allocated an sk_buff memory structure. If 20 million malicious packets arrive per second, the sheer overhead of allocating and destroying those structures will cause 100% CPU starvation. Your server is dead.

The Fix: The Kernel Bypass Revolution

It’s time to drop packets before the operating system even realizes they exist. We do this using eBPF & XDP (eXpress Data Path). You attach an eBPF program directly to the NIC driver, executing an xdp_drop instruction to discard the packet instantly with virtually zero CPU overhead.

The "Cloud VM" Illusion

Beware of tutorials running XDP on a 1Gbps Cloud VM. They show you beautiful flame graphs of low CPU usage, but it’s a fatal illusion. XDP saves your CPU, but it does not save your bandwidth. If a 40Gbps flood hits your 1Gbps VM, the pipe saturates instantly. Your upstream ISP will panic and issue a Null-Route (Blackhole), completely isolating you from the internet.

Engineering a Real Scrubbing Center

To build a true wire-speed mitigation pipeline, you need:

Raw Bandwidth: 10Gbps to 100Gbps unmetered uplinks on Dedicated Bare Metal to physically absorb the volumetric attack.

BGP FlowSpec: To distribute the attack load across global datacenters.

Per-CPU LRU Maps: Writing production XDP code means handling multi-queue NIC architectures. You must use BPF_MAP_TYPE_LRU_PERCPU_HASH to prevent lock contention, race conditions, and memory exhaustion from spoofed IPs.

Stop paying the massive Cloudflare tax. Deploy raw compute and build your own scrubbing center.

📖 Read the full 100Gbps eBPF/XDP Engineering Blueprint here: 🔗 Dropping 100Gbps DDoS Attacks: The Ultimate eBPF & XDP Guide

#Cybersecurity #eBPF #XDP #DDoS #Linux #SysAdmin #DevOps #Networking #BareMetal #InfoSec #ServerInfrastructure #Tech #OpenSource #ServerMO

Observability Stack 2026: From Data Sprawl to Control Plane

The era of 'collect and keep everything' has met its financial and operational ceiling. As we move into 2026, the median enterprise spend on observability has surpassed $800,000 annually, with high-scale organizations often exceeding the $10 million mark. This surge isn't just a byproduct of more traffic; it is the result of a paradigm shift where observability has transitioned from a reactive debugging luxury into a mission-critical control plane for autonomous and agentic systems.,Modern stacks are no longer judged by the number of dashboards they host, but by their ability to provide high-cardinality insights without lighting the IT budget on fire. For the engineering leader in 2026, setting up an observability stack is less about selecting a vendor and more about architecting a unified telemetry pipeline that balances deep kernel visibility with intelligent, cost-aware data routing. Standardizing the Edge with OpenTelemetry and eBPF In 2026, OpenTelemetry (OTel) has achieved a staggering 95% adoption rate for new cloud-native instrumentations, effectively ending the era of proprietary vendor agents. The first step in a modern setup is the deployment of an OTel Collector-heavy architecture. This allows teams to decouple instrumentation from the backend, providing the flexibility to switch providers—a move 67% of IT leaders now consider within a two-year window to avoid vendor lock-in. Complementing OTel is the rise of eBPF (Extended Berkeley Packet Filter) for zero-instrumentation visibility. By 2027, it is estimated that 40% of production telemetry will be gathered at the kernel level, allowing platform teams to capture networking, syscalls, and security events without touching application code. This 'invisible' layer is crucial for monitoring the non-deterministic behaviors of agentic AI workflows that traditional tracing often misses. Breaking the Log Jam: AI-Driven Data Tiering Logs currently consume over 50% of the average observability budget, yet industry data shows that up to 80% of log volume consists of repetitive, low-value 'heartbeat' messages. The 2026 stack solves this through 'Adaptive Telemetry.' By implementing streaming aggregators like RisingWave or specialized OTel processors, organizations are now summarizing repeated patterns at the source while routing raw, high-fidelity data to low-cost object storage (S3/Azure Blob) via open formats like Apache Iceberg v3. This 'hot-cold' separation is no longer manual. AI-native observability platforms now use pattern recognition to automatically downsample 'normal' operations while instantly 'hydrating' or replaying detailed logs when an anomaly is detected. This strategy has allowed early adopters in the financial sector to reduce their annual ingestion costs by 35% without compromising their 2 am troubleshooting capabilities. From Dashboards to Decisions: The Rise of SLO-Driven AI Ops The most significant evolution in 2026 is the migration of Service Level Objectives (SLOs) from static charts to active decision-making engines. Organizations with mature observability practices now report 79% less downtime than those stuck in fragmented monitoring. This is achieved by feeding real-time telemetry directly into automated remediation loops, where AI 'collaborators' suggest or execute configuration rollbacks based on breach-risk forecasts. As we look toward 2027, observability is expanding into the 'Black Box' of GenAI. High-performing stacks now include specific instrumentation for LLM latency, token usage, and hallucination rates. By correlating these AI-specific metrics with traditional infrastructure health, teams can finally pinpoint whether a slow response is a failure of the model, the vector database, or a simple TCP timeout in the underlying Kubernetes cluster. Building an observability stack in 2026 is an exercise in strategic restraint and architectural foresight. The goal is no longer to see everything, but to ensure that the signals you do see are actionable, cost-effective, and standardized. By centering your strategy on OpenTelemetry and eBPF, you aren't just fixing today’s bugs; you are building the infrastructure required to govern the increasingly autonomous digital ecosystems of tomorrow.,As systems grow in complexity and the line between human and machine agency blurs, your telemetry will be the only source of truth that matters. The question for 2027 isn't whether you have enough data, but whether your data is smart enough to let your engineers focus on innovation rather than fire-fighting. Read the full article

#andAI-drivendatatieringareslashingMTTRandcontrollingspiralingcloudcosts.#eBPF #Masterthe2026observabilitystack.LearnhowOpenTelemetry

Modern Observability: Engineering the 2026 Telemetry Stack

The transition from simple monitoring to deep observability represents a fundamental shift in how we interpret the digital ghost in the machine. In early 2026, as distributed microservices move toward autonomous orchestration, the old guard of 'pre-defined dashboards' has crumbled under the weight of sheer complexity. Engineering teams are no longer asking if a system is up; they are hunting for the 'why' behind transient 400ms spikes that only affect 0.5% of users in specific geographic clusters.,True observability isn't a tool you buy—it's a capability you build through a telemetry-first culture. By unifying traces, metrics, and logs into a single, high-cardinality data stream, organizations can finally move past the fragmented 'war room' culture. This guide deconstructs the architectural blueprint required to gain total visibility into the modern cloud-native stack, where the distance between a code commit and a production outage is measured in microseconds. Standardizing the Foundation with OpenTelemetry and eBPF The heartbeat of any modern stack starts with OpenTelemetry (OTel), which has effectively ended the era of vendor lock-in. In the current 2026 landscape, the OTel Collector acts as the universal translator, decoupling the application's instrumentation from the backend storage. By deploying collectors as sidecars or daemonsets, SREs are capturing rich context—metadata like container IDs, commit hashes, and cloud provider regions—ensuring that every span tells a complete story. To reach the next level of granularity without the 20-30% performance tax of traditional agents, industry leaders like Netflix and Datadog have pivoted toward eBPF-based instrumentation. This 'kernel-level' visibility allows for zero-code instrumentation of the network stack and system calls. For instance, recent benchmarks show that eBPF-powered probes can capture distributed traces with less than 2% CPU overhead, making it feasible to observe high-throughput environments where every clock cycle is a precious commodity. Solving the Cardinality Crisis in Metrics Storage As systems scale, the traditional Prometheus-style metrics model often hits the 'cardinality wall'—where the sheer number of unique label combinations explodes the memory requirements of the time-series database. By 2027, the volume of telemetry data is projected to grow by 400% across enterprise DevOps teams. To survive this, the stack must incorporate sophisticated aggregators like Mimir or VictoriaMetrics, which utilize advanced sharding and persistent storage on S3-compatible backends to keep costs manageable. Smart sampling is the secret weapon of the data scientist in this space. Instead of capturing every single successful 200 OK response, elite stacks utilize 'tail-based sampling' to keep 100% of errors and high-latency traces while discarding the noise of healthy traffic. This strategy ensures that when a P99 latency spike occurs at 3 AM, the investigator has the full diagnostic trace available, rather than a thinned-out representation that hides the root cause. The Rise of Trace-to-Log Correlation The historical silos between logging and tracing are dissolving. In a high-functioning observability stack, a single 'TraceID' acts as the connective tissue that binds a distributed request across twenty different microservices. When an engineer clicks on a span in a Jaeger or Honeycomb UI, the stack should immediately surface the specific log lines emitted by that exact execution thread. This 'pivoting' capability reduces the Mean Time to Identification (MTTI) from hours to seconds. Moving into 2027, we are seeing the integration of vector-based log searching. By applying machine learning to log patterns, the stack can automatically group millions of lines of text into 'clusters' of behavior. If a new deployment introduces a subtle logic flaw, the system doesn't just alert on the error; it highlights that the error pattern is unique to the latest container image hash, effectively automating the first stage of the investigation. Operationalizing Insight with SLO-Driven Alerting The final layer of the stack is the human interface: how we consume these millions of data points without succumbing to alert fatigue. The most successful organizations have abandoned 'static threshold' alerts—like firing a page when CPU hits 80%—in favor of Service Level Objectives (SLOs) based on the user experience. By measuring the 'Error Budget,' teams can mathematically determine if a system's instability is significant enough to warrant waking up an engineer. These SLOs are powered by the very same high-cardinality data captured at the base of the stack. When the burn rate of an Error Budget accelerates, the stack automatically generates a 'snapshot' of the system state, including relevant traces, recent logs, and infrastructure metrics. This provides the 'incident responder' with a pre-packaged investigative brief, allowing them to focus on remediation rather than data gathering in the heat of a production crisis. Building an observability stack is an iterative journey toward radical transparency. As we look toward the horizon of 2027, the focus is shifting from simply seeing the system to understanding its intent. The architecture we have explored—rooted in open standards, kernel-level efficiency, and smart data sampling—is what separates the organizations that merely survive outages from those that use them as catalysts for architectural evolution.,The future belongs to the 'observable' enterprise, where the telemetry stream is treated with the same rigor as the primary application database. By investing in this foundation today, engineers gain the freedom to innovate at velocity, confident that no matter how complex the system becomes, the light of observability will always find the path back to the truth. Read the full article

#Beyondmonitoring:Adata-drivenguidetoarchitectinghigh-cardinalityobservabilitystacksusingOpenTelemetry #eBPF