Design, implement, and optimize low-level systems for large-scale web data collection. You will work beneath abstraction layers to develop highly efficient, adaptive scraping techniques that can navigate anti-bot protections, manipulate network protocols, and modify browser behavior at a deep level. Contract positions available.

We are looking for someone with hands-on experience in browser instrumentation, network security, and kernel-level systems, with a focus on performance, resilience, and scalability.

Key Responsibilities
Architect and maintain a high-performance, scalable web scraping infrastructure.
Implement custom browser instrumentation using Chrome DevTools Protocol (CDP), WebDriver Wire Protocol, and related technologies.
Develop low-level networking solutions, including TCP/IP socket programming, TLS handshake customization, and QUIC protocol manipulation.
Reverse-engineer and bypass anti-bot detection mechanisms (e.g., Cloudflare, Akamai) using network traffic normalization, JA3/JA3S fingerprint customization, and entropy-based behavior randomization.
Modify and optimize browser engines (e.g., V8, SpiderMonkey) for efficient headless rendering and fingerprint evasion.
Implement containerized and distributed crawling solutions with deep knowledge of cgroups, namespaces, and container orchestration internals.
Develop high-throughput data extraction and parsing systems, optimizing XPath queries, DOM mutation tracking, and real-time content differentiation.
Build self-healing, resilient infrastructure for continuous, adaptive data collection at scale.
Utilize AI/ML techniques to enhance dynamic crawling, adaptive evasion strategies, and anomaly detection in blocking events.

What We’re Looking For
Deep expertise in low-level systems (Linux internals, network protocols, kernel-level networking).
Hands-on experience with browser technologies (CDP, Firefox Remote Debugging, WebDriver, Blink engine).
Proficiency in advanced networking techniques, including TLS fingerprint customization, raw TCP/IP socket programming, and traffic pattern normalization.
Strong security background, with experience in bypassing anti-bot mechanisms and evading detection.
Experience with distributed and containerized infrastructure, including Kubernetes, Terraform, and eBPF.
Familiarity with AI/ML-driven automation techniques for web data extraction.
Ability to operate beneath abstraction layers, designing custom solutions for challenging scraping environments.

Job Title
Senior Software Engineer