PerimeterX (now HUMAN Security) is that annoying gatekeeper standing between your scraper and the data you need. You've probably hit that "Press & Hold to confirm you are a human" button more times than you'd like to admit.
This guide shows you four practical methods to bypass these verification challenges—from quick hacks to industrial-strength solutions.
What Is PerimeterX and Why Should You Care?
PerimeterX is a bot detection system that protects websites like Zillow, Upwork, and Fiverr from automated traffic. It uses fingerprinting, behavioral analysis, and machine learning to spot bots. When it thinks you're not human, you get slapped with verification challenges or outright blocks.
The system works on multiple layers:
- TLS/JA3 fingerprinting during SSL handshake
- JavaScript challenges to verify browser execution
- Browser fingerprinting (screen size, WebGL, fonts, plugins)
- Behavioral tracking (mouse movements, click patterns, session timing)
- IP reputation scoring based on your network origin
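Before reaching for a bypass, it helps to confirm that PerimeterX is actually what blocked you. The sketch below is a rough heuristic, not an official API: it assumes the challenge page contains the "Press & Hold" text or the `_pxAppId` global mentioned later in this guide, and that blocks typically come back as 403s.

```python
import requests

# Markers commonly seen on PerimeterX challenge pages (heuristic, not exhaustive)
PX_MARKERS = ("Press & Hold", "_pxAppId")

def looks_px_blocked(response: requests.Response) -> bool:
    # A 403 status or a challenge marker in the body suggests PerimeterX stepped in
    return response.status_code == 403 or any(m in response.text for m in PX_MARKERS)

resp = requests.get("https://example.com/protected-page")  # placeholder URL
if looks_px_blocked(resp):
    print("Got a challenge page instead of content; try one of the methods below")
```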
Method 1: Scrape Google Cache Instead (The Lazy Developer's Friend)
Sometimes the easiest solution is the best one. Why fight PerimeterX when Google already scraped the site for you?
How It Works
Google caches most web pages when crawling them. You can access these cached versions without triggering any anti-bot protection since you're technically scraping from Google, not the target site.
Implementation
```python
import requests
from urllib.parse import quote

def scrape_google_cache(target_url):
    # Encode the URL properly
    cache_url = f"https://webcache.googleusercontent.com/search?q=cache:{quote(target_url)}"
    response = requests.get(cache_url)

    if response.status_code == 200:
        return response.text
    elif response.status_code == 404:
        print("No cache available for this URL")
        return None
    else:
        print(f"Error: {response.status_code}")
        return None

# Example usage
html = scrape_google_cache("https://example.com/product-page")
```
Pro Tips
- Check cache freshness by looking for the timestamp in the response
- Some sites (like LinkedIn) block Google from caching their pages with `noarchive` meta tags
- For historical data, try the Wayback Machine: https://web.archive.org/web/*/YOUR_URL (see the availability-check sketch below)
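For the Wayback Machine route, the public availability API tells you whether a snapshot exists before you scrape it. A minimal sketch (error handling omitted):

```python
import requests

def latest_wayback_snapshot(target_url):
    # The availability API returns the closest archived copy, if one exists
    api = "https://archive.org/wayback/available"
    data = requests.get(api, params={"url": target_url}, timeout=10).json()
    closest = data.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest["url"]  # scrape this snapshot instead of the live page
    return None

print(latest_wayback_snapshot("https://example.com/product-page") or "No archived copy found")
```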
When to Use This
- Static content that doesn't change often
- Public data that doesn't require login
- Quick prototyping before building a proper scraper
Method 2: Fortified Headless Browsers (The Swiss Army Knife)
Default Puppeteer or Selenium screams "I'M A BOT!" to any decent anti-bot system. Let's fix that.
Puppeteer with Stealth Plugin
The puppeteer-extra-plugin-stealth plugin patches over 200 detection points. Here's a battle-tested setup:
```javascript
const puppeteer = require('puppeteer-extra');
const StealthPlugin = require('puppeteer-extra-plugin-stealth');

// Apply stealth patches
puppeteer.use(StealthPlugin());

async function bypassPerimeterX(url) {
  const browser = await puppeteer.launch({
    headless: false, // Headless mode is easier to detect
    args: [
      '--no-sandbox',
      '--disable-blink-features=AutomationControlled',
      '--disable-dev-shm-usage',
      '--disable-accelerated-2d-canvas',
      '--disable-gpu',
      '--window-size=1920,1080',
      '--start-maximized'
    ]
  });

  const page = await browser.newPage();

  // Randomize viewport to avoid fingerprinting
  await page.setViewport({
    width: 1920 + Math.floor(Math.random() * 100),
    height: 1080 + Math.floor(Math.random() * 100),
    deviceScaleFactor: 1,
    isMobile: false,
    hasTouch: false
  });

  // Set realistic user agent
  const userAgents = [
    'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36',
    'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36',
    'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36'
  ];
  await page.setUserAgent(userAgents[Math.floor(Math.random() * userAgents.length)]);

  // Mask the webdriver flag (the stealth plugin already covers this; kept as a redundant patch)
  await page.evaluateOnNewDocument(() => {
    Object.defineProperty(navigator, 'webdriver', {
      get: () => false,
    });
  });

  await page.goto(url, { waitUntil: 'networkidle0', timeout: 30000 });

  // Random delay to mimic human reading
  await page.waitForTimeout(2000 + Math.random() * 3000);

  const content = await page.content();
  await browser.close();
  return content;
}
```
Python with Undetected ChromeDriver
For Python developers, undetected-chromedriver is your best friend:
```python
import undetected_chromedriver as uc
from selenium.webdriver.common.action_chains import ActionChains
import random
import time

def bypass_with_selenium(url):
    options = uc.ChromeOptions()

    # Randomize window size
    width = random.randint(1200, 1920)
    height = random.randint(800, 1080)
    options.add_argument(f'--window-size={width},{height}')

    # Additional anti-detection arguments
    options.add_argument('--disable-blink-features=AutomationControlled')
    options.add_experimental_option("excludeSwitches", ["enable-automation"])
    options.add_experimental_option('useAutomationExtension', False)

    # version_main should match the major version of your installed Chrome
    driver = uc.Chrome(options=options, version_main=120)

    # Execute a CDP command so every new document hides the webdriver flag
    # (uc already patches this at the driver level; kept for completeness)
    driver.execute_cdp_cmd("Page.addScriptToEvaluateOnNewDocument", {
        "source": "Object.defineProperty(navigator, 'webdriver', {get: () => undefined})"
    })

    driver.get(url)

    # Random human-like delay
    time.sleep(random.uniform(3, 7))

    # Simulate mouse movement
    action = ActionChains(driver)
    action.move_by_offset(random.randint(100, 500), random.randint(100, 500))
    action.perform()

    html = driver.page_source
    driver.quit()
    return html
```
Method 3: Residential Proxies + Request Optimization (The Scalable Solution)
Sometimes you don't need a full browser—just better networking.
Smart Proxy Rotation
```python
import requests
from itertools import cycle
import random
import time

class SmartProxyRotator:
    def __init__(self, proxy_list):
        self.proxies = cycle(proxy_list)
        self.session = requests.Session()

    def get_with_retry(self, url, max_retries=3):
        headers = {
            'User-Agent': self._get_random_ua(),
            'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
            'Accept-Language': 'en-US,en;q=0.5',
            'Accept-Encoding': 'gzip, deflate',
            'DNT': '1',
            'Connection': 'keep-alive',
            'Upgrade-Insecure-Requests': '1'
        }

        for attempt in range(max_retries):
            proxy = next(self.proxies)
            try:
                # Add random delay between requests
                time.sleep(random.uniform(1, 3))

                response = self.session.get(
                    url,
                    headers=headers,
                    proxies={'http': proxy, 'https': proxy},
                    timeout=10
                )

                if response.status_code == 200:
                    return response
            except Exception as e:
                print(f"Proxy {proxy} failed: {e}")
                continue

        return None

    def _get_random_ua(self):
        user_agents = [
            'Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:120.0) Gecko/20100101 Firefox/120.0',
            'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/17.0 Safari/605.1.15',
            'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36'
        ]
        return random.choice(user_agents)

# Usage
proxies = [
    'http://user:pass@residential-proxy1.com:8080',
    'http://user:pass@residential-proxy2.com:8080',
    # Add more residential proxies
]

rotator = SmartProxyRotator(proxies)
response = rotator.get_with_retry('https://protected-site.com')
```
curl-impersonate for Perfect TLS Fingerprinting
```bash
# Install curl-impersonate
wget https://github.com/lwthiker/curl-impersonate/releases/latest/download/curl-impersonate-chrome
```

```python
# Use it in Python
import subprocess

def fetch_with_curl_impersonate(url):
    cmd = [
        './curl-impersonate-chrome',
        url,
        '-H', 'User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36',
        '--compressed',
        '--tlsv1.2',
        '--http2'
    ]
    result = subprocess.run(cmd, capture_output=True, text=True)
    return result.stdout
```
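If shelling out to a binary feels clunky, the curl_cffi package exposes the same impersonation through a requests-like Python API. A minimal sketch; the exact impersonation target names depend on the curl_cffi version you install:

```python
# pip install curl_cffi
from curl_cffi import requests as cffi_requests

def fetch_with_curl_cffi(url):
    # impersonate aligns the TLS and HTTP/2 fingerprint with a real Chrome build;
    # "chrome110" is one of the bundled targets, newer releases add newer ones
    response = cffi_requests.get(url, impersonate="chrome110", timeout=15)
    return response.text

html = fetch_with_curl_cffi("https://protected-site.com")
```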
Method 4: Reverse Engineering the Challenge (The Nuclear Option)
For the brave souls who want to understand what's really happening under the hood.
Analyzing PerimeterX's JavaScript
```javascript
// Intercept and analyze the PerimeterX challenge
const analyzeChallenge = async (page) => {
  // Hook into PerimeterX's sensor data collection
  await page.evaluateOnNewDocument(() => {
    let originalSensor = window._pxAppId;
    Object.defineProperty(window, '_pxAppId', {
      get: function() {
        console.log('PerimeterX App ID accessed');
        return originalSensor;
      },
      set: function(value) {
        console.log('PerimeterX initialized with:', value);
        originalSensor = value;
      }
    });

    // Monitor sensor data collection
    const originalXHR = window.XMLHttpRequest;
    window.XMLHttpRequest = function() {
      const xhr = new originalXHR();
      const originalOpen = xhr.open;
      xhr.open = function(method, url) {
        if (url.includes('/api/v2/collector')) {
          console.log('PerimeterX collecting data:', url);
        }
        return originalOpen.apply(xhr, arguments);
      };
      return xhr;
    };
  });
};
```
Generating Valid PerimeterX Cookies
```python
import hashlib
import base64
import time
import json

class PerimeterXSolver:
    def __init__(self, app_id):
        self.app_id = app_id
        self.uuid = self._generate_uuid()

    def _generate_uuid(self):
        # Generate a device UUID in the same shape PerimeterX uses
        timestamp = str(int(time.time() * 1000))
        random_data = hashlib.md5(timestamp.encode()).hexdigest()
        return f"{random_data[:8]}-{random_data[8:12]}-{random_data[12:16]}-{random_data[16:20]}-{random_data[20:32]}"

    def generate_sensor_data(self):
        # Simulate sensor data collection
        sensor = {
            "PX": self.app_id,
            "uuid": self.uuid,
            "ts": int(time.time() * 1000),
            "navigator": {
                "webdriver": False,
                "plugins": ["Chrome PDF Plugin", "Native Client", "Chrome PDF Viewer"],
                "languages": ["en-US", "en"]
            },
            "screen": {
                "width": 1920,
                "height": 1080,
                "colorDepth": 24
            }
        }

        # Encode sensor data
        encoded = base64.b64encode(json.dumps(sensor).encode()).decode()
        return f"_px3={encoded}"
```
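To use such a cookie, attach it to your HTTP session before requesting a protected page. The sketch below is illustrative only: real _px3 values are produced by heavily obfuscated JavaScript and validated server-side, so this simplified payload is a starting point for experimentation rather than a drop-in bypass (the app ID and domain are placeholders):

```python
import requests

solver = PerimeterXSolver(app_id="PXxxxxxxxx")   # placeholder app ID
name, value = solver.generate_sensor_data().split("=", 1)

session = requests.Session()
# Real _px3 cookies are checked server-side; expect this simplified payload
# to be rejected until the full sensor and signing logic is replicated
session.cookies.set(name, value, domain="protected-site.com")
response = session.get("https://protected-site.com/some-page")
print(response.status_code)
```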
Bonus: Behavioral Mimicry
The secret sauce that makes everything work better:
```javascript
// Realistic mouse movement using eased interpolation
async function humanLikeMouseMove(page, fromX, fromY, toX, toY) {
  const steps = 20;
  for (let i = 0; i <= steps; i++) {
    const t = i / steps;
    // Smoothstep easing for natural acceleration and deceleration
    const x = fromX + (toX - fromX) * t * t * (3 - 2 * t);
    const y = fromY + (toY - fromY) * t * t * (3 - 2 * t);
    await page.mouse.move(x, y);
    await page.waitForTimeout(Math.random() * 50);
  }
}

// Random scrolling patterns
async function humanScroll(page) {
  const scrolls = Math.floor(Math.random() * 5) + 2;
  for (let i = 0; i < scrolls; i++) {
    const distance = Math.floor(Math.random() * 300) + 100;
    await page.evaluate((dist) => {
      window.scrollBy({
        top: dist,
        behavior: 'smooth'
      });
    }, distance);
    await page.waitForTimeout(1000 + Math.random() * 2000);
  }
}
```
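The same behavioral tricks port to the Selenium setup from Method 2 using standard Selenium APIs. A sketch (step counts, offsets, and bounds are arbitrary and assume a desktop-sized viewport):

```python
import random
import time
from selenium.webdriver.common.action_chains import ActionChains

def human_scroll(driver):
    # Scroll in a few uneven steps with pauses, mirroring the JS version above
    for _ in range(random.randint(2, 6)):
        distance = random.randint(100, 400)
        driver.execute_script("window.scrollBy({top: arguments[0], behavior: 'smooth'});", distance)
        time.sleep(random.uniform(1, 3))

def human_mouse_wiggle(driver, steps=15):
    chain = ActionChains(driver)
    x, y = 200, 200
    chain.move_by_offset(x, y)  # step away from the (0, 0) origin first
    for _ in range(steps):
        dx, dy = random.randint(-30, 30), random.randint(-20, 20)
        # Keep the pointer well inside the viewport to avoid out-of-bounds errors
        dx = dx if 50 < x + dx < 800 else -dx
        dy = dy if 50 < y + dy < 500 else -dy
        x, y = x + dx, y + dy
        chain.move_by_offset(dx, dy)
        chain.pause(random.uniform(0.02, 0.08))
    chain.perform()
```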
Which Method Should You Choose?
| Method | Cost | Difficulty | Success Rate | Best For |
|---|---|---|---|---|
| Google Cache | Free | Easy | 60% | Static content, prototyping |
| Fortified Browsers | Low-Medium | Medium | 85% | Small-medium scale scraping |
| Residential Proxies | High | Easy | 90% | Large scale, parallel scraping |
| Reverse Engineering | Time investment | Hard | 95%+ | Enterprise solutions |
Final Tips
- Mix methods: Use Google Cache for initial testing, then upgrade to browsers or proxies for production
- Rotate everything: User agents, proxies, browser fingerprints, timing patterns
- Act human: Add random delays, mouse movements, and scrolling
- Monitor success rates: Track what works and adapt quickly
- Stay updated: Anti-bot systems evolve weekly—your bypasses should too
Remember: The goal isn't to build the perfect undetectable bot (impossible), but to make detection so expensive and error-prone that sites won't bother blocking you.
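As a concrete take on the "monitor success rates" tip, even a tiny in-process counter beats guessing which method is degrading. A minimal sketch (the method names are placeholders):

```python
from collections import defaultdict

class BypassStats:
    """Tiny tracker so you notice when a bypass method starts failing."""
    def __init__(self):
        self.attempts = defaultdict(int)
        self.successes = defaultdict(int)

    def record(self, method, ok):
        self.attempts[method] += 1
        self.successes[method] += int(ok)

    def report(self):
        for method, total in self.attempts.items():
            print(f"{method}: {self.successes[method] / total:.0%} over {total} attempts")

stats = BypassStats()
stats.record("google_cache", True)
stats.record("stealth_browser", False)
stats.report()
```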
Quick Start Kit
For those who want to get running immediately, here's a minimal working setup:
```bash
# Install dependencies
npm install puppeteer-extra puppeteer-extra-plugin-stealth
pip install undetected-chromedriver requests

# Clone starter scripts
git clone https://github.com/your-repo/perimeterx-bypass-kit
cd perimeterx-bypass-kit

# Run test
node test-bypass.js https://example.com
```
The arms race between scrapers and anti-bot systems never ends. But with these techniques in your toolkit, you'll stay ahead of the curve. Just remember: with great scraping power comes great responsibility—respect robots.txt, rate limits, and terms of service where reasonable.