点此免费加入Python网络爬虫学习交流QQ群:428518750

在加载页面时,可能页面上图片比较多,而我们又不关心图片内容。那么,在加载页面时,可以选择不加载图片,以提高页面加载速度。

示例代码:

from playwright.sync_api import Playwright, sync_playwright, expect
import re

def cancel_request(route,request):
    route.abort()

def run(playwright: Playwright) -> None:
    browser = playwright.chromium.launch(headless=False)
    context = browser.new_context()
    page = context.new_page()
    page.route(re.compile(r"(\.png)|(\.jpg)"),cancel_request)
    page.goto("https://www.baidu.com/")
    
    page.wait_for_timeout(20000)
    
    page.close()
    context.close()
    browser.close()

with sync_playwright() as playwright:
    run(playwright)

执行上面的代码,可以看到百度搜索框上的LOGO图片并没有加载出来:

点此免费加入Python网络爬虫学习交流QQ群:428518750

picture loss