selenium – Max的程式語言筆記

chrome cdp Input.dispatchKeyEventchrome

max-stackoverflow — Sat, 06 Apr 2024 04:58:40 +0000

由於 nodriver 暫時還無法送出 Enter，解法：

await tab.send(cdp.input_.dispatch_key_event("keyDown", code="Enter", key="Enter", text="\r", windows_virtual_key_code=13))
await tab.send(cdp.input_.dispatch_key_event("keyUp", code="Enter", key="Enter", text="\r", windows_virtual_key_code=13))

nodriver 程式碼：
https://github.com/ultrafunkamsterdam/nodriver/blob/main/nodriver/cdp/input_.py

DrissionPage 程式碼：
DrissionPage/DrissionPage/_functions/keys.py
https://github.com/g1879/DrissionPage/blob/master/DrissionPage/_functions/keys.py

def send_key(page, modifier, key):
    """发送一个字，在键盘中的字符触发按键，其它直接发送文本"""
    if key not in keyDefinitions:
        page.run_cdp('Input.insertText', text=key, _ignore=AlertExistsError)

    else:
        description = keyDescriptionForString(modifier, key)
        text = description['text']
        data = {'type': 'keyDown' if text else 'rawKeyDown',
                'modifiers': modifier,
                'windowsVirtualKeyCode': description['keyCode'],
                'code': description['code'],
                'key': description['key'],
                'text': text,
                'autoRepeat': False,
                'unmodifiedText': text,
                'location': description['location'],
                'isKeypad': description['location'] == 3,
                '_ignore': AlertExistsError}

        page.run_cdp('Input.dispatchKeyEvent', **data)
        data['type'] = 'keyUp'
        page.run_cdp('Input.dispatchKeyEvent', **data)

Google Chrome memory saver – command line switch?

max-stackoverflow — Thu, 04 Apr 2024 22:28:29 +0000

如何手動啟用chrome 瀏覽器「Memory Saver」, 解法：
https://stackoverflow.com/questions/76938654/google-chrome-memory-saver-command-line-switch

The setting seems to be here: "C:\Users\\AppData\Local\Google\Chrome\User Data\Local State"

{
    "autofill": {
        "states_data_dir": "C:\\Users\\\\AppData\\Local\\Google\\Chrome\\User Data\\AutofillStates\\2020.11.2.164946"
    },
    "background_mode": {
        "enabled": false
    },
    "browser": {
        "enabled_labs_experiments": [
            "memory-saver-multi-state-mode@1"
        ],
        "has_shown_refresh_2023_whats_new": true,
        "last_redirect_origin": "",
        "last_whats_new_version": 119,
        "shortcut_migration_version": "86.0.4240.75"
    },
    "data_use_measurement": {
        "data_used": {
            "services": {
                "background": {},
                "foreground": {}
.....

With the setting on Default it shows: "browser":{"enabled_labs_experiments":[]

Enabled shows: "enabled_labs_experiments": ["memory-saver-multi-state-mode@1"]

寫入 Local Data 的範例程式碼：

def nodriver_overwrite_prefs(conf):
    state_filepath = os.path.join(conf.user_data_dir,"Local State")
    state_dict = {}
    state_dict["performance_tuning"]={}
    state_dict["performance_tuning"]["high_efficiency_mode"]={}
    state_dict["performance_tuning"]["high_efficiency_mode"]["state"]=1
    state_dict["browser"]={}
    state_dict["browser"]["enabled_labs_experiments"]=[
        "memory-saver-multi-state-mode@1",
        "modal-memory-saver@1"
    ]
    json_str = json.dumps(state_dict)
    with open(state_filepath, 'w') as outfile:
        outfile.write(json_str)

How to change preferences in Chrome by modifying files?

max-stackoverflow — Thu, 04 Apr 2024 20:07:34 +0000

如何修改 chrome 的預設參數，資料來源：
https://superuser.com/questions/554233/how-to-change-preferences-in-chrome-by-modifying-files

解法：

There is a file called “Preferences” within the “User Data/” folder that appears to contain these settings. The location of this file varies according to OS. For the “Default” profile this is located at:

WinXP:

C:\Documents and Settings\\Local Settings\Application Data\Google\Chrome\User Data\Default\Preferences

WinVista:

C:\Users\\AppData\Local\Google\Chrome\User Data\Default\Preferences

You then need to search for the appropriate setting in that file. I would close Chrome (and backup) first as this file appears to be updated automatically as you navigate tabs.

“Enable Auto-fill to fill in web forms in a single click.” appears to be stored here:

   "autofill": {
      "enabled": true,

“Offer to save passwords I enter on the web.“

      "password_manager_enabled": true,

Deploy initial preferences:
https://support.google.com/chrome/a/answer/187948?hl=en#zippy=%2Cstep-create-the-initial-preferences-file

寫入 Preferences 的程式碼：

def nodriver_overwrite_prefs(conf, prefs_dict={}):
    prefs_filepath = os.path.join(conf.user_data_dir,"Default")
    if not os.path.exists(prefs_filepath):
        os.mkdir(prefs_filepath)
    prefs_filepath = os.path.join(prefs_filepath,"Preferences")
    prefs_dict["profile"]={}
    prefs_dict["profile"]["name"]=CONST_APP_VERSION
    prefs_dict["profile"]["password_manager_enabled"]=False
    json_str = json.dumps(prefs_dict)
    with open(prefs_filepath, 'w') as outfile:
        outfile.write(json_str)

selenium extension from unknown error: cannot read manifest

max-stackoverflow — Mon, 25 Dec 2023 05:03:42 +0000

在寫好 chrome extension 後, 透過 selenium 的 chrome_options.add_extension(ext), 測試其他的 extension 都正常, 但自己寫的 extension 會顯示錯誤訊息:

from unknown error: cannot read manifest

執行畫面:

發生的原因的確是無法讀取 manifest.json , 因為我直覺地直接壓縮目錄為 zip 檔, 應該要進去目錄裡再壓縮, 在多一層資料夾的情況下, 在解壓縮zip 後的根目錄是無法取得 manifest.json.

selenium 非同步 execute script 用法

max-stackoverflow — Fri, 15 Dec 2023 15:18:14 +0000

直接執行 js ，就用 execute_script() 就可以。

從 selenium 的source code 來看，有3種執行script 的方式，其中2個是非同步：
https://github.com/SeleniumHQ/selenium/blob/trunk/py/selenium/webdriver/remote/command.py

W3C_EXECUTE_SCRIPT: str = "w3cExecuteScript"
W3C_EXECUTE_SCRIPT_ASYNC: str = "w3cExecuteScriptAsync"
EXECUTE_ASYNC_SCRIPT: str = "executeAsyncScript"

範例python script

script = """
var callback = arguments[arguments.length - 1]; 
window.setTimeout(function(){ callback('timeout') }, 3000);
"""
driver.execute_async_script(script)

範例2號：

js = """var t = JSON.parse(Cookies.get("user")) ? JSON.parse(Cookies.get("user")).access_token : "";
fetch("%s",{headers: {
authorization: "Bearer ".concat(t)
}}).then(function (response) {
return response.json();
}).then(function (data) {
console.log(data);
if(data.result.product.length>0)
if(data.result.product[0].status=="pending") {
console.log("pending, start to reload");
location.reload();
}
}).catch(function (err){
console.log(err);
});
""" % getSeatsByTicketAreaIdUrl
driver.set_script_timeout(0.1)
driver.execute_async_script(js)

說明：經測試，用使 jQuery 的 ajax 或是使用 fetch 的 promise 都無法在 ajax 傳回的那一個區塊使用下面的程式碼傳回資料：

var callback = arguments[arguments.length - 1];
callback(data);

variable scope 的問題，var callback 才需在 global scope 才可以成功地傳回值，不然 driver.execute_async_script() 的 javascript script 不論怎麼執行都會等到 timeout 也沒資料回傳。

範例3號，常見 OCR 的：

driver.set_script_timeout(1)
form_verifyCode_base64 = driver.execute_async_script("""
    var canvas = document.createElement('canvas');
    var context = canvas.getContext('2d');
    var img = document.getElementById('%s');
    if(img!=null) {
    canvas.height = img.naturalHeight;
    canvas.width = img.naturalWidth;
    context.drawImage(img, 0, 0);
    callback = arguments[arguments.length - 1];
    callback(canvas.toDataURL()); }
    """ % (image_id))
if not form_verifyCode_base64 is None:
    img_base64 = base64.b64decode(form_verifyCode_base64.split(',')[1])

Selecting all text in textarea using Python Selenium

max-stackoverflow — Fri, 24 Nov 2023 08:08:51 +0000

在 python 的 selenium 裡全選後,輸入內容的用法如下:

builder = ActionChains(driver)
builder.move_to_element(el_text)
builder.click(el_text)
if platform.system() == 'Darwin':
    builder.key_down(Keys.COMMAND)
else:
    builder.key_down(Keys.CONTROL)
builder.send_keys("a")
if platform.system() == 'Darwin':
    builder.key_up(Keys.COMMAND)
else:
    builder.key_up(Keys.CONTROL)
builder.send_keys(val)
if submit:
    builder.send_keys(Keys.ENTER)
builder.perform()

在 selenium 裡有 web element 程式碼在:
py/selenium/webdriver/remote/webelement.py

source code:

def send_keys(self, *value) -> None:
    """Simulates typing into the element.

    :Args:
        - value - A string for typing, or setting form fields.  For setting
          file inputs, this could be a local file path.

    Use this to send simple key events or to fill out form fields::

        form_textfield = driver.find_element(By.NAME, 'username')
        form_textfield.send_keys("admin")

    This can also be used to set file inputs.

    ::

        file_input = driver.find_element(By.NAME, 'profilePic')
        file_input.send_keys("path/to/profilepic.gif")
        # Generally it's better to wrap the file path in one of the methods
        # in os.path to return the actual path to support cross OS testing.
        # file_input.send_keys(os.path.abspath("path/to/profilepic.gif"))
    """
    # transfer file to another machine only if remote driver is used
    # the same behaviour as for java binding
    if self.parent._is_remote:
        local_files = list(
            map(
                lambda keys_to_send: self.parent.file_detector.is_local_file(str(keys_to_send)),
                "".join(map(str, value)).split("\n"),
            )
        )
        if None not in local_files:
            remote_files = []
            for file in local_files:
                remote_files.append(self._upload(file))
            value = "\n".join(remote_files)

    self._execute(
        Command.SEND_KEYS_TO_ELEMENT, {"text": "".join(keys_to_typing(value)), "value": keys_to_typing(value)}
    )

網路上流傳的古時候的錯誤用法1:

src_elem.click()
src_elem.send_keys(Keys.CONTROL, 'a') # select all the text
src_elem.send_keys(Keys.CONTROL, 'c') # copy it

網路上流傳的古時候的錯誤用法2:

String selectAll = Keys.chord(Keys.CONTROL, "a");
element.sendKeys(selectAll);

或

Try to chord the Ctrl+A keys. The code below is working in my case:

element.sendKeys(Keys.chord(Keys.CONTROL, "a"));

以上3個, 是錯誤示範, 會顯示 chord 並不存在。

Blocking API/URL/CSS in Selenium 4

max-stackoverflow — Fri, 03 Nov 2023 03:57:12 +0000

常見的Google 服務會拖慢 selenium 效能, 除了使用 adblock plus 來 block connection 也可以設定在 selenium 裡.

範例 1:
https://stackoverflow.com/questions/46891301/can-i-automate-chrome-request-blocking-using-selenium-webdriver-for-ruby

driver.execute_cdp_cmd('Network.setBlockedURLs', {"urls": ["www.baidu.com"]})
driver.execute_cdp_cmd('Network.enable', {})

範例 2:
https://github.com/ultrafunkamsterdam/undetected-chromedriver/issues/387

import undetected_chromedriver as uc
driver = uc.Chrome()
driver.execute_cdp_cmd('Network.setBlockedURLs', {"urls": ['*png','*woff2','*woff','*jpg','https://www.apple.com/ac/globalnav/7/en_US/styles/ac-globalnav.built.css']})
driver.execute_cdp_cmd('Network.enable', {})
driver.get('https://apple.com/')

Java 版語法:

import org.junit.Test;
import org.openqa.selenium.chrome.ChromeDriver;
import org.openqa.selenium.devtools.DevTools;
import org.openqa.selenium.devtools.v94.network.Network;

public class BlockURL {
      @Test
      public void blockUrl() {
            System.setProperty("webdriver.chrome.driver", "path to chromedriver");
            ChromeDriver driver = new ChromeDriver();
            DevTools devTool = driver.getDevTools();
            devTool.createSession();
            devTool.send(Network.enable(Optional.empty(), Optional.empty(), Optional.empty()));
// Blocks all css files
            devTool.send(Network.setBlockedURLs(List.of("*.css"))); 
            devTool.addListener(Network.loadingFailed(), loadingFailed -> {
                  System.out.println("Blocking reason: " + loadingFailed.getBlockedReason().get());

            });

            driver.get("https://url.com");
      }
}

HTTP Proxy Authentication with Chromedriver in Selenium

max-stackoverflow — Tue, 31 Oct 2023 02:11:46 +0000

如果不需要帳號/密碼, 是很快就解決

Setting chromedriver proxy with Selenium using Python

If you need to use a proxy with python and Selenium library with chromedriver you usually use the following code (Without any username and password:

chrome_options = webdriver.ChromeOptions()
chrome_options.add_argument('--proxy-server=%s' % hostname + ":" + port)
driver = webdriver.Chrome(chrome_options=chrome_options)

It works fine unless proxy requires authentication. if the proxy requires you to log in with a username and password it will not work. In this case, you have to use more tricky solution that is explained below. By the way, if you whitelist your server IP address from the proxy provider or server it should not ask proxy credentials.

But, 大多數的proxy 應該要設帳號/密碼才合理, 因為不可能伺服器開在那, 給不認識的人使用, 解法:
https://stackoverflow.com/questions/55582136/how-to-set-proxy-with-authentication-in-selenium-chromedriver-python

HTTP Proxy Authentication with Chromedriver in Selenium

To set up proxy authentication we will generate a special file and upload it to chromedriver dynamically using the following code below. This code configures selenium with chromedriver to use HTTP proxy that requires authentication with user/password pair.

import os
import zipfile

from selenium import webdriver

PROXY_HOST = '192.168.3.2'  # rotating proxy or host
PROXY_PORT = 8080 # port
PROXY_USER = 'proxy-user' # username
PROXY_PASS = 'proxy-password' # password


manifest_json = """
{
    "version": "1.0.0",
    "manifest_version": 2,
    "name": "Chrome Proxy",
    "permissions": [
        "proxy",
        "tabs",
        "unlimitedStorage",
        "storage",
        "",
        "webRequest",
        "webRequestBlocking"
    ],
    "background": {
        "scripts": ["background.js"]
    },
    "minimum_chrome_version":"22.0.0"
}
"""

background_js = """
var config = {
        mode: "fixed_servers",
        rules: {
        singleProxy: {
            scheme: "http",
            host: "%s",
            port: parseInt(%s)
        },
        bypassList: ["localhost"]
        }
    };

chrome.proxy.settings.set({value: config, scope: "regular"}, function() {});

function callbackFn(details) {
    return {
        authCredentials: {
            username: "%s",
            password: "%s"
        }
    };
}

chrome.webRequest.onAuthRequired.addListener(
            callbackFn,
            {urls: [""]},
            ['blocking']
);
""" % (PROXY_HOST, PROXY_PORT, PROXY_USER, PROXY_PASS)


def get_chromedriver(use_proxy=False, user_agent=None):
    path = os.path.dirname(os.path.abspath(__file__))
    chrome_options = webdriver.ChromeOptions()
    if use_proxy:
        pluginfile = 'proxy_auth_plugin.zip'

        with zipfile.ZipFile(pluginfile, 'w') as zp:
            zp.writestr("manifest.json", manifest_json)
            zp.writestr("background.js", background_js)
        chrome_options.add_extension(pluginfile)
    if user_agent:
        chrome_options.add_argument('--user-agent=%s' % user_agent)
    driver = webdriver.Chrome(
        os.path.join(path, 'chromedriver'),
        chrome_options=chrome_options)
    return driver

def main():
    driver = get_chromedriver(use_proxy=True)
    #driver.get('https://www.google.com/search?q=my+ip+address')
    driver.get('https://httpbin.org/ip')

if __name__ == '__main__':
    main()

Function get_chromedriver returns configured selenium webdriver that you can use in your application. This code is tested and works just fine.

Read more about onAuthRequired event in Chrome.

using python, Remove HTML tags/formatting from a string

max-stackoverflow — Fri, 06 Oct 2023 09:15:26 +0000

在使用 selenium 時, 之前使用 element.text 都可以正確地取得 TEXT 內容, 很奇怪, 目前使用 selenium 4.13.0 + python 3.9.13 在 Win 10 環境, 有時候會正常, 但有時會失敗, 可以確定取得的內容是正確的, 取得的 innerHTML 長這樣:


                2023 JO1 1ST ASIA TOUR 'BEYOND THE DARK' LIMITED EDITION IN TAIPEI
              
日期
2023-11-11(六)
時間
19:00
地點

                    Zepp New Taipei
                    
                      新北市新莊區新北大道四段3號8樓

使用 .text 取得內容, 居然是空值! 但多試幾次, 偶爾可以取得正確的 text, 既然可以拿 innerHTML 就自己來去 tag 就好了.

import re
def striphtml(data):
    p = re.compile(r'<.*?>')
    return p.sub('', data)

>>> striphtml('I Want This text!')
'I Want This text!'

Below you will find the syntax which require as per different binding. Change the innerHTML to outerHTML as per required.

Python:

element.get_attribute('innerHTML')

[Python] yield 和 return 有什麼不同?

max-stackoverflow — Fri, 04 Aug 2023 20:15:10 +0000

使用Python很多年常常看到 yield，用這篇文將yield這個關鍵字重點整理一下。覺得整理的比較易懂的是這一篇：

How to Use Generators and yield in Python
https://realpython.com/introduction-to-python-generators/

要列舉 Generators 裡的項目：

letters = ["a", "b", "c", "y"]
it = iter(letters)
while True:
    try:
        letter = next(it)
    except StopIteration:
        break
    print(letter)

也可以用 for 來列舉，會簡單很多：

letters = ["a", "b", "c", "y"]
it = iter(letters)
for letter in it:
    print(letter)

yield 2 次，就可以 next 2 次：

>>> def multi_yield():
...     yield_str = "This will print the first string"
...     yield yield_str
...     yield_str = "This will print the second string"
...     yield yield_str
...
>>> multi_obj = multi_yield()
>>> print(next(multi_obj))
This will print the first string
>>> print(next(multi_obj))
This will print the second string
>>> print(next(multi_obj))
Traceback (most recent call last):
  File "", line 1, in 
StopIteration

yield和return一樣會回傳值，不過yield會記住上次執行的位置

yield在下次迭代時會從上次迭代的下一行接續執行，一直執行到下一個yield出現，如果沒有下一個yield則結束這個生成器。

這個神奇例子：

def yield_test(n):
    print("start n =", n)
    for i in range(n):
        yield i*i
        print("i =", i)

    print("end")

tests = yield_test(5)
for test in tests:
    print("test =", test)
    print("--------")

執行結果：

start n = 5
test = 0
--------
i = 0
test = 1
--------
i = 1
test = 4
--------
i = 2
test = 9
--------
i = 3
test = 16
--------
i = 4
end

可以解釋為遇到 yield 時，程式就會「暫時」被結束返回且相等於 return 會傳回值，但程式本身還保留在記憶體中，等下一次被 next() 呼叫，或 while 與 for 下一個 iter() 時，接續之前 yield 之後的程式碼。

目前我遇到的程式碼在這：
https://github.com/ultrafunkamsterdam/undetected-chromedriver/blob/1c704a71cf4f29181a59ecf19ddff32f1b4fbfc0/undetected_chromedriver/init.py#L716

def find_elements_recursive(self, by, value):
    """
    find elements in all frames
    this is a generator function, which is needed
        since if it would return a list of elements, they
        will be stale on arrival.
    using generator, when the element is returned we are in the correct frame
    to use it directly
    Args:
        by: By
        value: str
    Returns: Generator[webelement.WebElement]
    """
    def search_frame(f=None):
        if not f:
            # ensure we are on main content frame
            self.switch_to.default_content()
        else:
            self.switch_to.frame(f)
        for elem in self.find_elements(by, value):
            yield elem
        # switch back to main content, otherwise we will get StaleElementReferenceException
        self.switch_to.default_content()

    # search root frame
    for elem in search_frame():
        yield elem
    # get iframes
    frames = self.find_elements('css selector', 'iframe')

    # search per frame
    for f in frames:
        for elem in search_frame(f):
            yield elem