Wild Spider

web pages are crawled by being loaded into browser using multiple tabs parallelly

Vad är Wild Spider?

Wild Spider är en Chrome-tillägg utvecklad av Xuan Wu, och dess huvudfunktion är "web pages are crawled by being loaded into browser using multiple tabs parallelly".

Tilläggsskärmbilder

screenshot

Ladda ner Wild Spider-förlängningens CRX-fil

Ladda ner Wild Spider-filändelser i crx-format, installera Chrome-tillägg manuellt i webbläsaren eller dela crx-filerna med vänner för att enkelt installera Chrome-tillägg.

Användarmanual för Tillägg

                        WATCH OUT: more tabs you use, more computer resources (CPU, memory) will be used, and each page costs a bit disk to save the content (in IndexedDb, accessible from extensions -> Inspect views: background page).

The "spider" works in this way:
1) The current url is used as the starting point, and it's loaded again in a new tab.
2) After this page is loaded, fetch all the links on the page.
3) Get all the links on the page, including relative urls.
4) Open the extracted link parallelly in all the tabs used (by default 3, set in eventPage).
5) repeat 2-4

All source code at: https://github.com/nobodxbodon/ChromeCrawlerWildSpider                    

Grundläggande Information om Tillägg

Namn Wild Spider Wild Spider
ID aanpchnfojihjddlocpgoekffmjkhbbe
Officiell webbadress https://chromewebstore.google.com/detail/wild-spider/aanpchnfojihjddlocpgoekffmjkhbbe
Beskrivning web pages are crawled by being loaded into browser using multiple tabs parallelly
Filstorlek 121 KB
Antal Installationer 44
Aktuell Version 0.0.3
Senast Uppdaterad 2019-03-08
Publiceringsdatum 2019-03-08
Betyg 1.00/5 Totalt 1 Betyg
Utvecklare Xuan Wu
Betalningssätt free
Tilläggswebbplats https://github.com/nobodxbodon/ChromeCrawlerWildSpider
Hjälpsida URL https://github.com/nobodxbodon/ChromeCrawlerWildSpider/issues
Stödda Språk en-US
manifest.json
{
    "update_url": "https:\/\/clients2.google.com\/service\/update2\/crx",
    "name": "Wild Spider",
    "short_name": "demo web crawler that's still in experimenting",
    "description": "web pages are crawled by being loaded into browser using multiple tabs parallelly",
    "version": "0.0.3",
    "browser_action": {
        "default_icon": "icon.png"
    },
    "permissions": [
        "tabs",
        "activeTab",
        "webNavigation"
    ],
    "background": {
        "scripts": [
            "Dexie.js",
            "eventPage.js"
        ],
        "persistent": false
    },
    "content_scripts": [
        {
            "matches": [
                "*:\/\/*\/*"
            ],
            "js": [
                "htmlparser2.js",
                "content.js"
            ]
        }
    ],
    "manifest_version": 2
}