Wild Spider

web pages are crawled by being loaded into browser using multiple tabs parallelly

Wat is Wild Spider?

Wild Spider is een Chrome-extensie ontwikkeld door Xuan Wu, en de belangrijkste functie is "web pages are crawled by being loaded into browser using multiple tabs parallelly".

Extensie Screenshots

screenshot

Download het CRX-bestand van de extensie Wild Spider

Download Wild Spider-extensiebestanden in crx-indeling, installeer Chrome-extensies handmatig in de browser of deel de crx-bestanden met vrienden om Chrome-extensies eenvoudig te installeren.

Instructies voor het Gebruik van de Extensie

                        WATCH OUT: more tabs you use, more computer resources (CPU, memory) will be used, and each page costs a bit disk to save the content (in IndexedDb, accessible from extensions -> Inspect views: background page).

The "spider" works in this way:
1) The current url is used as the starting point, and it's loaded again in a new tab.
2) After this page is loaded, fetch all the links on the page.
3) Get all the links on the page, including relative urls.
4) Open the extracted link parallelly in all the tabs used (by default 3, set in eventPage).
5) repeat 2-4

All source code at: https://github.com/nobodxbodon/ChromeCrawlerWildSpider                    

Basisinformatie over de Extensie

Naam Wild Spider Wild Spider
ID aanpchnfojihjddlocpgoekffmjkhbbe
Officiële URL https://chromewebstore.google.com/detail/wild-spider/aanpchnfojihjddlocpgoekffmjkhbbe
Beschrijving web pages are crawled by being loaded into browser using multiple tabs parallelly
Bestandsgrootte 121 KB
Aantal Installaties 44
Huidige Versie 0.0.3
Laatst Bijgewerkt 2019-03-08
Publicatiedatum 2019-03-08
Beoordeling 1.00/5 Totaal 1 Beoordelingen
Ontwikkelaar Xuan Wu
Betalingswijze free
Extensiewebsite https://github.com/nobodxbodon/ChromeCrawlerWildSpider
Help Pagina-URL https://github.com/nobodxbodon/ChromeCrawlerWildSpider/issues
Ondersteunde Talen en-US
manifest.json
{
    "update_url": "https:\/\/clients2.google.com\/service\/update2\/crx",
    "name": "Wild Spider",
    "short_name": "demo web crawler that's still in experimenting",
    "description": "web pages are crawled by being loaded into browser using multiple tabs parallelly",
    "version": "0.0.3",
    "browser_action": {
        "default_icon": "icon.png"
    },
    "permissions": [
        "tabs",
        "activeTab",
        "webNavigation"
    ],
    "background": {
        "scripts": [
            "Dexie.js",
            "eventPage.js"
        ],
        "persistent": false
    },
    "content_scripts": [
        {
            "matches": [
                "*:\/\/*\/*"
            ],
            "js": [
                "htmlparser2.js",
                "content.js"
            ]
        }
    ],
    "manifest_version": 2
}