Wild Spider

web pages are crawled by being loaded into browser using multiple tabs parallelly

Qu'est-ce que Wild Spider ?

Wild Spider est une extension Chrome développée par Xuan Wu, et sa fonction principale est "web pages are crawled by being loaded into browser using multiple tabs parallelly".

Captures d'Écran de l'Extension

screenshot

Télécharger le fichier CRX de l'extension Wild Spider

Téléchargez les fichiers d'extension Wild Spider au format crx, installez manuellement les extensions Chrome dans le navigateur ou partagez les fichiers crx avec des amis pour installer facilement les extensions Chrome.

Instructions d'Utilisation de l'Extension

                        WATCH OUT: more tabs you use, more computer resources (CPU, memory) will be used, and each page costs a bit disk to save the content (in IndexedDb, accessible from extensions -> Inspect views: background page).

The "spider" works in this way:
1) The current url is used as the starting point, and it's loaded again in a new tab.
2) After this page is loaded, fetch all the links on the page.
3) Get all the links on the page, including relative urls.
4) Open the extracted link parallelly in all the tabs used (by default 3, set in eventPage).
5) repeat 2-4

All source code at: https://github.com/nobodxbodon/ChromeCrawlerWildSpider                    

Informations de Base sur l'Extension

Nom Wild Spider Wild Spider
ID aanpchnfojihjddlocpgoekffmjkhbbe
URL Officiel https://chromewebstore.google.com/detail/wild-spider/aanpchnfojihjddlocpgoekffmjkhbbe
Description web pages are crawled by being loaded into browser using multiple tabs parallelly
Taille du Fichier 121 KB
Nombre d'Installations 44
Version Actuelle 0.0.3
Dernière Mise à Jour 2019-03-08
Date de Publication 2019-03-08
Évaluation 1.00/5 Total 1 Évaluations
Développeur Xuan Wu
Type de Paiement free
Site Web de l'Extension https://github.com/nobodxbodon/ChromeCrawlerWildSpider
URL de la Page d'Aide https://github.com/nobodxbodon/ChromeCrawlerWildSpider/issues
Langues Prises en Charge en-US
manifest.json
{
    "update_url": "https:\/\/clients2.google.com\/service\/update2\/crx",
    "name": "Wild Spider",
    "short_name": "demo web crawler that's still in experimenting",
    "description": "web pages are crawled by being loaded into browser using multiple tabs parallelly",
    "version": "0.0.3",
    "browser_action": {
        "default_icon": "icon.png"
    },
    "permissions": [
        "tabs",
        "activeTab",
        "webNavigation"
    ],
    "background": {
        "scripts": [
            "Dexie.js",
            "eventPage.js"
        ],
        "persistent": false
    },
    "content_scripts": [
        {
            "matches": [
                "*:\/\/*\/*"
            ],
            "js": [
                "htmlparser2.js",
                "content.js"
            ]
        }
    ],
    "manifest_version": 2
}