Automatically migrated from Gitolite
You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
Sven Slootweg 2f05eb7ded Add website link in the readme. 12 years ago
README.md Add website link in the readme. 12 years ago
index.html Make the frontpage nicer and add info to the README 12 years ago
jquery.js Initial commit 12 years ago
scraperscript.js Clean up and reflow the code a bit 12 years ago

README.md

ScraperScript

A simple bookmarklet that lets you click any element in a page and returns the smallest unique (and reliable) selector it can find for that element.

Licensed under the WTFPL. A working example can be found at http://cryto.net/scraperscript.

Usecase: Figuring out a CSS-style selector for screenscraping.

How to use the selector: Use your favourite screenscraping library with CSS selector support. If you are using BeautifulSoup, you'll need to use my patched version to have support for :nth-of-type() pseudoselectors as returned by ScraperScript.

How to make it stop killing my input: Just click the X in the ScraperScript bar. ScraperScript will release your input.

How to develop: Modify the bookmarklet to point at a scraperscript.js that is hosted locally. It's loaded with a cachebuster, so you can simply use the same bookmarklet every time you change the local code.

Known bugs:

  • On certain kinds of elements (some submit buttons, some elements with click events), ScraperScript can't prevent the attached event from happening. I'm unsure what causes this.
  • You can't ctrl+C the selector because keyboard input is killed when ScraperScript is running. Needs to be fixed to allow ctrl+C.