Automatically migrated from Gitolite
You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
Sven Slootweg 8456151c34 Add and fix some information 11 years ago
README.md Add and fix some information 11 years ago
index.html Initial commit 11 years ago
jquery.js Initial commit 11 years ago
scraperscript.js Add and fix some information 11 years ago

README.md

ScraperScript

A simple bookmarklet that lets you click any element in a page and returns the smallest unique (and reliable) selector it can find for that element.

Usecase: Figuring out a CSS-style selector for screenscraping.

How to use the selector: Use your favourite screenscraping library with CSS selector support. If you are using BeautifulSoup, you'll need to use my patched version to have support for :nth-of-type() pseudoselectors as returned by ScraperScript.

How to make it stop killing my input: Just click the X in the ScraperScript bar. ScraperScript will release your input.

How to develop: Modify the bookmarklet to point at a scraperscript.js that is hosted locally. It's loaded with a cachebuster, so you can simply use the same bookmarklet every time you change the local code.

Known bugs:

  • On certain kinds of elements (some submit buttons, some elements with click events), ScraperScript can't prevent the attached event from happening. I'm unsure what causes this.
  • You can't ctrl+C the selector because keyboard input is killed when ScraperScript is running. Needs to be fixed to allow ctrl+C.