Category Archives: Scrape commands

I needed a place to make multiple posts about web scraping commands. I don’t recommend reading these posts right now. It’s quite raw information that will need a lot of work to make it understandable and usable.

Believe me, I intend to make things easy, easier than this. These scrape commands will probably end up not even visible to users. I’m hoping that I can create smart algorithms to automatically create them as-needed for web pages.

Delimiters, KeepAfter, KeepBefore, KeepAfterIncluding, KeepBeforeIncluding, SaveAppend, SavePrepend, etc

Saved Area

The “Saved Area” is a place where text can be saved for later. Suppose some webpage text got chopped up on the Chopping block, and the Chop Result is now a useful word. You can Copy the word from the … Continue reading

Posted in Scrape commands | Leave a comment

Chop Result

The “Chop Result” is how we refer to the words currently on the chopping block. What words? What chopping block? What is chopping? Good questions.

Posted in Scrape commands | Leave a comment

SavePrepend

The “SavePrepend” command is just like SaveAppend, except that instead of adding the Chop Result onto the end of the Saved Area, “Prepend” means to insert the Chop Result onto the beginning of the Saved Area, ahead of whatever text … Continue reading

Posted in Scrape commands | Leave a comment

SaveAppend

The SaveAppend command is very useful. Once you’ve chopped some words on the Chopping block, you can Save them to a special area called the “Saved Area”.

Posted in Scrape commands | Leave a comment

KeepBeforeIncluding ‘h’

The KeepBeforeIncluding command is ‘h’. I apologize yet again for a single-letter command. ^g^ is an example.

Posted in Scrape commands | Leave a comment

KeepAfterIncluding ‘g’

The KeepAfterIncluding command is ‘g’. My apologies for a single-letter command. ^g^ is an example.

Posted in Scrape commands | Leave a comment

View Source. As fun as being confused, is.

You know what, sooner or later, someone will want to know how the heck to “View Source”. That’s because people who want to extract useful information from webpages, will need to do exactly that,

Posted in Scrape commands | Leave a comment

KeepBefore ‘b’

The KeepBefore command is ‘b’. It’s very commonly used in scraping webpages, and is usually preceeded by a KeepAfter ‘a’ command.

Posted in Scrape commands | Leave a comment

KeepAfter ‘a’

KeepAfter and KeepBefore are the two most fundamental scraping commands, and most commonly used. So they were shortened to ‘a’ and ‘b’ to make the command sequences appear shorter. But you can also use the full word ‘KeepAfter’ and ‘KeepBefore’.

Posted in Scrape commands | Leave a comment

Delimiters ^ or ; or ~ or anything else.

The very first character within a bunch of scrape commands, must be the delimiter. Delimiter is a separator, just like a space between words separates the words. And actually, you *could* use a space as the delimiter, but you *shouldn’t*, … Continue reading

Posted in Scrape commands | Leave a comment