Improving the scrapers, trying new displays

Project(s): 

Getting scrapers for UMWBlogs data into better shaper---to some degree, at least, I'm getting dcmitypes info. That's let me put a couple tests together, generating image and video galleries: Images, Videos.

These are coming from scrapes only of the links in blog posts. If there are embedded videos or images that aren't links, those aren't yet in.

My next big chaos is going to be the many, many pages (as opposed to posts) in umwblogs that contain significant content. Need to figure a way to effectively scrape those.

But it's happy RDFizing so far!

Trackback URL for this post:

http://www.patrickgmj.net/trackback/134

Post new comment

The content of this field is kept private and will not be shown publicly.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.

More information about formatting options