Getting scrapers for UMWBlogs data into better shaper---to some degree, at least, I'm getting dcmitypes info. That's let me put a couple tests together, generating image and video galleries: Images, Videos.
These are coming from scrapes only of the links in blog posts. If there are embedded videos or images that aren't links, those aren't yet in.
My next big chaos is going to be the many, many pages (as opposed to posts) in umwblogs that contain significant content. Need to figure a way to effectively scrape those.
But it's happy RDFizing so far!
Post new comment