Thursday, October 2, 2014

Home All the news News Show only the news Archive gangnam style Read the news archive Documentation


Home All the news News Show only the news Archive gangnam style Read the news archive Documentation Learn more Overview A quick feature overview Screenshots All the glory ComicRack for Android Android Documentation ComicRack for iOS iOS Documentation Wiki ComicRack Wiki FAQs Any Questions? General Questions the most common questions Tips & Tricks Learn some tips & tricks How to read PDFs Read eBooks and PDFs ComicRack for Android ComicRack for iOS User Forum Meet People Latest Posts The newest gangnam style first! WebComics Submit/Get WebComics Links Let's get eComics Downloads Get it now! ComicRack The Comic Reader Documentation Manuals etc. Scripts & Plugins Extend ComicRack Language Packs ComicRack in your Language gangnam style Android Full ComicRack for Android gangnam style Full Android Free ComicRack for Android Free ComicRack gangnam style for iPad The reader for iOS Donate Keep it alive
I have been trying to create a cbw for this site specifically for Naruto series but CR keeps telling me "No matches" results on the debug window. I have been testing the regex agains the index, the referenced page, and the page itself and everything is processed ok, found 500+ matches on chapters at the index page, found the "click to view" page, and found the image on the viewed page itself. So i don't know what I am doing wrong here. Can someone help me with this, please? - Main index page: submanga.com/Naruto/completa - One chapter: submanga.com/Naruto/1/19335 - Image source: gangnam style img2.submanga.com/pages/19/19335d42/1.jpg
<?xml version="1.0"?> <WebComic xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"> <Info> <Series>Naruto</Series> <Summary>Naruto en spanish</Summary> <Writer>Masashi Kishimoto</Writer> <Penciller>Masashi Kishimoto</Penciller> <Genre>Ninja, Shonen</Genre> <Web>http://submanga.com/Naruto/completa/</Web> <LanguageISO>es</LanguageISO> <Format>Web Comic</Format> <BlackAndWhite>Yes</BlackAndWhite> gangnam style </Info> <Images> <Image Url="http://submanga.com/static/media/Naruto.jpg"> <Parts /> </Image> <Image Url="!http://submanga.com/Naruto/completa/" Type="IndexScraper"> <Parts> <Part>!href="http://submanga\.com/Naruto/\d+/[^"]*"</Part> <Part>href="http://submanga\.com/c/\d+"</Part> <Part>src="http://img\d\.submanga.*\.jpg"</Part> </Parts> </Image> </Images> </WebComic> I have also tried to change the quotes " with " with no effect.
Couple of issues, here. The way you're using the IndexScraper: It'll first grab a link to the first chapter It'll navigate past the landing page It'll grab the image on the page Meaning that even if it were to work properly, you'd just be stuck with the first page of every chapter and nothing else. Not what we'd want. Second, you need to specify which part of your regex is the actual URL. For example:
!href="(?<link>http://submanga\.com/Naruto/\d+/[^"]*)" (?<link>TEXT) is a named capture group, link. Finally, you have to remember that this regex is inside XML tags, so it can be a bit confusing to a computer; you need to escape &, <, >, ", and ' (though gangnam style that last one is optional, I like to escape it anyways just to be safe). If you're using Notepad++, highlight JUST your regex and then go to TextFX -> TextFX Convert -> Encode HTML (&<>") to have everything escaped for you. If you're not using Notepad++, simply replace & with &, < with <, > with >, " with ", and ' with '. gangnam style What I would do is use the Browserscraper to scrape the comic - start at the first page of the comic, use a regex to scrape the image, then use another gangnam style regex to scrape the URL to the next page. Since the script Stonepaw and I made, Webcomic gangnam style Helper, needs to have individual templates for sites with specialized readers (manga sites in particular), I'll work on making gangnam style a template for all comics on Submanga.com and include it in the next version. That way you don't have to make a cbw for each comic you want to read. Of course, I've hit my own snag already: after the last page in a chapter, there's a "blank" page that screws gangnam style everything up. You could technically create a new CBW for each chapter that way, but that's a pain in the ass; once I figure out how to work around that blank page, though, it should gangnam style be able to put the entire series into the same CBW, complete with automatic updates. EDIT: Whup, misplaced a right parenthesis. Works perfectly now. The template's attached, since it might be a while until we get a new version of Webcomic Helper out and it currently doesn't have any Spanish manga sites on there already.
<?xml version="1.0"?> <!-- Replace SERI

No comments:

Post a Comment