added support for rdf site summary (rss 1.0) #35
…) with substr(-1) + many other improvements
The element stack will be removed, a new event structure will be introduced
use String#slice instead
Great idea. I merged it into my own fork. (I also changed the code to use a more readable coding style.)
what I really wanted to do is now in comments (it failed multiple times)
Just deleted the "_" in front of them
because they are xml, so <script> & <style> tags shouldn't get a special treatment
Use openTag and closeTag instead
for now, it's only enforceEmptyTags
function for disparate names in different RSS types.
Cool. I just merged your branch into mine. My last commit parses the feeds a little more dynamically. This undoes some of the normalization of the names, e.g. "lastBuildDate" is no longer returned as "updated", but I'm doing normalization after parsing it anyway. I might add a convenience function later that automatically normalizes some of the crucial names.
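For illustration, such a post-parse convenience function could look roughly like the sketch below. It is not code from either fork. The name `normalizeNames`, the flat-object feed shape, and every mapping entry other than "lastBuildDate" to "updated" are assumptions made for the example.

```javascript
// Hypothetical name map. Only lastBuildDate -> updated is taken from the
// discussion above; the other entries are illustrative assumptions.
const NAME_MAP = {
  lastBuildDate: "updated", // RSS 2.0 channel element (from the thread)
  "dc:date": "updated",     // assumption: Dublin Core date used in RSS 1.0
  summary: "description"    // assumption: Atom-style summary
};

const has = (obj, key) => Object.prototype.hasOwnProperty.call(obj, key);

// Returns a shallow copy of a parsed feed object with known names mapped
// to their normalized form. Unknown names pass through unchanged.
function normalizeNames(feed) {
  const out = {};
  for (const key of Object.keys(feed)) {
    const name = has(NAME_MAP, key) ? NAME_MAP[key] : key;
    // If two source names map to the same target, the first one wins.
    if (!has(out, name)) {
      out[name] = feed[key];
    }
  }
  return out;
}

// Usage: a feed parsed "dynamically" keeps its original element names.
const parsed = { title: "Example", lastBuildDate: "Mon, 01 Jan 2011 00:00:00 GMT" };
console.log(normalizeNames(parsed));
```

Running the normalization as a separate step after parsing keeps the parser generic, at the cost of a second pass over the result.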
I like the idea of not checking every single item multiple times, but normalisation is absolutely required. My idea is to rewrite the entire thing to use my EventedHandler, so that it's not necessary to build a DOM or anything. Besides, I already opened a pull request for my own changes, so if you want to change my code, it would be easier if you fork my fork and send me a pull request. I don't think @tautologistics is still actively developing this project, so any pull requests here will probably be forgotten.
Idea & part of code from tautologistics/node-htmlparser#35
e.g. http://sfbay.craigslist.org/ccc/index.rss can now be parsed with the rsshandler