Hacker News new | past | comments | ask | show | jobs | submit login

lxml's default parsers are good for xml, atom and xhtml; for html5 and html tag soup, lxml.html.html5parser (which depends on html5lib) is the way to go. For feed tag soup, feedparser still uses BeautifulSoup internally.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: