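My application needs to retrieve information about any published book based on a provided ISBN, title, or author. This hardly seems like a unique requirement, since sites like Amazon.com and Chegg.com, and even software like Book Collector, seem to be able to do this easily. But I have not been able to replicate it.
To clarify, I do not need to search the entire database of books, only a limited subset that has been entered, as in a book collection. The database would simply allow me to tag the entered books with the necessary metadata so that I can search that subset. So scale is not the issue here; getting the metadata is.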
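As a rough illustration of that setup, a minimal sketch of such a local collection might look like the following. It assumes SQLite as the store, and the names `open_collection`, `add_book`, `search_books`, and the `books` table are hypothetical. The metadata values are placeholders: in practice they would come from whichever source ends up working.

```python
import sqlite3


def open_collection(path=":memory:"):
    """Open (or create) the local collection; the schema is an assumption."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS books ("
        "isbn TEXT PRIMARY KEY, title TEXT NOT NULL, author TEXT NOT NULL)"
    )
    return conn


def _strip_isbn(value):
    # Remove the hyphens and spaces commonly used when writing ISBNs.
    return value.replace("-", "").replace(" ", "")


def add_book(conn, isbn, title, author):
    """Tag one entered book with metadata obtained elsewhere."""
    conn.execute(
        "INSERT OR REPLACE INTO books (isbn, title, author) VALUES (?, ?, ?)",
        (_strip_isbn(isbn), title, author),
    )
    conn.commit()


def search_books(conn, term):
    """Match an exact ISBN, or a substring of the title or author."""
    like = f"%{term}%"
    rows = conn.execute(
        "SELECT isbn, title, author FROM books "
        "WHERE isbn = ? OR title LIKE ? OR author LIKE ? "
        "ORDER BY title",
        (_strip_isbn(term), like, like),
    )
    return rows.fetchall()
```

Searching this subset is the easy part; the open question is where the title and author passed to `add_book` come from.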
The options I have tried are:
- Scrape Amazon. Scraping the regular Amazon pages was not very robust to things like missing authors, and while scraping the smaller mobile pages was faster, they had the same robustness problems during extraction. Plus, building this into an application is a clear violation of Amazon's Terms of Service.
- Scrape the Library of Congress. While this seems to have fewer legal ramifications, ease and robustness were again issues.
- ISBNdb.com API. While the service is free up to a point, and does a good job of returning the necessary metadata, I need to do this for over 500 books on a daily basis, at which point this service costs money proportional to use. I'd prefer a free or one-time payment solution that allows me to do the same.
- Google Book Data API. While this seems to provide the information I need, I cannot display the book preview as their terms of service require.
- Buy a license to a database of books. For example, companies like Ingram or Baker & Taylor provide these catalogs to retailers and libraries. This solution is obviously expensive, so I'm hoping that there's a more elegant solution I've missed. But if not, and someone on SO has had a good experience with a particular database, I'm willing to go with that.
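I have tried to describe my approach in detail so that others with fewer books can take advantage of the solutions above. But given my requirements, I'm at my wits' end when it comes to retrieving book metadata.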