arachn.io
The days of writing custom scrapers for every website are gone. The arachn.io API provides a universal news scraper, which extracts content from news articles and other webpages automatically using web standards and artificial intelligence. The API provides many data points for articles, which include: * Canonical URL * Title * Author name * Locale * Body content -- Sanitized HTML and plain text…
arachn.io endpoints
| Method | Endpoint | Description |
|---|---|---|
| link | ||
| POST |
unwindLinkBatch /links/unwind/batch |
Converts a batch of links to their maximally unwound, and possibly canonical, forms |
| POST |
extractLink /links/extract |
Unwinds a link and attempts to extract structured microdata and body content |
| POST |
parseLinkBatch /links/parse/batch |
Parses a batch of absolute hyperlinks into their component parts |
| POST |
parseLink /links/parse |
Parses an absolute hyperlink into its component parts |
| POST |
unwindLink /links/unwind |
Converts a link to its maximally unwound, and possibly canonical, form. For example, convert `https://nyti.ms/3rBgKkH` to… |
| domain | ||
| POST |
parseDomainNameBatch /domains/parse/batch |
Parses a batch of domain names into their component parts |
| POST |
parseDomainName /domains/parse |
Parses a domain name into its component parts |
arachn.io pricing
| Plan | Price | Rate limit | Quotas |
|---|---|---|---|
| BASIC | Free | 1 / second |
|
| PRO | $9 / month | 1 / second |
|
| ULTRA Recommended | $49 / month | 5 / second |
|
| MEGA | $99 / month | 10 / second |
|