📄 skillActivePromisingmedium

apify/crawlee

by apify

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

Supported Platforms

🤖 Claude Code

Stars

23.7k

Skill Type

📖 Library & API Reference

Quality Score

90/200

License

Apache-2.0

Forks

1.4k

Last Updated

Jun 11, 2026

Discovered

May 10, 2026

Validation

Passed

github.com/apify/crawlee

Quality Breakdown

90/ 200

Content Signals

Gotchas/Edge Cases+40
Progressive Disclosure+30
Trigger Description+20
Verification/Safety+20
Code Examples+15
Composability+15

Repo Health

Recent Activity+15
Scripts/Automation+10
Real Usage (Issues)+10
Single Responsibility+10
Config/Persistence+10
Install Instructions+5

Multi-platform bonus: +5 pts if tool supports 2+ platforms. Score derived from 12 structural signals — not stars or popularity.

Trust & Verification

medium

Requires extended permissions (shell access, subagents). Review before use.

Active

Updated within the last 90 days. Actively maintained.

Unverified skill. Always review source code before installing any skill from an unknown author.

Risk Assessment

  • Repository contains `.claude/CLAUDE.md` file which suggests persistent Claude configuration that could be modified to alter AI behavior
  • GitHub workflows include CI/CD pipelines (publish-to-npm.yml, release.yml) that execute automated deployment scripts
  • Web scraping and browser automation library could potentially be misused for malicious purposes (unauthorized data collection, bot attacks)
  • Contains Docker-related deployment guidance which involves container execution
  • Husky pre-commit hooks are configured which can execute scripts on developer machines