CrazyCrawler
Web search engine with scraping and crawling capabilities built with Django, MySQL, and Android app
Overview
A full-stack web search engine that enables users to scrape and crawl data from websites. Built using Django framework with HTML, CSS, JavaScript, and MySQL database. Includes an Android mobile app built with Android Studio for on-the-go web scraping.
💻 GitHub: CrazyCrawler
🌐 Features: Web Scraping, Indexing, Search, Mobile App
🏗️ Stack: Django, MySQL, Android
Key Features
✅ Web Crawler: Automated website crawling with configurable depth
✅ Data Scraper: Extract structured data from web pages
✅ Search Engine: Index and search scraped content
✅ User Dashboard: Manage scraping jobs and view results
✅ Android App: Mobile client for remote scraping control
✅ Scheduled Crawls: Cron jobs for periodic data collection
Architecture
User Input (URL) → Django Backend → Crawler Engine
↓
BeautifulSoup/Scrapy
↓
Data Extraction
↓
MySQL Database
↓
Indexing (Whoosh/Elasticsearch)
↓
Search Interface
Tech Stack
Backend: Django, Python, Celery (async tasks)
Frontend: HTML, CSS, JavaScript, Bootstrap
Database: MySQL, Redis (caching)
Scraping: BeautifulSoup, Scrapy, Selenium
Mobile: Android (Java), Retrofit (API client)
Deployment: Docker, Nginx, Gunicorn
Use Cases
- Research: Academic data collection
- Market Intelligence: Competitor analysis
- Price Monitoring: E-commerce price tracking
- Content Aggregation: News and blog aggregation
Status: Completed
Type: Full-Stack Web + Mobile
License: MIT