SpookyStuff
  • Download 0.3.1

What is SpookyStuff?

A Deep Web Scraper

  • Write 50% shorter code than any alternative
  • Browser automation support
  • Traverse the deep web at will with browser automation and proxy support

An API Integrator

  • Quickly combine heterogeneous web services to build your pipelines and products
  • Minimize time and cost with an advanced query optimizer and web caching
  • Stay resilient to service downtime with distributed retry and failsafe resolving
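
To make the caching and failsafe ideas above concrete, here is a minimal single-process sketch in Python. It is not SpookyStuff's API: the function name `resolve_with_failsafe`, its parameters, and the plain dictionary cache are all hypothetical, and a real deployment would spread the retries across a cluster rather than loop in one process.

```python
import time
from typing import Callable, Dict, Optional, Sequence


def resolve_with_failsafe(
    key: str,
    resolvers: Sequence[Callable[[str], str]],
    cache: Dict[str, str],
    retries: int = 3,
    backoff_seconds: float = 0.0,
) -> str:
    """Hypothetical helper: return a cached result if present; otherwise try
    each resolver in order, retrying each one up to `retries` times before
    falling back to the next resolver."""
    if key in cache:
        return cache[key]  # cache hit: no service call is made

    last_error: Optional[Exception] = None
    for resolver in resolvers:
        for attempt in range(retries):
            try:
                result = resolver(key)
            except Exception as error:  # treat any failure as "service down"
                last_error = error
                # exponential backoff between attempts (no wait when 0.0)
                time.sleep(backoff_seconds * (2 ** attempt))
            else:
                cache[key] = result  # remember the answer for later calls
                return result

    raise RuntimeError(f"all resolvers failed for {key!r}") from last_error
```

Passing a primary service followed by a backup service as `resolvers` gives the failsafe behavior: the backup is only consulted once the primary has exhausted its retries, and later calls for the same key are served from the cache.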

A Linked Data Query Engine

  • Lightning-fast running, testing and deployment on Apache Spark
  • Parse and extract from a variety of unstructured content, including HTML/XML, JSON, CSV, PDF, MS Office formats, and RTF
  • Turn any online resource into open data
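
As a rough illustration of turning mixed content into uniform records, the sketch below uses only the Python standard library. It is not SpookyStuff's extraction API: `extract_records`, the `content_type` labels, and the choice to pull only links out of HTML are assumptions made for this example, and JSON input is assumed to be an object or a list of objects.

```python
import csv
import io
import json
from html.parser import HTMLParser
from typing import Dict, List, Optional


class _LinkCollector(HTMLParser):
    """Collects the href and visible text of every <a> element."""

    def __init__(self) -> None:
        super().__init__()
        self.links: List[Dict[str, str]] = []
        self._current: Optional[Dict[str, str]] = None

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._current = {"href": dict(attrs).get("href") or "", "text": ""}

    def handle_data(self, data):
        if self._current is not None:
            self._current["text"] += data

    def handle_endtag(self, tag):
        if tag == "a" and self._current is not None:
            self._current["text"] = self._current["text"].strip()
            self.links.append(self._current)
            self._current = None


def extract_records(content: str, content_type: str) -> List[Dict[str, str]]:
    """Hypothetical helper: map JSON, CSV, or HTML text to a list of flat
    string-valued records so downstream code can treat them alike."""
    if content_type == "json":
        data = json.loads(content)
        rows = data if isinstance(data, list) else [data]
        return [{str(k): str(v) for k, v in row.items()} for row in rows]
    if content_type == "csv":
        # the first CSV line is used as the header row
        return [dict(row) for row in csv.DictReader(io.StringIO(content))]
    if content_type == "html":
        parser = _LinkCollector()
        parser.feed(content)
        parser.close()
        return parser.links
    raise ValueError(f"unsupported content type: {content_type!r}")
```

Formats such as PDF, MS Office, and RTF need dedicated parsers and are left out of this sketch.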

The eyes of AI

  • Easy integration with scalable machine learning APIs, including MLlib, H2O, and deeplearning4j
  • Built for proactive learning: your AI cluster can now actively seek data online to train itself!

Getting Started

Ready to get started?

Check out the documentation and tutorials!


Certified by Databricks for integration with Apache Spark

© 2017 Copyright tribbloids

Designed & maintained by Tenacious Design.