Home - Hypergol - Data Science Productivity

Hypergol: Arm yourself with the power of an entire Data Science team

Run prototypes at scale and draw real conclusions
Generate your entire project repository with a single command
Use parallel processing without any infrastructure burden
Track your models with simplified MLOps procedures
Eliminate chores and focus on what you really want to do

Features

Hypergol achieves compounding acceleration by providing opinionated tools in all areas of the Data Science stack.

Consistent Data Modeling

Treat your data as first-class citizens. Handle everything in python classes. Autogenerate the code from the command line to avoid this tedious and error-prone task.

This simplifies:

Data pipeline I/O
Data storage
Model training and evaluation
Model deployment

Your data is the most utilised part of your project. Rather than defining it as columns in pandas dataframes or dictionaries that lack validation, create standardised and composable structures everywhere.

Learn More Install from PyPI

fast →ls

Standardized Storage

Store all your data in the same way with the data model defined above by using Hypergol’s own storage format.

This enables:

Parallel processing
Data lineage
Simplified serialisation of composite structures

The storage format enables processing larger than machine memory data sizes and parallel processing by sharding. Hypergol uses the same SHA-1 checksum and compression git uses to create data lineage.

You will always know how your data was created and process it as efficiently as possible.

Learn More Install from PyPI

Effortless Pipeline Parallelisation

Parallelise your data processing with Hypergol’s own no-infra task scheduler. All you need is just a really large clound instance. Using its custom data format you are able to process huge datasets multithreaded just as easily as on a single thread.

No schedulers, no DAGs, no containers, no clusters. Just organised computing power.

Learn More Install from PyPI

Straightforward Deep Learning

Hypergol’s Deep Learning Framework will enable you to create models that are easily maintainable. Treat machine learning as programming by adapting a well-structured system.

This enables:

Extending the model architecture
Adding new features to the input
Retraining and deploying with ease

Hypergol’s batch processing classes enable seamless integration with Hypergol’s storage format to simplify training and evaluation.

Learn More Install from PyPI

Simple Model Deployment

Hypergol’s integrated framework enables deployment code generation with FastAPI and uvicorn.

Using your datamodel you will be able to generate typed APIs for your models automatically through pydantic.

Because FastAPI uses Swagger your code is immediately self-documented. Deploy your models in minutes!

Learn More Install from PyPI

Even More Features

Simple use of stored data with context managers in Jupyter Notebooks.
Convenience tools for data discovery in interactive environments.
Experiment versioning with git branches.
Data model conversion between different versions for schema evolution.
Generate code for everything:

virtual environment generating scripts
shell scripts to run pipelines with parameters
scripts to run unit tests
the unit tests themselves
scripts to run pylint
stubs for all Hypergol components

Accelerate yourself with Hypergol right now!

All you need to do is `pip install hypergol`

Install from PyPI Join on Discord

Instructions

fast →pi

Latest Posts

How to get notified if your instance is preempted on GCP?

Preemptable instances are cheaper than normal ones but can automatically disappear. To deal with it, you not only need to make sure your data pipeline saves your progress at regular checkpoints but also to get notified when this unfortunate event happens.

Starting on GCP from scratch

Step-by-step instruction on how to create an instance, install the right tools easily and start using Hypergol.

How to start on a new machine with a "Settings repo"

How to get your usual environment on a new VM in no time at all? Including scripts to start Jupyter notebook server, create virtual environments and git autocomplete in command line.