How Teracloud Helped RunGalileo Promote a Prototype of Their Solution to the Cloud
About RunGalileo
RunGalileo is “The Data API for ML”. They provide an abstraction layer for ML models through a Python library. The service targets development companies that want to build machine learning products without writing the models themselves.
The customer's product was at a very early stage. RunGalileo had technically validated it on small, local Hadoop instances. Their challenge was to take that prototype and make it production-ready for future customers on a reliable and cost-effective platform.
The technical challenges included providing access to Jupyter notebooks that can connect to a Hadoop cluster with the Galileo library installed, and the fact that the core of the library is written in Java (although it is meant to be used from Python).
AWS was chosen because we can deploy the Galileo library quickly on an EMR cluster, deploy the core components as containers in EKS, and have the whole system scale as needed. Built-in load balancing capabilities also make the implementation simpler.
The client needed several problems solved:
The libraries were not containerized. The first job was to create Dockerfiles to generate images and upload them to Amazon ECR so we can run them in EKS.
Migrate a Cassandra database, which is an integral part of the library and the demo site, to Amazon Keyspaces.
Create an Amazon EMR cluster for running demos and giving notebooks to potential customers. Here we carefully crafted IAM policies and roles so that access to EMR notebooks can be granted to customers without endangering the implementation or other customers’ data.
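As an illustration of the containerization item in the list above, here is a minimal sketch of how an image reference and the matching build-and-push commands could be assembled, assuming the images are stored in Amazon ECR. The account ID, region, repository name, and tag are hypothetical placeholders, not RunGalileo's actual values, and the helper names are invented for this example.

```python
def ecr_registry(account_id: str, region: str) -> str:
    """Return the host name of a private ECR registry."""
    return f"{account_id}.dkr.ecr.{region}.amazonaws.com"


def ecr_image_uri(account_id: str, region: str, repository: str, tag: str) -> str:
    """Return the full image reference that an EKS pod spec would point to."""
    return f"{ecr_registry(account_id, region)}/{repository}:{tag}"


def push_commands(account_id: str, region: str, repository: str, tag: str) -> list[str]:
    """List the shell commands that build a local image and push it to ECR.

    The commands are only returned as strings; nothing is executed here.
    """
    registry = ecr_registry(account_id, region)
    uri = ecr_image_uri(account_id, region, repository, tag)
    return [
        # Authenticate the Docker client against the registry.
        f"aws ecr get-login-password --region {region} | "
        f"docker login --username AWS --password-stdin {registry}",
        # Build from the Dockerfile in the current directory.
        f"docker build -t {repository}:{tag} .",
        # Tag the local image with its registry address, then push it.
        f"docker tag {repository}:{tag} {uri}",
        f"docker push {uri}",
    ]


if __name__ == "__main__":
    # All four values below are hypothetical placeholders.
    for command in push_commands("123456789012", "us-east-1", "galileo-core", "0.1.0"):
        print(command)
```

The resulting image reference is what a Kubernetes Deployment running in EKS would use in its container spec.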
The customer is now able to give their potential customers a set of credentials to log in to EMR notebooks, where they can test the RunGalileo library with the full power of EMR and without security risks.
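One common way to achieve this kind of isolation is tag-based access control. The sketch below is not the actual policy used in this project. It only shows the general shape of an IAM policy document that lets a prospect work with EMR notebooks carrying their own tag. The tag key `customer-id` and the customer name are hypothetical, and the action names and condition key should be checked against the current Amazon EMR IAM documentation before use.

```python
import json

# Hypothetical tag key marking which prospect a notebook belongs to.
OWNER_TAG_KEY = "customer-id"


def notebook_policy(customer_id: str) -> dict:
    """Build an IAM policy document scoped to one customer's tagged notebooks."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "UseOwnNotebooksOnly",
                "Effect": "Allow",
                "Action": [
                    "elasticmapreduce:DescribeEditor",
                    "elasticmapreduce:StartEditor",
                    "elasticmapreduce:StopEditor",
                    "elasticmapreduce:OpenEditorInConsole",
                ],
                "Resource": "*",
                # The actions are allowed only on resources whose tag
                # matches this customer, so other notebooks stay off limits.
                "Condition": {
                    "StringEquals": {
                        f"elasticmapreduce:ResourceTag/{OWNER_TAG_KEY}": customer_id
                    }
                },
            }
        ],
    }


if __name__ == "__main__":
    # "acme" is a placeholder customer identifier.
    print(json.dumps(notebook_policy("acme"), indent=2))
```

Generating the document from a function makes it easy to issue one narrowly scoped policy per prospect instead of sharing a broad one.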
An architecture diagram follows:
Benefits for RunGalileo
The most important benefit for the customer is that they can now run demonstrations for potential customers on a live system, without installing anything locally. This will allow RunGalileo to start selling their services and platform.
About the Partner
Teracloud is a fast-growing AWS Advanced Consulting Partner founded by certified cloud experts who specialize in migrating and deploying startups, enterprises, and everything in between to the cloud.
We have worked for companies from many different industries, such as airlines, healthcare, education, and e-commerce, designing, implementing, and managing cloud workloads with HA architecture under a 99.999% uptime SLA and PCI/HIPAA compliance requirements.
We also have a strong commitment to the cloud community: we host a local branch of the AWS User Group and meetups in support of education, evangelization, and evolution of the IT community. As an Amazon Web Services partner, we were invited to speak at AWS Community Day Buenos Aires 2019 and at AWS re:Invent 2019, the main AWS conference, in Las Vegas.