More

bem · on June 8, 2021

This post from fly.io [1] has a pretty comprehensive survey of the tech available for running users' code safely. It's a good read.

I've been investigating something similar for a feature I want to launch. I'm currently leaning towards running users' code in Kubernetes using Firecracker or gVisor.

My main takeaway has been that while there are good solutions for isolating users' code, there's going to be a lot of worked involved in orchestrating it at scale. I.e. building and storing images, spinning up containers, managing storage, tracking/billing minutes and bandwidth, killing timed-out containers, etc. I have not found a good library for that. It seems like a good use-case for a Kubernetes operator, so I think that's what I'll wind up building.

[1] https://fly.io/blog/sandboxing-and-workload-isolation/

sgslo · on June 8, 2021

An operator might be overkill.

I used a K8S cluster to run untrusted code. User code was executed inside of a container running as a job, rather than a naked pod or deployment. To monitor/track/handle abuse, I used a sidecar container running alongside the user's container.

The real challenge around running user's code isn't running code, per se. Instead, it is storage! I was never able to come up with a good solution for allowing users to create a very large number of files, such as the number of files created by creating a React app.

bem · on May 7, 2021

The daily newsletter (/commentary) from Messari is a great way to stay casually updated on what's happening in crypto: https://messari.io/newsletter

If you're looking for something more technical, the Week in Ethereum newsletter is incredible: https://weekinethereumnews.com

For learning the basics, the docs on the Ethereum foundation website will get you a long way: https://ethereum.org/

gogo61 · on May 8, 2021

thanks a lot.

bem · on May 4, 2021

One powerful way to deal with these problems is event sourcing. It's a reasonably elegant way to materialize a single application-specific cache based on many different data sources. Two great resources:

https://engineering.linkedin.com/distributed-systems/log-wha...

https://queue.acm.org/detail.cfm?id=3321612

bem · on April 29, 2021

Looks interesting! I’ve been looking at setting up gVisor to enable running users’ code. How does/will Kwarantine compare to gVisor?

riyakhanna1983 · on April 29, 2021

Thanks! gVisor intercepts app syscalls and serve them in user space (inside separate VMs, one for each container), which reduces runtime performance significantly. Both Firecracker and gVisor use VMs to sandbox container code.

Kwarantine, on the other hand, directly runs container code on the hardware (no VMs). It uses MMU/page tables to provide a different kernel to each container.

bem · on April 29, 2021

Makes sense. Why do you think Google and Amazon didn't pursue that approach for services like Cloud Functions and Lambda? Is there a trade-off or is it a matter of complexity?

ashishbijlani · on April 29, 2021

I believe they are constantly working on optimizing their infrastructure, and we will see improved versions of gVisor and Firecracker soon.

bem · on April 28, 2021

I personally love the vibes around Replit. Check out their Twitter as an example: https://twitter.com/replit

TechBro8615 · on April 29, 2021

They've been around for years but it seems they've only recently started getting more focused attention. I wonder if they can point to a deliberate strategy as the cause of that, or if it's mostly just good timing and getting hyped by the right people (e.g. PG on Twitter).

bloodcarter · on April 28, 2021

thanks! yes, I like Amjad very much. Anything else coming to your mind?

bem · on April 28, 2021

Looks like a useful product! I think it's neat how you built it directly on Google Sheets. I'd love to hear more about why you made that decision (versus building an independent tool).

Some feedback on the landing page:

- Would be nice with a 2-3 minute demo video

- Put the screenshots (or demo video) closer to the top, so they're visible without scrolling

- Some of the copywriting could be clearer. For example: "Pre-built financial models for SaaS companies, with plug-and-play software built directly in Google Sheets" could be just "Financial models for SaaS companies, directly in Google Sheets"

FlowCog · on April 28, 2021

I think building it in Google Sheets is important because finance folks are comfortable with it and (if needed) can customize it to suit their needs. It's also hard to replicate Google Sheet's collaboration features like edit history on individual cells, real-time updates, comments/suggestions etc.

- A demo video is a great idea, I'll get started on that this week.

- Screenshots/video closer to the top is a good idea too, think maybe I need an image in the hero.

- Good point on the copy, I'll have to revisit this.

Really appreciate the suggestions!

bem · on April 29, 2021

Makes sense – Google Sheets is a great tool. No problem!

bem · on April 28, 2021

Last year I read Masters of Doom by David Kushner after someone mentioned it on Hacker News. It was the best book I had read in a long time. It won't improve your skills, but I think it will motivate and inspire you to immerse yourself (if we're talking programming, doing small projects and getting feedback is a better way to improve your skills anyway).

https://www.goodreads.com/book/show/222146.Masters_of_Doom

is_true · on April 29, 2021

I love that book. Dopamine in written form.

bem · on April 28, 2021

I think you could find some inspiration in cryptocurrency exchanges. Most of them expose public websockets for prices, order books, trades, etc. They're high volume with lots of subscribers.

For example, check out the docs for the Binance websockets (they use both snapshot and delta messages): https://github.com/binance/binance-spot-api-docs/blob/master...

If you could tell me a little more about the data format, data volumes, number of subscribers, and how you get the data on the backend, I can try to give you some more concrete advice.

bem · on April 26, 2021

If you like GraphQL and don't mind managed services:

- Fauna (http://fauna.com) or Hasura (https://hasura.io) for the backend

- Vercel (http://vercel.com) or Netlify (https://www.netlify.com) for the frontend and functions

bem · on March 26, 2012

Facebook is one of few online services where programmers are not programmers. Alas, probably not worth it.