Web Architecture 101
The basic architecture concepts I wish I knew when I was getting started as a web developer
The above diagram is a fairly good representation of our architecture at Storyblocks. If you're not an experienced web developer, you'll likely find it complicated. The walkthrough below should make it more approachable before we dive into the details of each component.
A user searches on Google for "Strong Beautiful Fog And Sunbeams In The Forest". The first result happens to be from Storyblocks, our leading stock photo and vectors site. The user clicks the result, which redirects their browser to the image details page. Under the hood, the user's browser sends a request to a DNS server to look up how to contact Storyblocks, and then sends the request.
The request hits our load balancer, which randomly chooses one of the 10 or so web servers we have running the site at the time to process the request. The web server looks up some information about the image from our caching service and fetches the remaining data about it from the database. We notice that the color profile for the image has not been computed yet, so we send a "color profile" job to our job queue, which our job servers will process asynchronously, updating the database appropriately with the results.
Next, we attempt to find similar photos by sending a request to our full-text search service using the title of the photo as input. The user happens to be logged in to Storyblocks as a member, so we look up their account information from our account service. Finally, we fire off a page view event to our data firehose to be recorded on our cloud storage system and eventually loaded into our data warehouse, which analysts use to help answer questions about the business.
The server now renders the view as HTML and sends it back to the user's browser, passing first through the load balancer. The page contains JavaScript and CSS assets that we've loaded into our cloud storage system, which is connected to our CDN, so the user's browser contacts the CDN to retrieve the content. Lastly, the browser visibly renders the page for the user to see.
Next I'll walk you through each component, providing a "101" introduction to each that should give you a good mental model for thinking through web architecture going forward. I'll follow up with another series of articles providing specific implementation recommendations based on what I've learned in my time at Storyblocks.
1. DNS
DNS stands for "Domain Name System" and it's a backbone technology that makes the world wide web possible. At the most basic level, DNS provides a key/value lookup from a domain name (e.g., google.com) to an IP address (e.g., 85.129.83.120), which is required in order for your computer to route a request to the appropriate server. Analogizing to phone numbers, the difference between a domain name and an IP address is the difference between "call John Doe" and "call 201-867-5309." Just like you needed a phone book to look up John's number in the old days, you need DNS to look up the IP address for a domain. So you can think of DNS as the phone book for the internet.
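If you want to see that lookup in action, Node.js exposes it directly through its built-in dns module. A minimal sketch (the domain here is just an example):

```typescript
// Resolve a domain name to its IPv4 addresses, the same lookup your browser
// (via the operating system) performs before it can send an HTTP request.
import { promises as dns } from "dns";

async function main() {
  const addresses = await dns.resolve4("storyblocks.com"); // example domain
  console.log(addresses); // an array of IPv4 addresses as strings
}

main().catch(console.error);
```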
There's a lot more detail we could go into here, but we'll skip over it because it's not critical for our 101-level intro.
2. Load Balancer
Before diving into details on load balancing, we need to take a step back to discuss horizontal vs. vertical application scaling. What are they and what's the difference? As simply put in this StackOverflow post, horizontal scaling means that you scale by adding more machines to your pool of resources, whereas "vertical" scaling means that you scale by adding more power (e.g., CPU, RAM) to an existing machine.
In web development, you (almost) always want to scale horizontally because, to keep it simple, stuff breaks. Servers crash randomly. Networks degrade. Entire data centers occasionally go offline. Having more than one server allows you to plan for outages so that your application continues running. In other words, your app is "fault tolerant." Secondly, horizontal scaling allows you to minimally couple different parts of your application backend (web server, database, service X, etc.) by having each of them run on different servers. Lastly, you may reach a scale where it's not possible to scale vertically any more: there is no computer in the world big enough to do all your app's computations. Think of Google's search platform as the quintessential example, though this applies to companies at much smaller scales too. Storyblocks, for example, runs 150 to 400 AWS EC2 instances at any given point in time. It would be challenging to provide that entire compute power via vertical scaling.
Ok, back to load balancers. They're the magic sauce that makes scaling horizontally possible. They route incoming requests to one of many application servers that are typically clones / mirror images of each other and send the response from the app server back to the client. Any one of them should process the request the same way, so it's just a matter of distributing the requests across the set of servers so none of them are overloaded.
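To make the idea concrete, here's a toy sketch of the round-robin strategy many load balancers use by default. The server addresses are made up, and a real load balancer (NGINX, HAProxy, an AWS ELB, etc.) also handles health checks, retries, TLS termination, and more:

```typescript
// Toy round-robin load balancing: each incoming request is handed to the
// next server in the pool, so no single server takes all the traffic.
const servers = ["10.0.0.1", "10.0.0.2", "10.0.0.3"]; // hypothetical app server IPs
let next = 0;

function pickServer(): string {
  const server = servers[next];
  next = (next + 1) % servers.length; // wrap back around to the start of the pool
  return server;
}

// Every request gets routed to whichever server pickServer() returns.
for (let i = 0; i < 6; i++) {
  console.log(`request ${i} -> ${pickServer()}`);
}
```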
That's it. Conceptually, load balancers are fairly straightforward. Under the hood there are certainly complications, but no need to dive in for our 101 version.
3. Web Application Servers
At a high level, web application servers are relatively simple to describe. They execute the core business logic that handles a user's request and send back HTML to the user's browser. To do their job, they typically communicate with a variety of backend infrastructure such as databases, caching layers, job queues, search services, other microservices, data/logging queues, and more. As mentioned above, you typically have at least two, and oftentimes many more, plugged into a load balancer in order to process user requests.
You should know that app server implementations require choosing a specific language (Node.js, Ruby, PHP, Scala, Java, C# .NET, etc.) and a web MVC framework for that language (Express for Node.js, Ruby on Rails, Play for Scala, Laravel for PHP, etc.). However, diving into the details of these languages and frameworks is beyond the scope of this article.
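That said, a rough sketch can show the shape of what an app server does. Here's a minimal Express route handler; the "database" and "cache" below are in-memory stand-ins for the real backend services described above, not how any particular production app wires this up:

```typescript
// Minimal sketch of a web app server route using Express.
import express from "express";

const app = express();

const cache = new Map<string, string>(); // stand-in for a caching service (Redis, etc.)
const db = new Map<string, string>([     // stand-in for a real database
  ["42", "Strong Beautiful Fog And Sunbeams In The Forest"],
]);

app.get("/images/:id", (req, res) => {
  const id = req.params.id;

  // Try the cache first; fall back to the database on a miss.
  let title = cache.get(id);
  if (!title) {
    title = db.get(id) ?? "not found";
    cache.set(id, title);
  }

  // Render HTML and send it back to the user's browser
  // (a real app would use a view/template layer here).
  res.send(`<h1>${title}</h1>`);
});

app.listen(3000); // one of many identical app servers sitting behind the load balancer
```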
4. Database Servers
Every modern web application leverages one or more databases to store information. Databases provide ways of defining your data structures, inserting new data, finding existing data, updating or deleting existing data, performing computations across the data, and more. In most cases the web app servers talk directly to one, as will the job servers. Additionally, each backend service may have its own database that's isolated from the rest of the application.
While I'm avoiding a deep dive on particular technologies for each architecture component, I'd be doing you a disservice not to mention the next level of detail for databases: SQL and NoSQL.
SQL stands for "Structured Query Language" and was invented in the 1970s to provide a standard way of querying relational data sets that was accessible to a wide audience. SQL databases store data in tables that are linked together via common IDs, typically integers. Let's walk through a simple example of storing historical address information for users. You might have two tables, users and user_addresses, linked together by the user's id. See the image below for a simplistic version. The tables are linked because the user_id column in user_addresses is a "foreign key" to the id column in the users table.
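To make that relationship concrete, here's a hedged sketch of querying those two tables from Node.js using the node-postgres (pg) client. The column names and connection string are assumptions for illustration:

```typescript
// Join users to their historical addresses via the user_id foreign key.
import { Pool } from "pg";

const pool = new Pool({ connectionString: process.env.DATABASE_URL });

async function addressesForUser(userId: number) {
  // user_addresses.user_id -> users.id is the foreign key described above.
  const result = await pool.query(
    `SELECT u.name, a.address, a.created_at
       FROM users u
       JOIN user_addresses a ON a.user_id = u.id
      WHERE u.id = $1
      ORDER BY a.created_at DESC`, // column names are illustrative assumptions
    [userId]
  );
  return result.rows;
}

addressesForUser(42).then(console.log).catch(console.error);
```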
If you don't know much about SQL, I highly recommend walking through a tutorial like the one you can find on Khan Academy here. It's ubiquitous in web development, so you'll at least want to know the basics in order to properly architect an application.
NoSQL, which stands for "Non-SQL", is a newer set of database technologies that has emerged to handle the massive amounts of data that can be produced by large-scale web applications (most variants of SQL don't scale horizontally very well and can only scale vertically to a certain point). If you don't know anything about NoSQL, I recommend starting with some high-level introductions like these:
- https://www.w3resource.com/mongodb/nosql.php
- http://www.kdnuggets.com/2016/07/seven-steps-understanding-nosql-databases.html
- https://resources.mongodb.com/getting-started-with-mongodb/back-to-basics-1-introduction-to-nosql
I would also keep in mind that, by and large, the industry is aligning on SQL as an interface even for NoSQL databases, so you really should learn SQL if you don't know it. There's almost no way to avoid it these days.
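For contrast with the two-table example above, here's roughly how a document store such as MongoDB might model the same user-and-addresses data: one denormalized document per user instead of two joined tables. The field names and values are purely illustrative:

```typescript
// One document per user, with the address history embedded rather than
// joined in from a separate table.
interface UserDocument {
  _id: number;
  name: string;
  addresses: { address: string; movedInAt: string }[];
}

const user: UserDocument = {
  _id: 42,
  name: "Jane Doe",
  addresses: [
    { address: "123 Main St, Brooklyn, NY", movedInAt: "2015-06-01" },
    { address: "456 Oak Ave, Denver, CO", movedInAt: "2019-09-15" },
  ],
};

// Reads are a single document fetch instead of a JOIN, which is part of what
// makes this model easier to partition ("shard") across many machines.
console.log(user.addresses.length);
```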
5. Caching Service
A caching service provides a simple key/value data store that makes it possible to save and look up information in close to O(1) time. Applications typically leverage caching services to save the results of expensive computations so that it's possible to retrieve the results from the cache instead of recomputing them the next time they're needed. An application might cache results from a database query, calls to external services, HTML for a given URL, and many more. Here are some examples from real-world applications:
- Google caches search results for common search queries like "dog" or "Taylor Swift" rather than re-computing them each time
- Facebook caches much of the data you see when you log in, such as post data, friends, etc. Read a detailed article on Facebook's caching tech here.
- Storyblocks caches the HTML output from server-side React rendering, search results, typeahead results, and more.
The two most widespread caching server technologies are Redis and Memcached. I'll go into more detail on them in another post.
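The pattern most apps follow is "cache-aside": check the cache, and only do the expensive work on a miss. Here's a bare-bones sketch using an in-process map; a real deployment would point the same logic at Redis or Memcached instead:

```typescript
// Cache-aside helper: return a cached value if it's still fresh, otherwise run
// the expensive computation (database query, external API call, etc.) and cache it.
const cache = new Map<string, { value: string; expiresAt: number }>();

async function cached(
  key: string,
  ttlMs: number,
  compute: () => Promise<string>
): Promise<string> {
  const hit = cache.get(key);
  if (hit && hit.expiresAt > Date.now()) {
    return hit.value; // close to O(1), no recomputation needed
  }
  const value = await compute(); // the expensive part
  cache.set(key, { value, expiresAt: Date.now() + ttlMs });
  return value;
}

// Usage: cache a (pretend) search-results computation for 60 seconds.
cached("search:dog", 60_000, async () => "expensive search results").then(console.log);
```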
6. Job Queue & Servers
Most web applications need to do some work asynchronously behind the scenes that's not directly associated with responding to a user's request. For instance, Google needs to crawl and index the entire internet in order to return search results. It does not do this every time you search. Instead, it crawls the web asynchronously, updating the search indexes along the way.
While there are different architectures that enable asynchronous work to be done, the most ubiquitous is what I'll call the "job queue" architecture. It consists of two components: a queue of "jobs" that need to be run and one or more job servers (often called "workers") that run the jobs in the queue.
Job queues store a list of jobs that need to be run asynchronously. The simplest are first-in-first-out (FIFO) queues, though most applications end up needing some sort of priority queuing system. Whenever the app needs a job to be run, either on some sort of regular schedule or as determined by user actions, it simply adds the appropriate job to the queue.
Storyblocks, for instance, leverages a job queue to power a lot of the behind-the-scenes work required to support our marketplaces. We run jobs to encode videos and photos, process CSVs for metadata tagging, aggregate user statistics, send password reset emails, and more. We started with a simple FIFO queue, though we upgraded to a priority queue to ensure that time-sensitive operations like sending password reset emails were completed ASAP.
Job servers process jobs. They poll the job queue to determine if there's work to do, and if there is, they pop a job off the queue and execute it. The underlying language and framework choices are as numerous as for web servers, so I won't dive into detail in this article.
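Here's a toy sketch of the whole loop: the app enqueues a job, and a worker polls the queue and executes whatever it finds. Real systems back the queue with Redis, RabbitMQ, SQS, or similar, but the shape is the same (the job type and payload below are made up):

```typescript
// A toy in-memory job queue and worker loop.
type Job = { type: string; payload: Record<string, unknown> };

const queue: Job[] = []; // FIFO: jobs are processed in the order they were added

// The app enqueues work instead of doing it inline while the user waits.
function enqueue(job: Job) {
  queue.push(job);
}

// The job server ("worker") polls the queue and executes whatever it finds.
async function workerLoop() {
  while (true) {
    const job = queue.shift(); // pop the oldest job, if any
    if (job) {
      console.log(`processing ${job.type}`, job.payload);
      // ...run the actual work here: encode a video, send an email, etc.
    } else {
      await new Promise((r) => setTimeout(r, 1000)); // nothing to do, wait and poll again
    }
  }
}

enqueue({ type: "compute-color-profile", payload: { imageId: 42 } });
workerLoop();
```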
7. Full-text Search Service
Many if not most web apps support some sort of search feature where a user provides a text input (often called a "query") and the app returns the most "relevant" results. The technology powering this functionality is typically referred to as "full-text search", which leverages an inverted index to quickly look up documents that contain the query keywords.
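To see what an inverted index actually is, here's a tiny sketch that builds one over three made-up document titles and answers a keyword query by intersecting the matching document sets:

```typescript
// For each word, the index stores the set of document ids that contain it.
const docs: Record<number, string> = {
  1: "strong beautiful fog and sunbeams in the forest",
  2: "fog rolling over a mountain lake",
  3: "sunbeams through city skyscrapers",
};

const index = new Map<string, Set<number>>();
for (const [id, text] of Object.entries(docs)) {
  for (const word of text.split(/\s+/)) {
    if (!index.has(word)) index.set(word, new Set());
    index.get(word)!.add(Number(id));
  }
}

// A query is just an intersection of the per-keyword document sets.
function search(query: string): number[] {
  const sets = query
    .toLowerCase()
    .split(/\s+/)
    .map((w) => index.get(w) ?? new Set<number>());
  return [...sets[0]].filter((id) => sets.every((s) => s.has(id)));
}

console.log(search("fog sunbeams")); // [ 1 ]: only doc 1 contains both keywords
```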
While it's possible to do full-text search directly from some databases (e.g., MySQL supports full-text search), it's typical to run a separate "search service" that computes and stores the inverted index and provides a query interface. The most popular full-text search platform today is Elasticsearch, though there are other options such as Sphinx or Apache Solr.
8. Services
Once an app reaches a certain scale, there will likely be certain "services" that are carved out to run as separate applications. They're not exposed to the external world, but the app and other services interact with them. Storyblocks, for example, has several operational and planned services:
- Account service stores user data across all our sites, which allows us to easily offer cross-sell opportunities and create a more unified user experience
- Content service stores metadata for all of our video, audio, and image content. It also provides interfaces for downloading the content and viewing download history.
- Payment service provides an interface for billing customer credit cards.
- HTML → PDF service provides a simple interface that accepts HTML and returns a corresponding PDF document.
9. Data
Today, companies live and die based on how well they harness data. Almost every app these days, once it reaches a certain scale, leverages a data pipeline to ensure that data can be collected, stored, and analyzed. A typical pipeline has three main stages:
- The app sends data, typically events about user interactions, to the data "firehose," which provides a streaming interface to ingest and process the data. Often the raw data is transformed or augmented and passed to another firehose. AWS Kinesis and Kafka are the two most common technologies for this purpose (a minimal sketch of sending an event follows this list).
- The raw data as well as the final transformed/augmented data are saved to cloud storage. AWS Kinesis provides a feature called "firehose" that makes saving the raw data to its cloud storage (S3) extremely easy to configure.
- The transformed/augmented data is often loaded into a data warehouse for analysis. We use AWS Redshift, as does a large and growing portion of the startup world, though larger companies will often use Oracle or other proprietary warehouse technologies. If the data sets are large enough, a Hadoop-like NoSQL MapReduce technology may be required for analysis.
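As a sketch of the first stage, here's roughly what sending a page-view event to a Kinesis stream looks like with the AWS SDK for JavaScript v3. The stream name, region, and event shape are assumptions for illustration:

```typescript
// Send a user-interaction event into a Kinesis data stream.
import { KinesisClient, PutRecordCommand } from "@aws-sdk/client-kinesis";

const kinesis = new KinesisClient({ region: "us-east-1" }); // region is an assumption

async function recordPageView(userId: number, path: string) {
  const event = {
    type: "page_view",
    userId,
    path,
    timestamp: new Date().toISOString(),
  };

  await kinesis.send(
    new PutRecordCommand({
      StreamName: "user-events",                        // hypothetical stream name
      PartitionKey: String(userId),                     // controls how events are sharded
      Data: new TextEncoder().encode(JSON.stringify(event)),
    })
  );
}

recordPageView(42, "/images/strong-beautiful-fog").catch(console.error);
```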
Another step that's not pictured in the architecture diagram: loading data from the app and services' operational databases into the data warehouse. For example, at Storyblocks we load our VideoBlocks, AudioBlocks, Storyblocks, account service, and contributor portal databases into Redshift every night. This provides our analysts a holistic dataset by co-locating the core business data alongside our user interaction event data.
10. Cloud storage
"Cloud storage is a simple and scalable way to store, access, and share data over the Internet," according to AWS. You can use it to store and access more or less anything you'd store on a local file system, with the benefit of being able to interact with it via a RESTful API over HTTP. Amazon's S3 offering is by far the most popular cloud storage available today and the one we rely on extensively here at Storyblocks to store our video, photo, and audio assets, our CSS and JavaScript, our user event data, and much more.
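Here's a hedged sketch of that interaction using the AWS SDK for JavaScript v3: write an object to a bucket, then read it back by key. The bucket and key names are made up, and transformToString assumes a reasonably recent SDK version:

```typescript
// Upload and download an object over HTTP, roughly like reading/writing a file.
import { S3Client, PutObjectCommand, GetObjectCommand } from "@aws-sdk/client-s3";

const s3 = new S3Client({ region: "us-east-1" }); // region is an assumption

async function main() {
  // Upload a small CSS asset to a (hypothetical) bucket.
  await s3.send(
    new PutObjectCommand({
      Bucket: "my-assets-bucket",
      Key: "css/site.css",       // path-like key inside the bucket
      Body: "body { margin: 0; }",
      ContentType: "text/css",
    })
  );

  // Fetch the same object back by bucket + key.
  const res = await s3.send(
    new GetObjectCommand({ Bucket: "my-assets-bucket", Key: "css/site.css" })
  );
  console.log(await res.Body?.transformToString());
}

main().catch(console.error);
```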
11. CDN
CDN stands for "Content Delivery Network" and the technology provides a way of serving assets such as static HTML, CSS, JavaScript, and images over the web much faster than serving them from a single origin server. It works by distributing the content across many "edge" servers around the world so that users end up downloading assets from the "edge" servers instead of the origin server. For instance, in the image below, a user in Spain requests a web page from a site with origin servers in NYC, but the static assets for the page are loaded from a CDN "edge" server in England, preventing many slow cross-Atlantic HTTP requests.
Check out this article for a more thorough introduction. In general, a web app should always use a CDN to serve CSS, JavaScript, images, videos, and any other assets. Some apps might also be able to leverage a CDN to serve static HTML pages.
Parting thoughts
And that's a wrap on Web Architecture 101. I hope you found this useful. I'll hopefully post a series of 201 articles that provide deep dives into some of these components over the course of the next year or two.