ESP32: Chat Application (Introduction)

Introduction

In this series of tutorials we will learn how to build a chat application hosted by the ESP32. Since the ESP32 is a resource constrained device, our application will have very simple functionalities and will be mainly focused on teaching the technologies that will support it, such as:

  • Websockets
  • HTTP
  • SNTP
  • JSON

Besides covering the ESP32 Arduino programming, we will also need to write some JavaScript and HTML code for the chat UI, which will be presented on the browser to the users of the application. We will also write some CSS to make the UI look better. Don’t worry if you don’t have previous experience with these technologies as the code we will write will be very simple and linked to some resources that you can explore.

The expected outcome, at the end of these of tutorials, is shown below in figure 1.

Final layout of the ESP32 chat application.
Figure 1 – Final layout of the chat application.

To build this application we will only need an ESP32 board and some device to to test the chat as a user (a computer with a web browser or a smartphone is enough). I’ve used a a ESP32-E FireBeetle board from DFRobot to perform all my tests.

Why a Chat application?

In a real application scenario, it is extremely unlikely that a chat application would be implemented in a microcontroller. These applications are usually complex and require scalability. They need to handle many users and many messages, which requires more resources than a device such as the ESP32 can offer.

Nonetheless, a simplified chat application has many interesting challenges that can be solved using some of the technologies supported by the ESP32. Furthermore, it is something practical that is fun to build and to see working.

As such, we are doing a chat application for teaching purposes, since it will allows us to cover the basis of some interesting technologies that you can then use in other projects. Naturally, we will support few users and the set of functionalities we will implement will be reduced. Also, we are not going to worry much about performance.

The requirements and architecture of the application

Our chat application will be very simple. There will be a single chat room to which all the users can connect to and disconnect from. A connected user can send a message, that will be received by all the other users. There is no concept of history and messages sent are only delivered to the users currently connected.

A chat application typically requires that the messages sent between connected users are delivered in “real-time”. Here, we use the term “real-time” to indicate “sending the messages as fast as possible”, since the term “real-time” has a different meaning in the world of embedded systems.

As such, websockets are one of the technologies supported by the ESP32 that fit perfectly this requirements, allowing the chat clients to maintain a persistent connection to the ESP32 (the ESP32 will be acting as a websocket server). Then, whenever a chat client produces a message, the ESP32 will be responsible for delivering it to all the other participants of the chat.

To keep things simple, when the ESP32 receives a message, it will broadcast it back to all the websocket connections, meaning the original sender will also receive the message back. Naturally, in a more realistic application scenario, we could exclude the sender when broadcasting the message.

Taking in consideration that the sender will later get its own message back, we will simplify the client code by only displaying messages received. As such, when the client sends a message to the server, it won’t immediately display it in the UI. That way, each client can always display all the received messages from the ESP32 without having to exclude their own. Once again, this is a simplified approach since, in a real application scenario, there are many situations that would justify the client keeping a copy of the message it sends.

Figure 2 displays the communication diagram described before.

Communication diagram between the clients and the ESP32 server.
Figure 2 – Communication diagram between the clients and the ESP32 server, for a message sent by a client.

Besides acting as a websocket server, the ESP32 will also need to serve the HTML, CSS and JavaScript files that will implement the UI of the chat application, which will be shown to the users. As such, the ESP32 will act as HTTP server to serve these files. The websocket client functionality will be implemented in the JavaScript code.

Besides displaying the chat messages, our application will also be able to display when clients connect and disconnect from the chat.

It is important to take in consideration that we are going to do some simplifications in our code, to make it simpler. For example, we won’t be treating all the error situations (message could not be delivered, websocket connection failed, UI fields not properly filled, etc…). Naturally, in a real application scenario, we should take care of all the errors.

Other important aspect to mention is related with security. The files will be delivered over HTTP and the websocket connections won’t be encrypted. This means that all the communications won’t be secure and this application should be used only for testing purposes.

The tutorials

The implementation of the chat application will be divided in multiple parts. At the end of part 1 we will already have a basic prototype working and then we will build the rest incrementally.

  • Part 1: We will implement the basic messaging via websockets on the ESP32 (without timestamps). We will also implement the basic HTML and JavaScript client code that will be able to connect to the server, send messages to the ESP32, display the broadcasted messages and disconnect from the server. At this stage, we will still have the HTML and JavaScript code together in the same file and test it directly from a computer, without serving it from the ESP32.
  • Part 2: We will split the HTML (presentation) and JavaScript (behavior) in two different files and adapt the ESP32 code to serve these files from the file system.
  • Part 3: We will focus on making the UI of our application look better by writing some CSS code to style the components. We will adapt the HTML and the JavaScript files to integrate with these changes and also the ESP32 code to serve the CSS file.
  • Part 4: We will add a timestamp to all the messages on the ESP32 server and adapt the JavaScript code to handle this new field.
  • Part 5: We will start handling the client connected and disconnected events, displaying a message in the chat window whenever they occur.

Previous knowledge

The practical application we are going to develop here is built on top of many tutorials we have already covered on TechTutorialsX. Although each part of this tutorial explains the code used, you may consult the posts below for more details.

Leave a Reply