The NanoVer gRPC Protocol
General architecture
NanoVer provides three gRPC services: the state service, the trajectory service, and the command service. While none of these services are strictly mandatory, some features expect two or three services to cooperate.
The state service coordinates a shared data store that is shared between the server and one or more clients. The state is presented as a key-value store; users can subscribe to updates to the state, send updates themselves, and request exclusive write access to some keys. This service is used to: share the position of the users’ avatars, send users’ interactions with molecular systems, and share the molecular representations. It can be used to send any arbitrary data to the server and to the clients.
The trajectory service allows the server to broadcast the state of a simulation to the clients. It sends frames to the clients at the requested frame rate.
The command service lets client run functions on the server. For example, this service is used to pause or reset a molecular simulation.
The services can all be served from different addresses and/or a different ports, however, they are commonly served together from the same address and port. The default port is 38801.
The state service
Introduction
The state service coordinates a shared data store that is shared between the server and the clients. This section explores the technical details of the state service. For an interactive Jupyter notebook tutorial that complements the information presented in this section, check out our commands_and_state notebook (see NanoVer Fundamentals).
State and state updates
The state is thought of as a key-value store.
Clients can subscribe to a stream of updates and need to maintain their own version of the full state.
message StateUpdate {
// Struct where each field is an updated state key and it's latest value,
// where null is equivalent to a key removal.
google.protobuf.Struct changed_keys = 1;
}
The state update is presented as a protobuf Struct. The keys in the update directly refer to the keys in the full state. The values in the update are the new values that the state should contain. Null values are a special case, corresponding to keys that should be removed from the full state. Therefore, the full state cannot contain null values.
A state update can contain nested values. In that case, the whole nested structure must be updated at once. A nested value is still considered as a single value and the protocol does not offer ways of performing a partial update of such structures.
Subscribing to state updates
service State {
// Periodically received aggregated updates from last known state to latest
// state of a shared key/value store.
rpc SubscribeStateUpdates(SubscribeStateUpdatesRequest) returns (stream StateUpdate) {}
...
}
message SubscribeStateUpdatesRequest {
// Interval (in seconds) between update sends.
float update_interval = 1;
}
Clients can subscribe to a stream of updates. The server sends the
updates at the requested rate, waiting at least the requested
update_interval
between two updates. The waiting time may be
longer, though, due to a variety of factors including a slow server or
network delays. Therefore, a client should not assume that the rate is
regular, or even respected. Still, it is important to request the
longest update interval that is suitable for the needs in order to
reduce the load on all the involved actors.
Warning
It is possible for an update to be too large to be transmitted in one gRPC packet. If this happens, the behaviour is undefined.
Updating the state
service State {
// Attempt to make an update to the shared key/value store.
rpc UpdateState(UpdateStateRequest) returns (UpdateStateResponse) {}
}
message UpdateStateRequest {
// Token for associating requests to their lock ownership.
string access_token = 1;
// Updates to make to state.
StateUpdate update = 2;
}
message UpdateStateResponse {
// Whether the update was successful.
bool success = 1;
}
A client can request an update of the state using the UpdateSate
method. The
request contains an access_token
and the update itself. The update is
formatted in the same way as updates received from the server. The
access_token
is an arbitrary string, chosen by the client, and that
identifies that client to the server. The access token is used by the server to
resolve locks that may be set on the keys in the update. The method returns a
UpdateStateResponse
containing a boolean that is true if the update
succeeded.
State updates are “atomic” operations. All the keys in an update are updated at once and they are either all successfully updated or none are updated. An update can fail if one key is locked by another client. See the State locks section.
When an update succeeds, the server incorporates the changes and broadcasts them to all subscribed clients. Clients may receive these updates aggregated with other updates depending on what updates were received by the server during the client’s subscription interval.
Note
A non-existing key can be removed if the locks allow.
A server can make updates to the shared state. How the server does it is out of scope of the protocol, but the server updates need to appear in the state update stream of the subscribed clients.
State locks
service State {
// Attempt to acquire, renew, or release exclusive control of keys in the
// shared key/value store.
rpc UpdateLocks(UpdateLocksRequest) returns (UpdateLocksResponse) {}
}
message UpdateLocksRequest {
// Token for associating requests to their lock ownership.
string access_token = 1;
// Struct where each field an state key and either a time in seconds to
// acquire or renew the lock for, or a null to indicate the lock should be
// released if held.
google.protobuf.Struct lock_keys = 2;
}
message UpdateLocksResponse {
// Whether the locking was successful.
bool success = 1;
}
Multiple clients may update the same key. If they do so close enough in time, other clients will receive a different assortment of these updates which can appear as visual or logical glitches. In practice, if clients display an object with its location bound to a shared state key, and if multiple clients try to move that object, it may appear to jump between different locations as clients receive conflicting locations. To avoid such situations, clients have the ability to request a lock on a key or set of keys.
A lock applies to a key in the shared state. It has an access token, and a duration in seconds during which it is valid. The access token is an arbitrary string, chosen by the client, that associates the client with its locks. The client sends this key alongside its requests to update the shared state, and the update only succeeds if all the keys in the request have no valid locks on them or if the locks are associated with the same access token as in the update request.
A client can create, renew, or remove locks. To do so, it needs to call the
UpdateLocks
method with an UpdateLocksRequest
. The request contains the
access token and a Struct.
with the state key associated with the lock to update as key, and either a
duration in seconds or a Null value
as value. If the value is a duration, then the lock is created or renewed with
the requested validity duration. If the value is null, then the lock is deleted.
A lock can only be updated if:
it does not yet exist
it exists but has expired
it is held by the same access token as the request
Each update can be about one or multiple locks; a request only succeeds if all the locks can be updated. If any of the locks cannot be updated, then none of the locks are updated.
Note
Locks can be applied to non-existing keys. Removing a lock does not remove the key on which it was applied and removing a key does not remove a lock that would apply to it.
Security issues
The way to handle updates larger than a gRPC packet is undefined. Servers may implement that case by shutting down, implementing solutions that lead to a stale state or a degraded experience. This makes the state service very susceptible to low effort denial of service attacks.
For now, no server nor client implement any form of encryption. Therefore, the access tokens used to lock keys in the shared state should be considered publicly exposed.
The trajectory service
Introduction
A server can broadcast molecular systems using the trajectory service. Molecular systems can be running simulations, static structures, recorded trajectories, or any collection of particles regardless of how they are produced. They are represented as a sequence of one or more frames where each frame represents a state of the molecular system.
Note
The trajectory service was initially designed with molecular systems in mind, hence the wording in this documentation. However, while we established a set of conventions to represent such systems, the protocol is not limited to them.
Frame description
/* A general structure to represent a frame of a trajectory.
It is similar in structure to the Google Struct message,
representing dynamically typed objects and lists. However,
as frames often consist of large arrays of data of the same
type, a set of arrays are also provided as specified in
nanover/protocol/array.proto */
message FrameData {
/* A standard key-value list of dynamically typed data */
map<string, google.protobuf.Value> values = 1;
/* A key-value list of value arrays */
map<string, nanover.protocol.ValueArray> arrays = 2;
}
NanoVer describes frames using the FrameData
structure. A FrameData
contains two key-value maps to describe the changes from the previous state of
the trajectory. An implementation using this structure needs to maintain an
aggregate FrameData
and merge all incoming frames to get the current state
of the system.
A FrameData
contains two fields: values
and arrays
.
The
values
field is a key-value map where each key is a string and each value is a protobuf Value. This map typically stores simple data related to the frame: data consisting of a single number, boolean, or string. This being said, it can contain more complex data structures such as heterogeneous lists or protobuf Structs.The
arrays
field is a key-value map in which homogeneous arrays (i.e. arrays where all the values have the same type) can be stored. In this map, each key is a string and each value is aValueArray
, which can contain a homogeneous array of either floats (FloatArray
), unsigned integers (IndexArray
), or strings (StringArray
).
The meaning of the keys in both fields of the FrameData
depends on the application.
To see examples of how these types of data are added to frames in practice, take a look at our frame tutorial notebook (see NanoVer Fundamentals).
message FloatArray {
repeated float values = 1;
}
message IndexArray {
repeated uint32 values = 1;
}
message StringArray {
repeated string values = 1;
}
message ValueArray {
oneof values {
FloatArray float_values = 1;
IndexArray index_values = 2;
StringArray string_values = 3;
}
}
While a FrameData
can describe a full frame, it is mostly used to describe
the changes in a frame compared to the previous ones. As such, it is expected
that a program working with these frames will merge them. A FrameData
contains the key-value pairs to change for both the values
and the
arrays
fields. In case of complex structures in values
, the new
FrameData
needs to contain the full new value even if only part of it
changed. Likewise for arrays
, the new FrameData
needs to contain the full array in arrays
even if only a
single element of it has changed. When merging, key-value pairs from the new frame
replace those from the aggregated frame. Key-value pairs that are only in the
new frame are added to the aggregated frame. Pairs that do not appear in the
new frame remain untouched in the aggregated one.
Note
This aggregation process is made use of in NanoVer’s interactive molecular
dynamics application, in which clients can access the most
recent updates to the frame (NanoverImdClient.latest_frame
) or
the full set of aggregated data pertaining to the current frame of the
simulation (NanoverImdClient.current_frame
).
Here is an example of frames being merged:
Aggregated frame: New frame: Resulting frame:
* values: * values: * values:
- key0: A - key1: B - key0: A
- key1: A + - key4: B = - key1: B
* arrays: * arrays: - key4: B
- key2: A - key2: B * arrays:
- key3: A - key2: B
- key3: A
When part of a stream, FrameData
messages are wrapped into GetFrameResponse
ones.
/* A server response representing a frame of a molecular trajectory */
message GetFrameResponse {
/* An identifier for the frame */
uint32 frame_index = 1;
/* The frame of the trajectory, which may contain positions and topology information */
nanover.protocol.trajectory.FrameData frame = 2;
}
A GetFrameResponse
message contains a FrameData
and a frame index. This
index is an unsigned integer that is commonly incremented every time a new
frame is created. The exact value of the index, however, is only meaningful
when it is 0. When it is 0, it signals that the aggregated frame needs to be
reset. This can occur when the new frame originates from a completely different
simulation, for instance. In this case, the aggregated frame and the new frame
do not describe the same system and they should not be merged. Note that this
is the only mechanism that allows the removal of a key from the aggregated frame.
Subscribing to the latest frames
/* A service which provides access to frames of a trajectory,
which may either be precomputed or represent a live simulation.
It can also be used to obtain one or more frames on demand,
allowing molecules or trajectories to be generated based on requests */
service TrajectoryService {
/* Subscribe to a continuous updating source of frames.
The client gets the latest available frame at the time of transmission. */
rpc SubscribeLatestFrames (GetFrameRequest) returns (stream GetFrameResponse);
}
/* A client request to get frame(s) from a trajectory service */
message GetFrameRequest {
/* Arbitrary data that can be used by a TrajectoryService to
decide what frames to return */
google.protobuf.Struct data = 1;
/* Interval to send new frames at e.g 1/30 sends 30 frames every second. */
float frame_interval = 2;
}
A client can subscribe to a stream of the frames broadcast by the server
using the SubscribeLatestFrames
method. When subscribing, the client sends
a GetFrameRequest
message with a time interval expressed in seconds. The
server will try to send new frames as they are available and at most at this
interval. If multiple frames were produced during the interval, the server will
send the aggregate of these frames. The frames are sent as GetFrameResponse
messages.
When subscribing, a client may provide additional data as part of the
GetFrameRequest
. This aims at allowing some server-side filtering of the
broadcast frames. At this time, no server uses this data.
Note
A client subscribed to this stream may miss some data. If more than one frame is generated by the server during the interval, then an aggregated frame is sent by the server. This can cause the client to miss data when one frame overwrites keys from the previous one. Client should expect to always receive the latest state of the trajectory, but not to receive all the time points generated by the server.
Subscribing to all frames
/* A service which provides access to frames of a trajectory,
which may either be precomputed or represent a live simulation.
It can also be used to obtain one or more frames on demand,
allowing molecules or trajectories to be generated based on requests */
service TrajectoryService {
/* Subscribe to a continuous updating source of frames.
Frames are added to the stream when they are available */
rpc SubscribeFrames (GetFrameRequest) returns (stream GetFrameResponse);
}
Optionally, a server may allow a client to subscribe to all the broadcast
frames using the SubscribeFrames
method. In this case, the client sends a
GetFrameRequest
with a time interval and possibly extra data. The server
will send frames as GetFrameResponse
objects when they are available and at
most at the requested interval. However, the frames will not be aggregated so
the last frame received by the client may not be the last frame that was
produced. A client subscribing to this stream will receive all the time points
produced by the server, but may lag behind the current state of the simulation.
This subscription method can be a security risk and servers may choose to not implement it. Indeed, if a client subscribes to all the frames with a long interval, the server needs to record all the frames until they are sent to the client. This can cause significant disk and/or memory usage.
The command service
Introduction
A server can expose functions that clients can call. Such functions can take arguments and return values. Each function itself should return shortly after being called.
These functions are exposed through the command service. A client can use this service to list the commands that are available and to call commands.
Each command has a name and a list of arguments. The name is an arbitrary
string. By convention the name can be attached to a namespace by naming the
command namespace/command_name
.
Listing available commands
service Command {
/* Get a list of all the commands available on this service */
rpc GetCommands (GetCommandsRequest) returns (GetCommandsReply) {}
}
message GetCommandsRequest {
}
message GetCommandsReply{
repeated CommandMessage commands = 1;
}
message CommandMessage {
string name = 1;
google.protobuf.Struct arguments = 2;
}
A client can call the GetCommands
method to list the commands exposed by
the server. It needs to send a GetCommandRequest
message—that is a message
without content—and it receives a list of the commands. This list is wrapped
in a GetCommandsReply
under the commands
field. Each command is
presented as a CommandMessage
that contains the name of the command, and
the list of arguments that the command accepts alongside the default values for
these arguments.
Running commands
service Command {
/* Runs a command on the service */
rpc RunCommand (CommandMessage) returns (CommandReply) {}
}
message CommandReply {
google.protobuf.Struct result = 1;
}
message CommandMessage {
string name = 1;
google.protobuf.Struct arguments = 2;
}
To invoke a command, a client needs to run the RunCommand
method with a
CommandMessage
. The CommandMessage
contains the name of the command to
invoke and a Struct of arguments to pass to the command. The server will use
the default value for arguments that are missing from the CommandMessage
.
The RunCommand
method returns a CommandReply
that contains a Struct of
the values returned by the server-side function.
If the name of the command or one of the names of an argument is unknown to the
server, the RunCommand
method fails with a INVALID_ARGUMENT
status
code.
Note
The protocol does not have an in-built way of handling errors during the execution of the command. It does not have an in-built way of handling long-running commands either.
For an interactive Jupyter notebook tutorial that demonstrates how to set up and run commands in practice, check out our commands_and_state notebook (see NanoVer Fundamentals).