Microsoft filed a patent in January, which became available to public this week, for a series of head-mounted display devices and their potential gaming applications.

GameSpot are reporting on the filing that was spotted by a NeoGAF user which is dubbed “Multiplayer gaming with head-mounted display.”

The system is described as “a system and related methods for inviting a potential player to participate in a multiplayer game via a user head-mounted display device.”

One example given demonstrated how users can use voice data as an invitation to participate in a multiplayer game.

“The program receives eye-tracking information, depth information, facial recognition information, potential player head-mounted display device information, and/or potential player voice data,” reads the example.

“The HMD device may include an eye-tracking system that utilises at least one inward-facing sensor. The inward-facing sensor may be an image sensor that is configured to acquire image data in the form of eye-tracking information from a user’s eyes. Provided the user has consented to the acquisition and use of this information, the eye-tracking system may use this information to track the position and/or movement of the user’s eyes. The eye-tracking system may then determine where, and/or at what person or object the user is looking. In another example, the inward-facing sensor may capture retinal scan information from a user’s retina. Provided the user has consented to the acquisition and use of this information, such information may be used to identify the user wearing the HMD device.”

The figures also show outward-facing sensors which detect the environment of the wearer, locate users sharing the space, and track gestures performed by the wearer. There’s also numerous mentions of “time-of-flight depth camera” systems which GameSpot notes, “draw parallels with similar technology included in the updated Kinect 2.0 that will ship alongside the upcoming Xbox One console.”

“The HMD device may also include an optical sensor system that utilises at least one outward-facing sensor, such as an optical sensor. Outward-facing sensor may detect movements within its field of view, such as gesture-based inputs or other movements performed by a user or by a person within the field of view. Outward-facing sensor may also capture image information, such as facial recognition information, and depth information from a physical environment and real-world objects within the environment. For example, outward-facing sensor may include a depth camera, a visible light camera, an infrared light camera, and/or a position-tracking camera.”

“Motion sensors may also be employed as user input devices, such that a user may interact with the HMD device via gestures of the neck and head, or even of the body.”

“In other examples, audio may be presented to the user via one or more speakers on the HMD device. Such audio may include, for example, music, instructions, and/or other communication from the multiplayer game program, the potential player invitation program, or other sources.”

You can see all the figures on NeoGaf