244 lines
10 KiB
Markdown
244 lines
10 KiB
Markdown
# XCB
|
|
|
|
XCB is a library for communicating with the X-Windows system used on
|
|
Linux, FreeBSD, and other Unix-like operating systems. XCB's interface
|
|
is written in C.
|
|
|
|
## Understanding X, at least the parts I'm dealing with.
|
|
|
|
The X Windows System (hereafter, **X**) is a server. It listens for
|
|
events (keyboard events, mouse events, timer events from connected
|
|
programs, etc.) and "stores" the results on a *display*. The
|
|
`display` is the root the of the **X** hierarchy, and the object to
|
|
which you initially connect. The `display` is described by a string
|
|
which encapsulates the address of the **X** server. The format of the
|
|
string predates the URL standard.
|
|
|
|
Part of the problem with untangling the hierarchy in X-Windows is that
|
|
it's meant to be extremely malleable and re-usable. [The Wikipedia
|
|
article](https://en.wikipedia.org/wiki/X_Window_System_protocols_and_architecture)
|
|
says that the `display` has a top-level window; The [Xlib
|
|
Tutorial](https://tronche.com/gui/x/xlib/introduction/overview.html)
|
|
says a `screen` is a physical monitor, and a workstation can have more
|
|
than one screen, but each screen has its own top-level window. But
|
|
both of these statements are inaccurate!
|
|
|
|
A top-level window can span multiple screens, and each screen may or
|
|
may not have an monitor at all. Headless **X** systems used for
|
|
testing may have an `output`, which is a managed region of memory into
|
|
which **X** is "drawing", but may have no physical device at all!
|
|
|
|
For the purposes of modern **X**, the server manages one or more (in
|
|
commonplace practice, only one) logical `screen` objects, which has
|
|
multiple `output` and `crtc` objects. An `output` is the video output
|
|
manager on your device, such as your GPU, but it may just be a
|
|
virtualized chunk of memory for the headless scenario described above.
|
|
The **X** server is responsible for figuring out at start time what
|
|
`output` objects you have and what `crtc` objects are connected to
|
|
them, assigning `crtc` devices to each `output`, and choosing a
|
|
default set-up that will support running your
|
|
instance of GTK or KDE or whatever.
|
|
|
|
> ASIDE: You kids have it easy. Before the existence of modern,
|
|
> self-describing hardware, we had to hand-enter every single one of
|
|
> these details, and if you got a detail wrong it was possible to burn
|
|
> out your video card or monitor!
|
|
|
|
For example, in a two-monitor set up X will have a single `screen`
|
|
with a single root `window` spanning both monitors, but there would be
|
|
two `output` devices, each with its own `crtc`, mapping a client
|
|
program's output to the physical pixels on the screen. This is how
|
|
it's possible to drag an application window from one monitor to
|
|
another, and have it be visible as it crosses the bezels between them.
|
|
The `output` and `ctrc` together act as the framebuffer manager for
|
|
displays.
|
|
|
|
My goal is to enable autorotation on tablets running **X**, using the
|
|
XCB interface. For the purposes of that fairly straightforward goal,
|
|
I want to find the base `screen`, assert that it has a single `output`
|
|
and a single `crtc`, and then send a command to the `crtc` object to
|
|
rotate its contents to the orientation I desire. The change to the
|
|
`crtc` will cause the `output` object to remap all of the pixels it is
|
|
currently tracking to the new orientation. Most modern window
|
|
managers are pretty good about re-arranging the screen to manage this
|
|
change!
|
|
|
|
> ASIDE: It makes no sense for a virtualized `output` device, one which
|
|
> has no actual display visible to the human eye, to have a `crtc`.
|
|
> It's orientation doesn't matter. If it ever _is_ displayed to a human
|
|
> being it will be in a virtualizing environment such as
|
|
> [Xephyr](https://en.wikipedia.org/wiki/Xephyr), in which case it will
|
|
> be getting its `crtc` information from the host **X** display.</aside>
|
|
|
|
> ASIDE: I'm still not sure how all this interacts with a window manager
|
|
> that has 'workspaces', such as Gnome-Mate. Which means I could be
|
|
> entirely wrong about this whole thing! On the other hand, it could be
|
|
> as simple as every workspace having it's own pseudo-root-window, and
|
|
> being dependent upon the `crtc` object for screen dimensions and pixel
|
|
> mapping. [I have to read this carefully](/https://jichu4n.com/posts/how-x-window-managers-work-and-how-to-write-one-part-i/).</aside>
|
|
>
|
|
TODO: I haven't yet figured out the bit about remapping the tablet's
|
|
touchscreen inputs, so that when you place your finger or stylus on
|
|
the screen **X** maps the pointer location to the right place.
|
|
|
|
## Connecting.
|
|
|
|
To connect to **X** via XCB, you use the `xcb_connect` function. It
|
|
takes two arguments, a string with the name of the display, and a
|
|
pointer-to-int to the default screen's ID, which is a returned value.
|
|
It returns an opaque data structure, 'xcb_connection_t'.
|
|
|
|
```
|
|
xcb_connection_t* xcbConnection = xcb_connect(const char* display, int* screen);
|
|
```
|
|
|
|
There are variants for connection-with-authorization, and
|
|
connection-with-file-descriptor.
|
|
|
|
This function always returns an allocated structure, even on failure.
|
|
You _must_ test for failure with:
|
|
|
|
```
|
|
int error = xcb_connection_has_error(xcb_connection_t* xcbConnection);
|
|
```
|
|
|
|
The error is an number defined in `xcb.h`. See that file for the list
|
|
of possible failure modes.
|
|
|
|
## Disconnecting
|
|
|
|
You _must_ close the connection when you are finished. In the event
|
|
of a connection failure, you _must_ still call this function to
|
|
free the memory XCB used to report the connection failure:
|
|
|
|
```
|
|
void xcb_disconnect(xcb_connection_t* xcbConnection);
|
|
```
|
|
|
|
## Getting the default screen structure
|
|
|
|
Once you've connected, you need the default `screen` structure.
|
|
Oddly, it's at the bottom of a linked list of `screen` structures that
|
|
you have to find by counting down from the default screen ID retrieved
|
|
during the connection, using XCB's supplied traversal functions.
|
|
|
|
Doing this is so commonplace that there's a function for doing it
|
|
provided by the `xcb-aux` extension, which on Ubuntu is accessed
|
|
through `libxcb-util-dev`:
|
|
|
|
```
|
|
xcb_screen_t* screen = xcb_aux_get_screen(xConnection, default_screen_id);
|
|
```
|
|
|
|
The screen structure is not opaque; it contains the root window, as
|
|
well as the width and height of the total display (covering all
|
|
monitors!) that it is expected to manage, along with other details
|
|
that, well, aren't relevant (at least, not yet, and maybe I hope not).
|
|
|
|
## Getting the screen's resources
|
|
|
|
Now we're getting into RandR's portion of the business. RandR is an
|
|
extension to **X** that allows userspace programs to manipulate the
|
|
core functionality of outputs and displays. Those are the resources
|
|
the **X** screen has to draw on. This is also the first time we're
|
|
going to encounter the "standard" XCB interface.
|
|
|
|
XCB has an idiom of sending a request to the **X** server and storing
|
|
a token (a `uint32_t`) that it calls a *cookie*. When it wants to
|
|
review the reply to that request, it asks for the reply using the
|
|
cookie.
|
|
|
|
It's possible to send many requests, both commands-to-set and
|
|
requests-for-information, to the **X** server, and then retrieve the
|
|
replies all at once. If the XCB interface has already received the
|
|
replies, it can hand them over at once; otherwise, it'll wait for
|
|
one. In this way, XCB and **X** can work asynchronously, batching
|
|
transactions and reducing latency.
|
|
|
|
For our purposes, we're not going to do that. We're just going to ask
|
|
for one object. The `get` command uses the rootWindow, which as I
|
|
mentioned earlier is on the `screen` we retrieved as `screen->root`.
|
|
|
|
```
|
|
xcb_generic_error_t* error = nullptr;
|
|
|
|
xrandr_get_screen_resources_cookie_t screen_resources_cookie =
|
|
xcb_randr_get_screen_resources(connection, screen->root);
|
|
|
|
xrandr_get_screen_resources_reply_t* screen_resources =
|
|
xcb_xrandr_get_screen_resources_reply(connection, screen_resources_cookie, &error);
|
|
```
|
|
|
|
**IMPORTANT** `reply` objects are allocated by XCB. *You* are
|
|
responsible for `free()`ing them afterward. If there is an error, the
|
|
reply object will be `null`, but the error object will contain the
|
|
error response as an allocated object and you are responsible for
|
|
`free()`ing it. Only `reply` and `error` objects are allocated; all
|
|
the rest are part of the `connection` object and will be freed on
|
|
disconnect.
|
|
|
|
## Getting the outputs and crtcs
|
|
|
|
Now that we have a screen, we want to get all the `output` devices, find
|
|
the `crtc` associated with it, and see the rotation! As I've been
|
|
using C++, I'm going to store a collection of cookies in vector.
|
|
|
|
There are two idioms in this example; the first collects all the query
|
|
cookies and then iterates through the replies; the second gets the
|
|
query cookie and immediately requests the reply. While the second
|
|
idiom is "slower" by an order of magnitude, on my laptop with a
|
|
shared-memory connection the difference is 3000 nanoseconds vs 300
|
|
nanoseconds-- not enough for most people to notice.
|
|
|
|
Notice that I do *not* free the return from the
|
|
`xcb_randr_get_screen_resources_outputs` call; that is simply
|
|
interpreting the contents of the `screen_resources` object. As
|
|
before, the `screen_resources` object itself will have to be freed
|
|
eventually.
|
|
|
|
```
|
|
std::vector<xcb_randr_get_output_info_cookie_t> output_get_cookies;
|
|
std::vector<xcb_randr_get_output_info_cookie_t> output_crtc_ids;
|
|
|
|
xcb_generic_error_t* error = nullptr;
|
|
|
|
int len = xcb_randr_get_screen_resources_outputs_length(screen_resources);
|
|
xcb_randr_output_t* randr_outputs =
|
|
xcb_randr_get_screen_resources_outputs(screen_resources);
|
|
|
|
for (int i = 0; i < len; ++i) {
|
|
output_get_cookies.push_back(
|
|
xcb_randr_get_output_info(connection, randr_outputs[i], timestamp));
|
|
}
|
|
|
|
for (const auto& cookie : cookies) {
|
|
xcb_randr_get_output_info_reply_t* reply =
|
|
xcb_randr_get_output_info_reply(connection, cookie, &error);
|
|
if (error) {
|
|
free(error);
|
|
continue;
|
|
}
|
|
|
|
xcb_randr_get_crtc_info_reply_t* crtc = xcb_randr_get_crtc_info_reply(
|
|
connection,
|
|
xcb_randr_get_crtc_info(connection, output->crtc, timestamp), NULL);
|
|
|
|
// It's possible that there is no CRTC associated with the
|
|
token. This isn't an error.
|
|
|
|
if (!crtc) {
|
|
continue;
|
|
}
|
|
|
|
std::cout << "(x: " << crtc->x << ", y: " << crtc->y << ") (width: " << crtc->width
|
|
<< ", height: " << crtc->height << ") status:" << unsigned(crtc->status)
|
|
<< " rotation: " << rotation_map(crtc->rotation) << std::endl;
|
|
|
|
free(crtc);
|
|
free(reply);
|
|
}
|
|
```
|
|
|
|
... this is as far as I've gotten. And it's all starting to make
|
|
sense, but wow, what a journey just to get this far.
|