root / qemu-doc.texi @ 6c9bf893
History | View | Annotate | Download (33.5 kB)
1 |
\input texinfo @c -*- texinfo -*- |
---|---|
2 |
|
3 |
@iftex |
4 |
@settitle QEMU CPU Emulator Reference Documentation |
5 |
@titlepage |
6 |
@sp 7 |
7 |
@center @titlefont{QEMU CPU Emulator Reference Documentation} |
8 |
@sp 3 |
9 |
@end titlepage |
10 |
@end iftex |
11 |
|
12 |
@chapter Introduction |
13 |
|
14 |
@section Features |
15 |
|
16 |
QEMU is a FAST! processor emulator. By using dynamic translation it |
17 |
achieves a reasonnable speed while being easy to port on new host |
18 |
CPUs. |
19 |
|
20 |
QEMU has two operating modes: |
21 |
|
22 |
@itemize @minus |
23 |
|
24 |
@item |
25 |
User mode emulation. In this mode, QEMU can launch Linux processes |
26 |
compiled for one CPU on another CPU. Linux system calls are converted |
27 |
because of endianness and 32/64 bit mismatches. The Wine Windows API |
28 |
emulator (@url{http://www.winehq.org}) and the DOSEMU DOS emulator |
29 |
(@url{http://www.dosemu.org}) are the main targets for QEMU. |
30 |
|
31 |
@item |
32 |
Full system emulation. In this mode, QEMU emulates a full |
33 |
system, including a processor and various peripherials. Currently, it |
34 |
is only used to launch an x86 Linux kernel on an x86 Linux system. It |
35 |
enables easier testing and debugging of system code. It can also be |
36 |
used to provide virtual hosting of several virtual PCs on a single |
37 |
server. |
38 |
|
39 |
@end itemize |
40 |
|
41 |
As QEMU requires no host kernel patches to run, it is very safe and |
42 |
easy to use. |
43 |
|
44 |
QEMU generic features: |
45 |
|
46 |
@itemize |
47 |
|
48 |
@item User space only or full system emulation. |
49 |
|
50 |
@item Using dynamic translation to native code for reasonnable speed. |
51 |
|
52 |
@item Working on x86 and PowerPC hosts. Being tested on ARM, Sparc32, Alpha and S390. |
53 |
|
54 |
@item Self-modifying code support. |
55 |
|
56 |
@item Precise exceptions support. |
57 |
|
58 |
@item The virtual CPU is a library (@code{libqemu}) which can be used |
59 |
in other projects. |
60 |
|
61 |
@end itemize |
62 |
|
63 |
QEMU user mode emulation features: |
64 |
@itemize |
65 |
@item Generic Linux system call converter, including most ioctls. |
66 |
|
67 |
@item clone() emulation using native CPU clone() to use Linux scheduler for threads. |
68 |
|
69 |
@item Accurate signal handling by remapping host signals to target signals. |
70 |
@end itemize |
71 |
@end itemize |
72 |
|
73 |
QEMU full system emulation features: |
74 |
@itemize |
75 |
@item QEMU can either use a full software MMU for maximum portability or use the host system call mmap() to simulate the target MMU. |
76 |
@end itemize |
77 |
|
78 |
@section x86 emulation |
79 |
|
80 |
QEMU x86 target features: |
81 |
|
82 |
@itemize |
83 |
|
84 |
@item The virtual x86 CPU supports 16 bit and 32 bit addressing with segmentation. |
85 |
LDT/GDT and IDT are emulated. VM86 mode is also supported to run DOSEMU. |
86 |
|
87 |
@item Support of host page sizes bigger than 4KB in user mode emulation. |
88 |
|
89 |
@item QEMU can emulate itself on x86. |
90 |
|
91 |
@item An extensive Linux x86 CPU test program is included @file{tests/test-i386}. |
92 |
It can be used to test other x86 virtual CPUs. |
93 |
|
94 |
@end itemize |
95 |
|
96 |
Current QEMU limitations: |
97 |
|
98 |
@itemize |
99 |
|
100 |
@item No SSE/MMX support (yet). |
101 |
|
102 |
@item No x86-64 support. |
103 |
|
104 |
@item IPC syscalls are missing. |
105 |
|
106 |
@item The x86 segment limits and access rights are not tested at every |
107 |
memory access. |
108 |
|
109 |
@item On non x86 host CPUs, @code{double}s are used instead of the non standard |
110 |
10 byte @code{long double}s of x86 for floating point emulation to get |
111 |
maximum performances. |
112 |
|
113 |
@item Some priviledged instructions or behaviors are missing, especially for segment protection testing (yet). |
114 |
|
115 |
@end itemize |
116 |
|
117 |
@section ARM emulation |
118 |
|
119 |
@itemize |
120 |
|
121 |
@item ARM emulation can currently launch small programs while using the |
122 |
generic dynamic code generation architecture of QEMU. |
123 |
|
124 |
@item No FPU support (yet). |
125 |
|
126 |
@item No automatic regression testing (yet). |
127 |
|
128 |
@end itemize |
129 |
|
130 |
@section SPARC emulation |
131 |
|
132 |
The SPARC emulation is currently in development. |
133 |
|
134 |
@chapter Installation |
135 |
|
136 |
If you want to compile QEMU, please read the @file{README} which gives |
137 |
the related information. Otherwise just download the binary |
138 |
distribution (@file{qemu-XXX-i386.tar.gz}) and untar it as root in |
139 |
@file{/}: |
140 |
|
141 |
@example |
142 |
su |
143 |
cd / |
144 |
tar zxvf /tmp/qemu-XXX-i386.tar.gz |
145 |
@end example |
146 |
|
147 |
@chapter QEMU User space emulator invocation |
148 |
|
149 |
@section Quick Start |
150 |
|
151 |
In order to launch a Linux process, QEMU needs the process executable |
152 |
itself and all the target (x86) dynamic libraries used by it. |
153 |
|
154 |
@itemize |
155 |
|
156 |
@item On x86, you can just try to launch any process by using the native |
157 |
libraries: |
158 |
|
159 |
@example |
160 |
qemu-i386 -L / /bin/ls |
161 |
@end example |
162 |
|
163 |
@code{-L /} tells that the x86 dynamic linker must be searched with a |
164 |
@file{/} prefix. |
165 |
|
166 |
@item Since QEMU is also a linux process, you can launch qemu with qemu (NOTE: you can only do that if you compiled QEMU from the sources): |
167 |
|
168 |
@example |
169 |
qemu-i386 -L / qemu-i386 -L / /bin/ls |
170 |
@end example |
171 |
|
172 |
@item On non x86 CPUs, you need first to download at least an x86 glibc |
173 |
(@file{qemu-runtime-i386-XXX-.tar.gz} on the QEMU web page). Ensure that |
174 |
@code{LD_LIBRARY_PATH} is not set: |
175 |
|
176 |
@example |
177 |
unset LD_LIBRARY_PATH |
178 |
@end example |
179 |
|
180 |
Then you can launch the precompiled @file{ls} x86 executable: |
181 |
|
182 |
@example |
183 |
qemu-i386 tests/i386/ls |
184 |
@end example |
185 |
You can look at @file{qemu-binfmt-conf.sh} so that |
186 |
QEMU is automatically launched by the Linux kernel when you try to |
187 |
launch x86 executables. It requires the @code{binfmt_misc} module in the |
188 |
Linux kernel. |
189 |
|
190 |
@item The x86 version of QEMU is also included. You can try weird things such as: |
191 |
@example |
192 |
qemu-i386 /usr/local/qemu-i386/bin/qemu-i386 /usr/local/qemu-i386/bin/ls-i386 |
193 |
@end example |
194 |
|
195 |
@end itemize |
196 |
|
197 |
@section Wine launch |
198 |
|
199 |
@itemize |
200 |
|
201 |
@item Ensure that you have a working QEMU with the x86 glibc |
202 |
distribution (see previous section). In order to verify it, you must be |
203 |
able to do: |
204 |
|
205 |
@example |
206 |
qemu-i386 /usr/local/qemu-i386/bin/ls-i386 |
207 |
@end example |
208 |
|
209 |
@item Download the binary x86 Wine install |
210 |
(@file{qemu-XXX-i386-wine.tar.gz} on the QEMU web page). |
211 |
|
212 |
@item Configure Wine on your account. Look at the provided script |
213 |
@file{/usr/local/qemu-i386/bin/wine-conf.sh}. Your previous |
214 |
@code{$@{HOME@}/.wine} directory is saved to @code{$@{HOME@}/.wine.org}. |
215 |
|
216 |
@item Then you can try the example @file{putty.exe}: |
217 |
|
218 |
@example |
219 |
qemu-i386 /usr/local/qemu-i386/wine/bin/wine /usr/local/qemu-i386/wine/c/Program\ Files/putty.exe |
220 |
@end example |
221 |
|
222 |
@end itemize |
223 |
|
224 |
@section Command line options |
225 |
|
226 |
@example |
227 |
usage: qemu-i386 [-h] [-d] [-L path] [-s size] program [arguments...] |
228 |
@end example |
229 |
|
230 |
@table @option |
231 |
@item -h |
232 |
Print the help |
233 |
@item -L path |
234 |
Set the x86 elf interpreter prefix (default=/usr/local/qemu-i386) |
235 |
@item -s size |
236 |
Set the x86 stack size in bytes (default=524288) |
237 |
@end table |
238 |
|
239 |
Debug options: |
240 |
|
241 |
@table @option |
242 |
@item -d |
243 |
Activate log (logfile=/tmp/qemu.log) |
244 |
@item -p pagesize |
245 |
Act as if the host page size was 'pagesize' bytes |
246 |
@end table |
247 |
|
248 |
@chapter QEMU System emulator invocation |
249 |
|
250 |
@section Introduction |
251 |
|
252 |
@c man begin DESCRIPTION |
253 |
|
254 |
The QEMU System emulator simulates a complete PC. It can either boot |
255 |
directly a Linux kernel (without any BIOS or boot loader) or boot like a |
256 |
real PC with the included BIOS. |
257 |
|
258 |
In order to meet specific user needs, two versions of QEMU are |
259 |
available: |
260 |
|
261 |
@enumerate |
262 |
|
263 |
@item |
264 |
@code{qemu-fast} uses the host Memory Management Unit (MMU) to simulate |
265 |
the x86 MMU. It is @emph{fast} but has limitations because the whole 4 GB |
266 |
address space cannot be used and some memory mapped peripherials |
267 |
cannot be emulated accurately yet. Therefore, a specific Linux kernel |
268 |
must be used (@xref{linux_compile}). |
269 |
|
270 |
@item |
271 |
@code{qemu} uses a software MMU. It is about @emph{two times |
272 |
slower} but gives a more accurate emulation. |
273 |
|
274 |
@end enumerate |
275 |
|
276 |
QEMU emulates the following PC peripherials: |
277 |
|
278 |
@itemize @minus |
279 |
@item |
280 |
VGA (hardware level, including all non standard modes) |
281 |
@item |
282 |
PS/2 mouse and keyboard |
283 |
@item |
284 |
2 IDE interfaces with hard disk and CD-ROM support |
285 |
@item |
286 |
NE2000 network adapter (port=0x300, irq=9) |
287 |
@item |
288 |
Serial port |
289 |
@item |
290 |
Soundblaster 16 card |
291 |
@item |
292 |
PIC (interrupt controler) |
293 |
@item |
294 |
PIT (timers) |
295 |
@item |
296 |
CMOS memory |
297 |
@end itemize |
298 |
|
299 |
@c man end |
300 |
|
301 |
@section Quick Start |
302 |
|
303 |
Download and uncompress the linux image (@file{linux.img}) and type: |
304 |
|
305 |
@example |
306 |
qemu linux.img |
307 |
@end example |
308 |
|
309 |
Linux should boot and give you a prompt. |
310 |
|
311 |
@section Direct Linux Boot and Network emulation |
312 |
|
313 |
This section explains how to launch a Linux kernel inside QEMU without |
314 |
having to make a full bootable image. It is very useful for fast Linux |
315 |
kernel testing. The QEMU network configuration is also explained. |
316 |
|
317 |
@enumerate |
318 |
@item |
319 |
Download the archive @file{linux-test-xxx.tar.gz} containing a Linux |
320 |
kernel and a disk image. |
321 |
|
322 |
@item Optional: If you want network support (for example to launch X11 examples), you |
323 |
must copy the script @file{qemu-ifup} in @file{/etc} and configure |
324 |
properly @code{sudo} so that the command @code{ifconfig} contained in |
325 |
@file{qemu-ifup} can be executed as root. You must verify that your host |
326 |
kernel supports the TUN/TAP network interfaces: the device |
327 |
@file{/dev/net/tun} must be present. |
328 |
|
329 |
When network is enabled, there is a virtual network connection between |
330 |
the host kernel and the emulated kernel. The emulated kernel is seen |
331 |
from the host kernel at IP address 172.20.0.2 and the host kernel is |
332 |
seen from the emulated kernel at IP address 172.20.0.1. |
333 |
|
334 |
@item Launch @code{qemu.sh}. You should have the following output: |
335 |
|
336 |
@example |
337 |
> ./qemu.sh |
338 |
Connected to host network interface: tun0 |
339 |
Linux version 2.4.21 (bellard@voyager.localdomain) (gcc version 3.2.2 20030222 (Red Hat Linux 3.2.2-5)) #5 Tue Nov 11 18:18:53 CET 2003 |
340 |
BIOS-provided physical RAM map: |
341 |
BIOS-e801: 0000000000000000 - 000000000009f000 (usable) |
342 |
BIOS-e801: 0000000000100000 - 0000000002000000 (usable) |
343 |
32MB LOWMEM available. |
344 |
On node 0 totalpages: 8192 |
345 |
zone(0): 4096 pages. |
346 |
zone(1): 4096 pages. |
347 |
zone(2): 0 pages. |
348 |
Kernel command line: root=/dev/hda sb=0x220,5,1,5 ide2=noprobe ide3=noprobe ide4=noprobe ide5=noprobe console=ttyS0 |
349 |
ide_setup: ide2=noprobe |
350 |
ide_setup: ide3=noprobe |
351 |
ide_setup: ide4=noprobe |
352 |
ide_setup: ide5=noprobe |
353 |
Initializing CPU#0 |
354 |
Detected 2399.621 MHz processor. |
355 |
Console: colour EGA 80x25 |
356 |
Calibrating delay loop... 4744.80 BogoMIPS |
357 |
Memory: 28872k/32768k available (1210k kernel code, 3508k reserved, 266k data, 64k init, 0k highmem) |
358 |
Dentry cache hash table entries: 4096 (order: 3, 32768 bytes) |
359 |
Inode cache hash table entries: 2048 (order: 2, 16384 bytes) |
360 |
Mount cache hash table entries: 512 (order: 0, 4096 bytes) |
361 |
Buffer-cache hash table entries: 1024 (order: 0, 4096 bytes) |
362 |
Page-cache hash table entries: 8192 (order: 3, 32768 bytes) |
363 |
CPU: Intel Pentium Pro stepping 03 |
364 |
Checking 'hlt' instruction... OK. |
365 |
POSIX conformance testing by UNIFIX |
366 |
Linux NET4.0 for Linux 2.4 |
367 |
Based upon Swansea University Computer Society NET3.039 |
368 |
Initializing RT netlink socket |
369 |
apm: BIOS not found. |
370 |
Starting kswapd |
371 |
Journalled Block Device driver loaded |
372 |
Detected PS/2 Mouse Port. |
373 |
pty: 256 Unix98 ptys configured |
374 |
Serial driver version 5.05c (2001-07-08) with no serial options enabled |
375 |
ttyS00 at 0x03f8 (irq = 4) is a 16450 |
376 |
ne.c:v1.10 9/23/94 Donald Becker (becker@scyld.com) |
377 |
Last modified Nov 1, 2000 by Paul Gortmaker |
378 |
NE*000 ethercard probe at 0x300: 52 54 00 12 34 56 |
379 |
eth0: NE2000 found at 0x300, using IRQ 9. |
380 |
RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize |
381 |
Uniform Multi-Platform E-IDE driver Revision: 7.00beta4-2.4 |
382 |
ide: Assuming 50MHz system bus speed for PIO modes; override with idebus=xx |
383 |
hda: QEMU HARDDISK, ATA DISK drive |
384 |
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14 |
385 |
hda: attached ide-disk driver. |
386 |
hda: 20480 sectors (10 MB) w/256KiB Cache, CHS=20/16/63 |
387 |
Partition check: |
388 |
hda: |
389 |
Soundblaster audio driver Copyright (C) by Hannu Savolainen 1993-1996 |
390 |
NET4: Linux TCP/IP 1.0 for NET4.0 |
391 |
IP Protocols: ICMP, UDP, TCP, IGMP |
392 |
IP: routing cache hash table of 512 buckets, 4Kbytes |
393 |
TCP: Hash tables configured (established 2048 bind 4096) |
394 |
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0. |
395 |
EXT2-fs warning: mounting unchecked fs, running e2fsck is recommended |
396 |
VFS: Mounted root (ext2 filesystem). |
397 |
Freeing unused kernel memory: 64k freed |
398 |
|
399 |
Linux version 2.4.21 (bellard@voyager.localdomain) (gcc version 3.2.2 20030222 (Red Hat Linux 3.2.2-5)) #5 Tue Nov 11 18:18:53 CET 2003 |
400 |
|
401 |
QEMU Linux test distribution (based on Redhat 9) |
402 |
|
403 |
Type 'exit' to halt the system |
404 |
|
405 |
sh-2.05b# |
406 |
@end example |
407 |
|
408 |
@item |
409 |
Then you can play with the kernel inside the virtual serial console. You |
410 |
can launch @code{ls} for example. Type @key{Ctrl-a h} to have an help |
411 |
about the keys you can type inside the virtual serial console. In |
412 |
particular, use @key{Ctrl-a x} to exit QEMU and use @key{Ctrl-a b} as |
413 |
the Magic SysRq key. |
414 |
|
415 |
@item |
416 |
If the network is enabled, launch the script @file{/etc/linuxrc} in the |
417 |
emulator (don't forget the leading dot): |
418 |
@example |
419 |
. /etc/linuxrc |
420 |
@end example |
421 |
|
422 |
Then enable X11 connections on your PC from the emulated Linux: |
423 |
@example |
424 |
xhost +172.20.0.2 |
425 |
@end example |
426 |
|
427 |
You can now launch @file{xterm} or @file{xlogo} and verify that you have |
428 |
a real Virtual Linux system ! |
429 |
|
430 |
@end enumerate |
431 |
|
432 |
NOTES: |
433 |
@enumerate |
434 |
@item |
435 |
A 2.5.74 kernel is also included in the archive. Just |
436 |
replace the bzImage in qemu.sh to try it. |
437 |
|
438 |
@item |
439 |
qemu creates a temporary file in @var{$QEMU_TMPDIR} (@file{/tmp} is the |
440 |
default) containing all the simulated PC memory. If possible, try to use |
441 |
a temporary directory using the tmpfs filesystem to avoid too many |
442 |
unnecessary disk accesses. |
443 |
|
444 |
@item |
445 |
In order to exit cleanly from qemu, you can do a @emph{shutdown} inside |
446 |
qemu. qemu will automatically exit when the Linux shutdown is done. |
447 |
|
448 |
@item |
449 |
You can boot slightly faster by disabling the probe of non present IDE |
450 |
interfaces. To do so, add the following options on the kernel command |
451 |
line: |
452 |
@example |
453 |
ide1=noprobe ide2=noprobe ide3=noprobe ide4=noprobe ide5=noprobe |
454 |
@end example |
455 |
|
456 |
@item |
457 |
The example disk image is a modified version of the one made by Kevin |
458 |
Lawton for the plex86 Project (@url{www.plex86.org}). |
459 |
|
460 |
@end enumerate |
461 |
|
462 |
@section Invocation |
463 |
|
464 |
@example |
465 |
@c man begin SYNOPSIS |
466 |
usage: qemu [options] [disk_image] |
467 |
@c man end |
468 |
@end example |
469 |
|
470 |
@c man begin OPTIONS |
471 |
@var{disk_image} is a raw hard disk image for IDE hard disk 0. |
472 |
|
473 |
General options: |
474 |
@table @option |
475 |
@item -fda file |
476 |
@item -fdb file |
477 |
Use @var{file} as floppy disk 0/1 image (@xref{disk_images}). |
478 |
|
479 |
@item -hda file |
480 |
@item -hdb file |
481 |
@item -hdc file |
482 |
@item -hdd file |
483 |
Use @var{file} as hard disk 0, 1, 2 or 3 image (@xref{disk_images}). |
484 |
|
485 |
@item -cdrom file |
486 |
Use @var{file} as CD-ROM image (you cannot use @option{-hdc} and and |
487 |
@option{-cdrom} at the same time). |
488 |
|
489 |
@item -boot [a|b|c|d] |
490 |
Boot on floppy (a, b), hard disk (c) or CD-ROM (d). Hard disk boot is |
491 |
the default. |
492 |
|
493 |
@item -snapshot |
494 |
Write to temporary files instead of disk image files. In this case, |
495 |
the raw disk image you use is not written back. You can however force |
496 |
the write back by pressing @key{C-a s} (@xref{disk_images}). |
497 |
|
498 |
@item -m megs |
499 |
Set virtual RAM size to @var{megs} megabytes. |
500 |
|
501 |
@item -n script |
502 |
Set network init script [default=/etc/qemu-ifup]. This script is |
503 |
launched to configure the host network interface (usually tun0) |
504 |
corresponding to the virtual NE2000 card. |
505 |
|
506 |
@item -initrd file |
507 |
Use @var{file} as initial ram disk. |
508 |
|
509 |
@item -tun-fd fd |
510 |
Assumes @var{fd} talks to tap/tun and use it. Read |
511 |
@url{http://bellard.org/qemu/tetrinet.html} to have an example of its |
512 |
use. |
513 |
|
514 |
@item -nographic |
515 |
|
516 |
Normally, QEMU uses SDL to display the VGA output. With this option, |
517 |
you can totally disable graphical output so that QEMU is a simple |
518 |
command line application. The emulated serial port is redirected on |
519 |
the console. Therefore, you can still use QEMU to debug a Linux kernel |
520 |
with a serial console. |
521 |
|
522 |
@end table |
523 |
|
524 |
Linux boot specific (does not require a full PC boot with a BIOS): |
525 |
@table @option |
526 |
|
527 |
@item -kernel bzImage |
528 |
Use @var{bzImage} as kernel image. |
529 |
|
530 |
@item -append cmdline |
531 |
Use @var{cmdline} as kernel command line |
532 |
|
533 |
@item -initrd file |
534 |
Use @var{file} as initial ram disk. |
535 |
|
536 |
@end table |
537 |
|
538 |
Debug options: |
539 |
@table @option |
540 |
@item -s |
541 |
Wait gdb connection to port 1234 (@xref{gdb_usage}). |
542 |
@item -p port |
543 |
Change gdb connection port. |
544 |
@item -d |
545 |
Output log in /tmp/qemu.log |
546 |
@end table |
547 |
|
548 |
During emulation, use @key{C-a h} to get terminal commands: |
549 |
|
550 |
@table @key |
551 |
@item C-a h |
552 |
Print this help |
553 |
@item C-a x |
554 |
Exit emulatior |
555 |
@item C-a s |
556 |
Save disk data back to file (if -snapshot) |
557 |
@item C-a b |
558 |
Send break (magic sysrq) |
559 |
@item C-a C-a |
560 |
Send C-a |
561 |
@end table |
562 |
@c man end |
563 |
|
564 |
@ignore |
565 |
|
566 |
@setfilename qemu |
567 |
@settitle QEMU System Emulator |
568 |
|
569 |
@c man begin SEEALSO |
570 |
The HTML documentation of QEMU for more precise information and Linux |
571 |
user mode emulator invocation. |
572 |
@c man end |
573 |
|
574 |
@c man begin AUTHOR |
575 |
Fabrice Bellard |
576 |
@c man end |
577 |
|
578 |
@end ignore |
579 |
|
580 |
@end ignore |
581 |
@node disk_images |
582 |
@section Disk Images |
583 |
|
584 |
@subsection Raw disk images |
585 |
|
586 |
The disk images can simply be raw images of the hard disk. You can |
587 |
create them with the command: |
588 |
@example |
589 |
dd if=/dev/zero of=myimage bs=1024 count=mysize |
590 |
@end example |
591 |
where @var{myimage} is the image filename and @var{mysize} is its size |
592 |
in kilobytes. |
593 |
|
594 |
@subsection Snapshot mode |
595 |
|
596 |
If you use the option @option{-snapshot}, all disk images are |
597 |
considered as read only. When sectors in written, they are written in |
598 |
a temporary file created in @file{/tmp}. You can however force the |
599 |
write back to the raw disk images by pressing @key{C-a s}. |
600 |
|
601 |
NOTE: The snapshot mode only works with raw disk images. |
602 |
|
603 |
@subsection Copy On Write disk images |
604 |
|
605 |
QEMU also supports user mode Linux |
606 |
(@url{http://user-mode-linux.sourceforge.net/}) Copy On Write (COW) |
607 |
disk images. The COW disk images are much smaller than normal images |
608 |
as they store only modified sectors. They also permit the use of the |
609 |
same disk image template for many users. |
610 |
|
611 |
To create a COW disk images, use the command: |
612 |
|
613 |
@example |
614 |
qemu-mkcow -f myrawimage.bin mycowimage.cow |
615 |
@end example |
616 |
|
617 |
@file{myrawimage.bin} is a raw image you want to use as original disk |
618 |
image. It will never be written to. |
619 |
|
620 |
@file{mycowimage.cow} is the COW disk image which is created by |
621 |
@code{qemu-mkcow}. You can use it directly with the @option{-hdx} |
622 |
options. You must not modify the original raw disk image if you use |
623 |
COW images, as COW images only store the modified sectors from the raw |
624 |
disk image. QEMU stores the original raw disk image name and its |
625 |
modified time in the COW disk image so that chances of mistakes are |
626 |
reduced. |
627 |
|
628 |
If the raw disk image is not read-only, by pressing @key{C-a s} you |
629 |
can flush the COW disk image back into the raw disk image, as in |
630 |
snapshot mode. |
631 |
|
632 |
COW disk images can also be created without a corresponding raw disk |
633 |
image. It is useful to have a big initial virtual disk image without |
634 |
using much disk space. Use: |
635 |
|
636 |
@example |
637 |
qemu-mkcow mycowimage.cow 1024 |
638 |
@end example |
639 |
|
640 |
to create a 1 gigabyte empty COW disk image. |
641 |
|
642 |
NOTES: |
643 |
@enumerate |
644 |
@item |
645 |
COW disk images must be created on file systems supporting |
646 |
@emph{holes} such as ext2 or ext3. |
647 |
@item |
648 |
Since holes are used, the displayed size of the COW disk image is not |
649 |
the real one. To know it, use the @code{ls -ls} command. |
650 |
@end enumerate |
651 |
|
652 |
@node linux_compile |
653 |
@section Linux Kernel Compilation |
654 |
|
655 |
You can use any linux kernel with QEMU. However, if you want to use |
656 |
@code{qemu-fast} to get maximum performances, you should make the |
657 |
following changes to the Linux kernel (only 2.4.x and 2.5.x were |
658 |
tested): |
659 |
|
660 |
@enumerate |
661 |
@item |
662 |
The kernel must be mapped at 0x90000000 (the default is |
663 |
0xc0000000). You must modify only two lines in the kernel source: |
664 |
|
665 |
In @file{include/asm/page.h}, replace |
666 |
@example |
667 |
#define __PAGE_OFFSET (0xc0000000) |
668 |
@end example |
669 |
by |
670 |
@example |
671 |
#define __PAGE_OFFSET (0x90000000) |
672 |
@end example |
673 |
|
674 |
And in @file{arch/i386/vmlinux.lds}, replace |
675 |
@example |
676 |
. = 0xc0000000 + 0x100000; |
677 |
@end example |
678 |
by |
679 |
@example |
680 |
. = 0x90000000 + 0x100000; |
681 |
@end example |
682 |
|
683 |
@item |
684 |
If you want to enable SMP (Symmetric Multi-Processing) support, you |
685 |
must make the following change in @file{include/asm/fixmap.h}. Replace |
686 |
@example |
687 |
#define FIXADDR_TOP (0xffffX000UL) |
688 |
@end example |
689 |
by |
690 |
@example |
691 |
#define FIXADDR_TOP (0xa7ffX000UL) |
692 |
@end example |
693 |
(X is 'e' or 'f' depending on the kernel version). Although you can |
694 |
use an SMP kernel with QEMU, it only supports one CPU. |
695 |
|
696 |
@item |
697 |
If you are not using a 2.5 kernel as host kernel but if you use a target |
698 |
2.5 kernel, you must also ensure that the 'HZ' define is set to 100 |
699 |
(1000 is the default) as QEMU cannot currently emulate timers at |
700 |
frequencies greater than 100 Hz on host Linux systems < 2.5. In |
701 |
@file{include/asm/param.h}, replace: |
702 |
|
703 |
@example |
704 |
# define HZ 1000 /* Internal kernel timer frequency */ |
705 |
@end example |
706 |
by |
707 |
@example |
708 |
# define HZ 100 /* Internal kernel timer frequency */ |
709 |
@end example |
710 |
|
711 |
@end enumerate |
712 |
|
713 |
The file config-2.x.x gives the configuration of the example kernels. |
714 |
|
715 |
Just type |
716 |
@example |
717 |
make bzImage |
718 |
@end example |
719 |
|
720 |
As you would do to make a real kernel. Then you can use with QEMU |
721 |
exactly the same kernel as you would boot on your PC (in |
722 |
@file{arch/i386/boot/bzImage}). |
723 |
|
724 |
@node gdb_usage |
725 |
@section GDB usage |
726 |
|
727 |
QEMU has a primitive support to work with gdb, so that you can do |
728 |
'Ctrl-C' while the virtual machine is running and inspect its state. |
729 |
|
730 |
In order to use gdb, launch qemu with the '-s' option. It will wait for a |
731 |
gdb connection: |
732 |
@example |
733 |
> qemu -s -kernel arch/i386/boot/bzImage -hda root-2.4.20.img -append "root=/dev/hda" |
734 |
Connected to host network interface: tun0 |
735 |
Waiting gdb connection on port 1234 |
736 |
@end example |
737 |
|
738 |
Then launch gdb on the 'vmlinux' executable: |
739 |
@example |
740 |
> gdb vmlinux |
741 |
@end example |
742 |
|
743 |
In gdb, connect to QEMU: |
744 |
@example |
745 |
(gdb) target remote localhost:1234 |
746 |
@end example |
747 |
|
748 |
Then you can use gdb normally. For example, type 'c' to launch the kernel: |
749 |
@example |
750 |
(gdb) c |
751 |
@end example |
752 |
|
753 |
Here are some useful tips in order to use gdb on system code: |
754 |
|
755 |
@enumerate |
756 |
@item |
757 |
Use @code{info reg} to display all the CPU registers. |
758 |
@item |
759 |
Use @code{x/10i $eip} to display the code at the PC position. |
760 |
@item |
761 |
Use @code{set architecture i8086} to dump 16 bit code. Then use |
762 |
@code{x/10i $cs*16+*eip} to dump the code at the PC position. |
763 |
@end enumerate |
764 |
|
765 |
@chapter QEMU Internals |
766 |
|
767 |
@section QEMU compared to other emulators |
768 |
|
769 |
Like bochs [3], QEMU emulates an x86 CPU. But QEMU is much faster than |
770 |
bochs as it uses dynamic compilation and because it uses the host MMU to |
771 |
simulate the x86 MMU. The downside is that currently the emulation is |
772 |
not as accurate as bochs (for example, you cannot currently run Windows |
773 |
inside QEMU). |
774 |
|
775 |
Like Valgrind [2], QEMU does user space emulation and dynamic |
776 |
translation. Valgrind is mainly a memory debugger while QEMU has no |
777 |
support for it (QEMU could be used to detect out of bound memory |
778 |
accesses as Valgrind, but it has no support to track uninitialised data |
779 |
as Valgrind does). The Valgrind dynamic translator generates better code |
780 |
than QEMU (in particular it does register allocation) but it is closely |
781 |
tied to an x86 host and target and has no support for precise exceptions |
782 |
and system emulation. |
783 |
|
784 |
EM86 [4] is the closest project to user space QEMU (and QEMU still uses |
785 |
some of its code, in particular the ELF file loader). EM86 was limited |
786 |
to an alpha host and used a proprietary and slow interpreter (the |
787 |
interpreter part of the FX!32 Digital Win32 code translator [5]). |
788 |
|
789 |
TWIN [6] is a Windows API emulator like Wine. It is less accurate than |
790 |
Wine but includes a protected mode x86 interpreter to launch x86 Windows |
791 |
executables. Such an approach as greater potential because most of the |
792 |
Windows API is executed natively but it is far more difficult to develop |
793 |
because all the data structures and function parameters exchanged |
794 |
between the API and the x86 code must be converted. |
795 |
|
796 |
User mode Linux [7] was the only solution before QEMU to launch a Linux |
797 |
kernel as a process while not needing any host kernel patches. However, |
798 |
user mode Linux requires heavy kernel patches while QEMU accepts |
799 |
unpatched Linux kernels. It would be interesting to compare the |
800 |
performance of the two approaches. |
801 |
|
802 |
The new Plex86 [8] PC virtualizer is done in the same spirit as the QEMU |
803 |
system emulator. It requires a patched Linux kernel to work (you cannot |
804 |
launch the same kernel on your PC), but the patches are really small. As |
805 |
it is a PC virtualizer (no emulation is done except for some priveledged |
806 |
instructions), it has the potential of being faster than QEMU. The |
807 |
downside is that a complicated (and potentially unsafe) host kernel |
808 |
patch is needed. |
809 |
|
810 |
@section Portable dynamic translation |
811 |
|
812 |
QEMU is a dynamic translator. When it first encounters a piece of code, |
813 |
it converts it to the host instruction set. Usually dynamic translators |
814 |
are very complicated and highly CPU dependent. QEMU uses some tricks |
815 |
which make it relatively easily portable and simple while achieving good |
816 |
performances. |
817 |
|
818 |
The basic idea is to split every x86 instruction into fewer simpler |
819 |
instructions. Each simple instruction is implemented by a piece of C |
820 |
code (see @file{op-i386.c}). Then a compile time tool (@file{dyngen}) |
821 |
takes the corresponding object file (@file{op-i386.o}) to generate a |
822 |
dynamic code generator which concatenates the simple instructions to |
823 |
build a function (see @file{op-i386.h:dyngen_code()}). |
824 |
|
825 |
In essence, the process is similar to [1], but more work is done at |
826 |
compile time. |
827 |
|
828 |
A key idea to get optimal performances is that constant parameters can |
829 |
be passed to the simple operations. For that purpose, dummy ELF |
830 |
relocations are generated with gcc for each constant parameter. Then, |
831 |
the tool (@file{dyngen}) can locate the relocations and generate the |
832 |
appriopriate C code to resolve them when building the dynamic code. |
833 |
|
834 |
That way, QEMU is no more difficult to port than a dynamic linker. |
835 |
|
836 |
To go even faster, GCC static register variables are used to keep the |
837 |
state of the virtual CPU. |
838 |
|
839 |
@section Register allocation |
840 |
|
841 |
Since QEMU uses fixed simple instructions, no efficient register |
842 |
allocation can be done. However, because RISC CPUs have a lot of |
843 |
register, most of the virtual CPU state can be put in registers without |
844 |
doing complicated register allocation. |
845 |
|
846 |
@section Condition code optimisations |
847 |
|
848 |
Good CPU condition codes emulation (@code{EFLAGS} register on x86) is a |
849 |
critical point to get good performances. QEMU uses lazy condition code |
850 |
evaluation: instead of computing the condition codes after each x86 |
851 |
instruction, it just stores one operand (called @code{CC_SRC}), the |
852 |
result (called @code{CC_DST}) and the type of operation (called |
853 |
@code{CC_OP}). |
854 |
|
855 |
@code{CC_OP} is almost never explicitely set in the generated code |
856 |
because it is known at translation time. |
857 |
|
858 |
In order to increase performances, a backward pass is performed on the |
859 |
generated simple instructions (see |
860 |
@code{translate-i386.c:optimize_flags()}). When it can be proved that |
861 |
the condition codes are not needed by the next instructions, no |
862 |
condition codes are computed at all. |
863 |
|
864 |
@section CPU state optimisations |
865 |
|
866 |
The x86 CPU has many internal states which change the way it evaluates |
867 |
instructions. In order to achieve a good speed, the translation phase |
868 |
considers that some state information of the virtual x86 CPU cannot |
869 |
change in it. For example, if the SS, DS and ES segments have a zero |
870 |
base, then the translator does not even generate an addition for the |
871 |
segment base. |
872 |
|
873 |
[The FPU stack pointer register is not handled that way yet]. |
874 |
|
875 |
@section Translation cache |
876 |
|
877 |
A 2MByte cache holds the most recently used translations. For |
878 |
simplicity, it is completely flushed when it is full. A translation unit |
879 |
contains just a single basic block (a block of x86 instructions |
880 |
terminated by a jump or by a virtual CPU state change which the |
881 |
translator cannot deduce statically). |
882 |
|
883 |
@section Direct block chaining |
884 |
|
885 |
After each translated basic block is executed, QEMU uses the simulated |
886 |
Program Counter (PC) and other cpu state informations (such as the CS |
887 |
segment base value) to find the next basic block. |
888 |
|
889 |
In order to accelerate the most common cases where the new simulated PC |
890 |
is known, QEMU can patch a basic block so that it jumps directly to the |
891 |
next one. |
892 |
|
893 |
The most portable code uses an indirect jump. An indirect jump makes it |
894 |
easier to make the jump target modification atomic. On some |
895 |
architectures (such as PowerPC), the @code{JUMP} opcode is directly |
896 |
patched so that the block chaining has no overhead. |
897 |
|
898 |
@section Self-modifying code and translated code invalidation |
899 |
|
900 |
Self-modifying code is a special challenge in x86 emulation because no |
901 |
instruction cache invalidation is signaled by the application when code |
902 |
is modified. |
903 |
|
904 |
When translated code is generated for a basic block, the corresponding |
905 |
host page is write protected if it is not already read-only (with the |
906 |
system call @code{mprotect()}). Then, if a write access is done to the |
907 |
page, Linux raises a SEGV signal. QEMU then invalidates all the |
908 |
translated code in the page and enables write accesses to the page. |
909 |
|
910 |
Correct translated code invalidation is done efficiently by maintaining |
911 |
a linked list of every translated block contained in a given page. Other |
912 |
linked lists are also maintained to undo direct block chaining. |
913 |
|
914 |
Although the overhead of doing @code{mprotect()} calls is important, |
915 |
most MSDOS programs can be emulated at reasonnable speed with QEMU and |
916 |
DOSEMU. |
917 |
|
918 |
Note that QEMU also invalidates pages of translated code when it detects |
919 |
that memory mappings are modified with @code{mmap()} or @code{munmap()}. |
920 |
|
921 |
@section Exception support |
922 |
|
923 |
longjmp() is used when an exception such as division by zero is |
924 |
encountered. |
925 |
|
926 |
The host SIGSEGV and SIGBUS signal handlers are used to get invalid |
927 |
memory accesses. The exact CPU state can be retrieved because all the |
928 |
x86 registers are stored in fixed host registers. The simulated program |
929 |
counter is found by retranslating the corresponding basic block and by |
930 |
looking where the host program counter was at the exception point. |
931 |
|
932 |
The virtual CPU cannot retrieve the exact @code{EFLAGS} register because |
933 |
in some cases it is not computed because of condition code |
934 |
optimisations. It is not a big concern because the emulated code can |
935 |
still be restarted in any cases. |
936 |
|
937 |
@section Linux system call translation |
938 |
|
939 |
QEMU includes a generic system call translator for Linux. It means that |
940 |
the parameters of the system calls can be converted to fix the |
941 |
endianness and 32/64 bit issues. The IOCTLs are converted with a generic |
942 |
type description system (see @file{ioctls.h} and @file{thunk.c}). |
943 |
|
944 |
QEMU supports host CPUs which have pages bigger than 4KB. It records all |
945 |
the mappings the process does and try to emulated the @code{mmap()} |
946 |
system calls in cases where the host @code{mmap()} call would fail |
947 |
because of bad page alignment. |
948 |
|
949 |
@section Linux signals |
950 |
|
951 |
Normal and real-time signals are queued along with their information |
952 |
(@code{siginfo_t}) as it is done in the Linux kernel. Then an interrupt |
953 |
request is done to the virtual CPU. When it is interrupted, one queued |
954 |
signal is handled by generating a stack frame in the virtual CPU as the |
955 |
Linux kernel does. The @code{sigreturn()} system call is emulated to return |
956 |
from the virtual signal handler. |
957 |
|
958 |
Some signals (such as SIGALRM) directly come from the host. Other |
959 |
signals are synthetized from the virtual CPU exceptions such as SIGFPE |
960 |
when a division by zero is done (see @code{main.c:cpu_loop()}). |
961 |
|
962 |
The blocked signal mask is still handled by the host Linux kernel so |
963 |
that most signal system calls can be redirected directly to the host |
964 |
Linux kernel. Only the @code{sigaction()} and @code{sigreturn()} system |
965 |
calls need to be fully emulated (see @file{signal.c}). |
966 |
|
967 |
@section clone() system call and threads |
968 |
|
969 |
The Linux clone() system call is usually used to create a thread. QEMU |
970 |
uses the host clone() system call so that real host threads are created |
971 |
for each emulated thread. One virtual CPU instance is created for each |
972 |
thread. |
973 |
|
974 |
The virtual x86 CPU atomic operations are emulated with a global lock so |
975 |
that their semantic is preserved. |
976 |
|
977 |
Note that currently there are still some locking issues in QEMU. In |
978 |
particular, the translated cache flush is not protected yet against |
979 |
reentrancy. |
980 |
|
981 |
@section Self-virtualization |
982 |
|
983 |
QEMU was conceived so that ultimately it can emulate itself. Although |
984 |
it is not very useful, it is an important test to show the power of the |
985 |
emulator. |
986 |
|
987 |
Achieving self-virtualization is not easy because there may be address |
988 |
space conflicts. QEMU solves this problem by being an executable ELF |
989 |
shared object as the ld-linux.so ELF interpreter. That way, it can be |
990 |
relocated at load time. |
991 |
|
992 |
@section MMU emulation |
993 |
|
994 |
For system emulation, QEMU uses the mmap() system call to emulate the |
995 |
target CPU MMU. It works as long the emulated OS does not use an area |
996 |
reserved by the host OS (such as the area above 0xc0000000 on x86 |
997 |
Linux). |
998 |
|
999 |
It is planned to add a slower but more precise MMU emulation |
1000 |
with a software MMU. |
1001 |
|
1002 |
@section Bibliography |
1003 |
|
1004 |
@table @asis |
1005 |
|
1006 |
@item [1] |
1007 |
@url{http://citeseer.nj.nec.com/piumarta98optimizing.html}, Optimizing |
1008 |
direct threaded code by selective inlining (1998) by Ian Piumarta, Fabio |
1009 |
Riccardi. |
1010 |
|
1011 |
@item [2] |
1012 |
@url{http://developer.kde.org/~sewardj/}, Valgrind, an open-source |
1013 |
memory debugger for x86-GNU/Linux, by Julian Seward. |
1014 |
|
1015 |
@item [3] |
1016 |
@url{http://bochs.sourceforge.net/}, the Bochs IA-32 Emulator Project, |
1017 |
by Kevin Lawton et al. |
1018 |
|
1019 |
@item [4] |
1020 |
@url{http://www.cs.rose-hulman.edu/~donaldlf/em86/index.html}, the EM86 |
1021 |
x86 emulator on Alpha-Linux. |
1022 |
|
1023 |
@item [5] |
1024 |
@url{http://www.usenix.org/publications/library/proceedings/usenix-nt97/full_papers/chernoff/chernoff.pdf}, |
1025 |
DIGITAL FX!32: Running 32-Bit x86 Applications on Alpha NT, by Anton |
1026 |
Chernoff and Ray Hookway. |
1027 |
|
1028 |
@item [6] |
1029 |
@url{http://www.willows.com/}, Windows API library emulation from |
1030 |
Willows Software. |
1031 |
|
1032 |
@item [7] |
1033 |
@url{http://user-mode-linux.sourceforge.net/}, |
1034 |
The User-mode Linux Kernel. |
1035 |
|
1036 |
@item [8] |
1037 |
@url{http://www.plex86.org/}, |
1038 |
The new Plex86 project. |
1039 |
|
1040 |
@end table |
1041 |
|
1042 |
@chapter Regression Tests |
1043 |
|
1044 |
In the directory @file{tests/}, various interesting testing programs |
1045 |
are available. There are used for regression testing. |
1046 |
|
1047 |
@section @file{test-i386} |
1048 |
|
1049 |
This program executes most of the 16 bit and 32 bit x86 instructions and |
1050 |
generates a text output. It can be compared with the output obtained with |
1051 |
a real CPU or another emulator. The target @code{make test} runs this |
1052 |
program and a @code{diff} on the generated output. |
1053 |
|
1054 |
The Linux system call @code{modify_ldt()} is used to create x86 selectors |
1055 |
to test some 16 bit addressing and 32 bit with segmentation cases. |
1056 |
|
1057 |
The Linux system call @code{vm86()} is used to test vm86 emulation. |
1058 |
|
1059 |
Various exceptions are raised to test most of the x86 user space |
1060 |
exception reporting. |
1061 |
|
1062 |
@section @file{linux-test} |
1063 |
|
1064 |
This program tests various Linux system calls. It is used to verify |
1065 |
that the system call parameters are correctly converted between target |
1066 |
and host CPUs. |
1067 |
|
1068 |
@section @file{hello-i386} |
1069 |
|
1070 |
Very simple statically linked x86 program, just to test QEMU during a |
1071 |
port to a new host CPU. |
1072 |
|
1073 |
@section @file{hello-arm} |
1074 |
|
1075 |
Very simple statically linked ARM program, just to test QEMU during a |
1076 |
port to a new host CPU. |
1077 |
|
1078 |
@section @file{sha1} |
1079 |
|
1080 |
It is a simple benchmark. Care must be taken to interpret the results |
1081 |
because it mostly tests the ability of the virtual CPU to optimize the |
1082 |
@code{rol} x86 instruction and the condition code computations. |
1083 |
|