Higher Half Kernel: Difference between revisions
[unchecked revision] | [unchecked revision] |
mNo edit summary |
mNo edit summary |
||
(21 intermediate revisions by 14 users not shown) | |||
Line 1: | Line 1: | ||
{{Stub}} |
|||
{{Template:Kernel designs}} |
{{Template:Kernel designs}} |
||
It is traditional and generally good to have your kernel mapped in every user process. Linux |
It is traditional and generally good to have your kernel mapped in every user process. Linux and many other Unices, for instance, reside at virtual addresses ''0xC0000000 – 0xFFFFFFFF'' of every address space, leaving the range ''0x00000000 – 0xBFFFFFFF'' for user code, data, stacks, libraries, etc. Kernels that have such design are said to be "in the higher half" by opposition to kernels that use lowest virtual addresses for themselves, and leave higher addresses for the applications. |
||
Kernels that have such design are said to be "in the higher half" by opposition to kernels that use lowest virtual addresses for themselves, and leave higher addresses for the applications. |
|||
In addition, there is some non-x86 ISA (the [[MIPS Overview|MIPS]] [[User:Schol-r-lea/Understanding RISC vs CISC|RISC architecture]] and [[AArch64|ARM]]) which partly forces the issue. On MIPS and ARM systems, addresses using the high bit (either bit 31 or bit 63, depending on the system word width) are reserved for use in Supervisor mode, and are exception trapped when in User mode. |
|||
Advantages of a higher half kernel are: |
Advantages of a higher half kernel are: |
||
* |
* It's easier to set up [[Virtual 8086 Mode|VM86]] processes since the region below 1 MB is userspace. |
||
* More |
* More generally, user applications are not dependent on how much memory is kernel space (your application can be linked to 0x400000 regardless of whether kernel is at ''0xC0000000'', ''0x80000000'' or ''0xE0000000'' ...), which makes the ABI nicer. |
||
* If your OS |
* If your OS is 64-bit, then 32-bit applications will be able to use the full 32-bit address space. |
||
* ' |
* 'Mnemonic' invalid pointers such as ''0xCAFEBABE'', ''0xDEADBEEF'', ''0xDEADC0DE'', etc. can be used. |
||
== |
== Bootloader support == |
||
To make things easier, some bootloaders natively support higher half kernels, by directly loading and mapping a kernel to the higher half in virtual memory. |
|||
* [[BOOTBOOT]] only supports higher half kernels by design. It has example Hello World kernels written in [[C]], [[Pascal]], [[Rust]] and [[Go]] |
|||
⚫ | |||
* The [[Limine]] protocol also only support higher half kernels by design. See [[Limine Bare Bones]] for a tutorial on how to write a simple 64-bit higher half kernel using Limine. |
|||
== Initialization == |
|||
=== Custom Bootloader === |
|||
⚫ | |||
How to do this basically depends on '''when''' you'd like your kernel to believe it's in the higher end, and '''when''' you set up paging. Without a boot loader help, you'll need a small trampoline code which runs in lower half, sets up higher half paging and jumps. |
|||
The easiest way is to load your kernel to any physical location you wish (for instance in the lowest 1MB) and prepare page tables that will perform the appropriate translation. Let's say you loaded your kernel starting at ''0x00010000'' up to ''0x0009ffff'' and want it to appear at ''0xC0010000''. |
|||
⚫ | |||
*Pick 3 4096-aligned addresses where you'll put your page directory and system tables. Wipe them (with zeroes). |
|||
*Fill the lowest 256 entries of one table to set up [[Identity Paging]] for at least the BIOS and your bootloader (I'd even use 1:1 mapping for the whole lowest 1MB if I were you :)). |
|||
*In the other table, fill entry #0x10 with ''0x00010003'', entry #0x11 with ''0x00011003'' ... (For as much pages as your kernel has.). |
|||
*Fill the entry #0 of the directory with the address of the first table (and make it present). |
|||
*Fill the entry #768 of the directory with the address of the second table (and make it present). |
|||
=== Articles === |
|||
When switching to [[Protected Mode]], use this assembly example: |
|||
* [[Higher Half x86 Bare Bones]] |
|||
mov eax, physical_address_of_the_directory |
|||
mov cr3,eax |
|||
mov eax,cr0 |
|||
or eax,0x80000001 ; enables both pmode and paging |
|||
mov cr0,eax |
|||
=== The GDT Trick === |
|||
If we don't want to enable paging right from the start, it is still possible to have your kernel appearing in the higher half. [http://www.osdever.net/tutorials/pdf/memory1.pdf Tim Robinson's GDT Trick] works by using [[segmentation]] instead to select an appropriate base for the code and data segments. Say we have loaded the kernel at ''0x10000'' and we want it to appear at ''0xC0000000''. All we need is to find a base _X_ such as _X_+''0xC0000000'' = ''0x10000''. The bootloader will then initialize the [[GDT]] with cs.base = 0x40010000 = ds.base. This also means that special care must be taken for vram access, as ''(void*)0xb8000'' is now somewhere above 1GB. Either use a special 0-based additional data-segment or use |
|||
#define logical_to_physical(x) (((void*)x)+0x40010000) |
|||
#define physical_to_logical(x) (((void*)x)-0x40010000) |
|||
short* vram=physical_to_logical(0xb8000) |
|||
When you eventually enable paging, create the page tables as mentioned above (keeping in mind that you need to undo the address conversion again, e.g. |
|||
pgentry *pagedirectory=physical_to_logical(0x9D000); |
|||
pgentry *lowesttable=physical_to_logical(0x9C000); |
|||
pgentry *kerneltable=physical_to_logical(0x9B000); |
|||
/* prepare lowesttable and kerneltable */ |
|||
pagedirectory[0]=mkpgentry(0x9C000,PG_PRESENT|whatever); |
|||
pagedirectory[0xC0000000>>12]=mkpgentry(0x9B000,PG_PRESENT|PG_SYSTEM|whatever); |
|||
set_cr3_and_update_gdt(0x9D000); |
|||
Along with setting CR3's value, you'll have to clear the base of all the previous segments and reload segment registers so that, at the exit of ''set_cr3_and_update_gdt'', any memory references now use the 0-based code and data segment and that the page tables perform the translation required to still make ''0x10000'' appear at ''0xC0000000''. |
|||
=== With GRUB From the Start === |
|||
[[GRUB]] will load your kernel at the desired physical target address, but it will leave segments in the [[GDT]] with a 0 base and it will not enable paging. |
|||
=== With GRUB Later On === |
|||
Even with GRUB it's still possible to use TimRobinson's GDT trick: just set up a "fake" GDT then jump to your C code and enable [[paging]]. See [[Higher Half With GDT]] |
|||
⚫ | |||
=== Threads === |
=== Threads === |
||
*[ |
* [[Topic:11160|I wrote a simple HigherHalf kernel]] |
||
[[Category:Kernel]] |
[[Category:Kernel]] |
Latest revision as of 12:03, 8 September 2022
Kernel Designs |
---|
Models |
Other Concepts |
It is traditional and generally good to have your kernel mapped in every user process. Linux and many other Unices, for instance, reside at virtual addresses 0xC0000000 – 0xFFFFFFFF of every address space, leaving the range 0x00000000 – 0xBFFFFFFF for user code, data, stacks, libraries, etc. Kernels that have such design are said to be "in the higher half" by opposition to kernels that use lowest virtual addresses for themselves, and leave higher addresses for the applications.
In addition, there is some non-x86 ISA (the MIPS RISC architecture and ARM) which partly forces the issue. On MIPS and ARM systems, addresses using the high bit (either bit 31 or bit 63, depending on the system word width) are reserved for use in Supervisor mode, and are exception trapped when in User mode.
Advantages of a higher half kernel are:
- It's easier to set up VM86 processes since the region below 1 MB is userspace.
- More generally, user applications are not dependent on how much memory is kernel space (your application can be linked to 0x400000 regardless of whether kernel is at 0xC0000000, 0x80000000 or 0xE0000000 ...), which makes the ABI nicer.
- If your OS is 64-bit, then 32-bit applications will be able to use the full 32-bit address space.
- 'Mnemonic' invalid pointers such as 0xCAFEBABE, 0xDEADBEEF, 0xDEADC0DE, etc. can be used.
Bootloader support
To make things easier, some bootloaders natively support higher half kernels, by directly loading and mapping a kernel to the higher half in virtual memory.
- BOOTBOOT only supports higher half kernels by design. It has example Hello World kernels written in C, Pascal, Rust and Go
- The Limine protocol also only support higher half kernels by design. See Limine Bare Bones for a tutorial on how to write a simple 64-bit higher half kernel using Limine.
Initialization
To setup a higher half kernel, you have to map your kernel to the appropriate virtual address. When using a boot protocol which supports higher half kernels directly, such as BOOTBOOT, or Limine, your kernel will already be properly mapped.
How to do this basically depends on when you'd like your kernel to believe it's in the higher end, and when you set up paging. Without a boot loader help, you'll need a small trampoline code which runs in lower half, sets up higher half paging and jumps.