pagetable.com

How the Final Cartridge III Freezer works

Michael Steil — Sat, 14 Jun 2025 16:21:34 +0000

by Daniël Mantione

Daniël contributed the commented disassembly of the FC3 freezer functionality to the reverse engineering effort at github.com/mist64/final_cartridge. Thanks to Eric Schlaepfer for his input on 6502 timing.

Freezer cartridge theory

One key reason why the Commodore 64 was so successful in the 80s was that it was able to do things it wasn’t designed for. Freezer cartridges, which allowed stopping any running program or game, applying cheat codes and resuming, or saving the complete computer’s state to disk so it could be continued from later, were one of those clever innovations: They were possible on the Commodore 64, but not on many other computers. A Commodore 64 with a good cartridge was a significantly more capable computer than a Commodore 64 without, this did contribute to the longetivity of the computer and is one reason why the Commodore 64 could remain in production for more than a decade.

However, because the Commodore 64 wasn’t designed at all to support freezing, a cartridge has to exploit quirks in the hardware in order to achieve its functionality. The C64 feature that is essential to freezer cartridges is the so-called Ultimax mode.

The Commodore Ultimax or Max machine was a games console that uses the same custom chips as the Commodore 64: It has a VIC-II and SID. The Ultimax had 2 KB of RAM and no ROM. Instead, the ROM on the cartridge is mapped into memory. The Ultimax was a commercial flop: Commodore only sold it in Japan, and it wasn’t a success there either. However, it somehow happened that the Commodore 64 was designed to be compatible with game cartridges developed for the Ultimax.

If an Ultimax cartridge is inserted, the Commodore 64 disables most of its RAM. 4 KB of RAM remains active and the C64 maps the cartridge ROM into memory, just like the Ultimax. This ROM is mapped from $E000..$FFFF, the place in memory where you will normally find the KERNAL.

Ultimax mode is activated when the cartridge pulls the GAME pin on the cartridge port low. It mode can be activated and deactivated in real-time. This means that if a cartridge activates Ultimax mode, it forces cartridge ROM memory into the C64’s memory map. Software on the C64 cannot prevent this from happening.

The ROM is mapped into the KERNAL memory region, and this means that the 6502 interrupt vectors at the end of memory will be read from cartridge ROM. A freezer cartridge will use an NMI interrupt to take control of the C64: Software cannot prevent an NMI interrupt from happening. So by combining NMI and Ultimax, a freezer cartridge can make the C64 execute code from cartridge ROM when the freeze button is pressed.

Doing the freeze correctly

In order to successfully freeze the Commodore 64, there is still a challenge that needs to be solved: Once the 6510 CPU receives an NMI interrupt signal, it will complete the current instruction before starting handling the interrupt. Because the C64 deactivates most of its RAM memory in Ultimax mode, if a cartridge immediately activates Ultimax mode as soon as the freeze button is pressed, the instruction may read or write from/to deactivated memory. This could prevent successful unfreezing.

On the Final Cartridge III, the freeze button controls both the NMI and GAME lines. If you press the freeze button, NMI is pulled down immediately, however. For GAME, there is a counter on the cartridge. This counter waits 7 cycles before GAME is pulled down. Both lines are released as soon as you release the button.

This is what the 6510 processor does when an NMI happens:

interrupt propagation cycle
interrupt propagation cycle
complete the current instruction (0-6 cycles)
internal cycle
internal cycle
store PC(hi) to the stack (1 cycle)
store PC(lo) to the stack (1 cycle)
store P to the stack (1 cycle)
fetch PC(lo) from $FFFA (NMI vector lo)
fetch PC(hi) from $FFFB (NMI vector hi)

Because the stack at $0100 remains visible in Ultimax mode, there some flexibility in which clock cycle Ultimax mode is activated. This flexibility period is 5 cycles, after which the NMI vector is read.

This means that the freeze process of the Final Cartridge III is not fully reliable: 6502 instructions can take up to 7 cycles. If a 7 cycle instruction is started during the propagation cycles, Ultimax mode is activated before the current instruction has finished. Because 7 cycles instructions relatively rare, the freezer function appears to be reliable, but if you have bad luck it fails.

This is shown in the following photo. The C64 was connected to a HP 1650A logic analyzer, which was set to trigger on NMI. We can see in each cycle what happens. The C64’s memory was filled with $FE, which results in the instruction INC $FEFE,x, which is a 7 cycle instruction. It was excuting this in an endless loop. Then after trying a few times, I got this:

We can cleary see that Ultimax mode is activated before the INC instruction has finished, so the two memory writes are written to memory that is disabled, and thus the writes have no effect.

Another issue with the freeze circuit is the risk of button bounces. When a button is pressed, there is often some noise on the line for a few tens of microseconds. While the hardware does use Schmitt-triggers on the button, there is no proper debounce circuit. This means that there is a risk of multiple NMI interrupts or Ultimax mode flipping on and off before the firmware pulls them low via the register.

(I have addressed these flaws in my Final Cartridge III 101% hardware. Once a digital low is received, a flip flop is set, indicating freeze is active. Instead of counting cycles, the FC3 101% waits for 3 consecutive writes before it pulls down GAME. (The 6510 writes flags and PC to the stack during these cycles.) NMI and GAME are not released by releasing the button, but the hardware waits for a write to the FC3 register before resetting the freeze flip-flop. This way the FC3 101% can guarantee that NMI and GAME will not bounce and GAME is always pulled down in the correct cycle.)

NMI and GAME can also be pulled down through a register on the Final Cartridge III. One of the first things the NMI handler in the FC3 firmware does, is to pull down NMI and GAME through the register. This means NMI and GAME remain low as long as the C64 is frozen.

Initializing the freezer

After the Final Cartridge III has taken control of the C64, the computer is in an unknown state and all of the memory of the C64 might be in use, therefore the cartridge cannot touch any memory. The cartridge takes some freedom, it assumes there is some free space on the stack available, but otherwise has to operate without using any memory.

Displaying the freezer menu and doing the actual freezer functionality is going to require memory, so something needs to be done, since unlike the Action Replay, the Final Cartridge III does not contain any additional RAM.

The freezer uses two slices of memory: The first slice is 103 bytes in size and is used for register backups and temporary variables, the second has a size of 87 bytes and contains the unfreeze routine. Once the freezer has taken control of the C64 and saved the most essential state on the stack, it starts to scan the C64’s memory for memory that can be RLE compressed, i.e. it looks for 103 and 87 bytes of continuous memory with the same value. Once found, it remembers the value, and that memory can now be safely overwritten.

What if no memory can be found? Well, if no memory can be found, the FC3 firmware will check the location of the screen RAM and use that as temporary memory. The hope is that the program that was frozen will restore this memory itself. I have never seen this happen in practice, it looks like such a situation is really rare and under normal situations, there is always some memory that can be compressed.

The zero page from $0070 to $00D6 is backed up into the first piece memory, the unfreeze routine is installed into the second piece of memory. You might wonder why the unfreeze routine is stored in C64 RAM. Why not use cartridge ROM and use less C64 memory? The reason is the backup functionality. Backups can be loaded from disk or tape without any Final Cartridge installed. Therefore, it should be possible to unfreeze and continue a program, without reliance on cartridge ROM.

After the unfreeze routine has been written to memory, the stack is set up in such a way that an RTS instruction is enough to exit the freezer, unfreeze the program and continue normal execution.

The CIA registers are completely backed up into the freed up zero page memory at $0070. Now, not all CIA registers can be read, therefore, their values are retrieved with some tricks. For example, if you read the timers, you read their current value, not their period. However, if you load $10 into the CIA timer control register at $DC0E/$DC0F/$DD0E/$DD0F, you load the timer without starting it, so that way, you can retrieve the timer period. In order to read the status of the interrupt bits, the timers are being run for one period with interrupts disabled and reading the interrupt flags of the CIA. Note that the NMI line is still held low by the cartridge hardware, as far as the 6510 CPU is concerned, the computer is servicing an NMI interrupt, so NMI interrupts are effectively disabled and the firmware can successfully discover the values of the interrupt bits on CIA 2.

No attempt is made to discover the value of the real time clock alarm interrupt bits, the serial shift register interrupt bits and the FLAG interrupt bit.

Next the VIC-II registers are stored into freed up memory at $0090. In order to use as little space as possible, the freezer only saves the values of sprite 2..5 (that it will use itself) and $D010..$D028. It will also need to write sprite pointers in memory, so these memory locations are saved as well. Just like with the CIA timers, the firmware will monitor the VIC-II interrupt status bit for one frame to discover the raster line where a raster interrupt is triggered.

Regarding the SID, it has the unfortunate property that its registers cannot be read. The FC3 firmware simply sets the volume to 0 and does not touch any of the other registers while the program is frozen. Upon unfreeze the volume will be set to 15 and it is assumed that the frozen program will set it to its desired value at some point. This has consequences for backups: It might not be the best idea to create a backup while music is being played.

Now, the freezer has been initialized and it is ready to display the menu.

The C64 has been successfully frozen, registers have been backed up and preparations for unfreeze have been done. There are a few bytes of memory available, but how on Earth is one going to display a full freezer menu with just a few bytes of memory available?

The secret to this is the so-called “invalid bitmap mode” of the VIC-II. If you enable both the bitmap enable bit (BMM) and enhanced background colour mode bit (ECM) at $D011, the VIC-II considers this an invalid mode and will display a black screen. The VIC-II internally behaves as if it was in bitmap mode, but output is forced to a black colour. The screen is still enabled: Sprites work normally. This is excellent for displaying the freezer menu, since this way the freezer can hide the current screen contents!

But we still need to display something, and need memory for that. Actually, not really, as the Final Cartridge III uses another property of the Ultimax mode of the Commodore 64: On the Commodore Max Machine, with its 2 KB of memory, there wasn’t any space in the memory of the machine for graphics. Therefore on the Max Machine, the VIC-II could read graphics directly from cartridge memory. On the Commodore 64, the VIC-II normally always reads from Commodore 64 RAM, but if you activate the Ultimax mode, the VIC-II is happy to directly read graphics from cartridge memory. This means that it is possible to display graphics without using C64 memory!

In order to display the menu bar, text mode is used. With help of a raster interrupt, the FC3 firmware disables invalid bitmap mode when the menu bar is being drawn and makes sure the VIC-II is in normal text mode. The screen buffer is located in cartridge memory, and so is the character font.

The currently open pull-down menu is rendered with sprites. Sprite 2..5 are set up to retrieve their shape from cartridge memory. The currently selected option in the menu is highlighted by changing the sprite colour inside the raster interrupt.

The click that you hear while navigating through the menus is made by switching SID volume to 15 and back to 0.

Accessing C64 memory

In Ultimax mode, most of the C64’s memory is disabled, however, in order to do its work, the freezer code regularly needs to access memory. It is difficult to release the GAME line while being in the freezer, because GAME is also pulled down by the freeze button, so releasing GAME through the register results in uncertainty whether the button is still pressed.

What the freezer code can do, is also pull down EXROM and if both GAME and EXROM are being pulled down, the C64 is in 16 KB cartridge mode. In 16 KB mode, ROM can be switched off by using the $0001 register and this way the freezer can access all of C64 memory whenever it needs.

Backups

While the freezer menu gives you the choice between slow and fast backups, this actually doesn’t do anything. Regardless of your choice, disk backups are always created with a fastloader inside and tape backups will always be written in turbotape format. A backup to disk is written using the low-level KERNAL API and therefore always slow, unless a KERNAL based speeder like JiffyDOS is installed.

A backup consists of two files “FC” and “-FC”. “FC” contains the loader and backups of the registers, colour RAM and memory from $0000 to $01ff. “-FC” contains the backup of $0402..$ffff, compressed with RLE and then after that, the backup of $0200..$0401, uncompressed.

While loading a backup from disk, the C64’s vectors at $0300 need to remain untouched until very late in the backup process. Therefore, before creating a backup, a routine that restores the vectors at $0300 is installed at $00A6, memory that was previously used for temporary variables while displaying the freezer. The values are read from disk by direct calls into KERNAL memory.

A backup starts by creating the “FC” file and writing the loader to it. Then the registers and memory from $0000..$0401 is written to the file, uncompressed.

Then, the memory from $0402..$ffff is compressed with an RLE algorithm. Then the “-FC” file is being written with the contents.

When loading a backup, the loader restores the memory from $0000 to $01ff right away. It can do so, because the variables needed by the KERNAL for serial I/O are inside the memory region that was freed up by the freezer. The loading thus happens with the original stack in memory.

The memory from $0200 onwards is not yet restored, it is needed for the disk/tape fastloader. The disk fastloader is an in-order version of the FC3 fastloader. The protocol used to transfer data between the floppy drive and the C64 is exactly the same as the normal fastloader, however, all sectors are read in file order, therefore the loading is not as as fast as the regular fastloader and depends on sector gap.

The RLE compressed memory is loaded towards the end of memory, i.e. the compressed block exists from $ffff downwards. This way the decompression routine can safely write the uncompressed memory from $0402 onwards without worrying that it overwrites the compressed memory.

When the memory has been read and decompressed, the loader reopens the “-FC” file, this time without the fastloader and starts loading page $0200. As a last step it jumps into the routine that was installed at $00A6 just before creating the backup, to load memory from $0300 to $0401.

When that has been done, the program can be unfrozen with a simple RTS instruction, just like an exit from the freezer.

Game trainer

The game trainer works by scanning C64 memory for reads of certain CIA and VIC-II registers. The machine code instructions to read these registers are replaced by a JSR to the IO1 window in bank 3 of the FC3 ROM. Because bank 3 remains active when the game trainer is used, the vectors at $0300 no longer point to their handlers in bank 0, i.e. as soon as the game trainer has been used, a game will no longer be able to read from disk. This can be considered an undocumented limitation of the FC3.

Screenshots

I will not discuss the screenshot code extensively, because I haven’t studied it myself enough yet. The screenshot code of the FC3 is pretty advanced in the way that it can convert all of the C64’s graphics modes into raster graphics, including any sprites that are visible. It supports multiple printer languages, and includes support for colour printers.

The screenshot code uses the same printer interface available in BASIC, which means it cannot only use printers connected to the Commodore serial bus, but thanks to the FC3’s additional drivers, also Centronics and RS232 printers connected to the user port.

The preview feature works quite similar to the normal freezer menu: The menu bar is displayed using a raster interrupt using a different location in cartridge memory. Unlike the freezer menu bar, the preview runs in 16 KB mode and temporarily switches to Ultimax mode during the raster interrupt. This allows the preview to disable invalid bitmap mode to temporarily display the original picture contained in C64 RAM.

Final words

I am quite impressed by the freezer, it shows that the creators of the Final Cartridge III had a really deep understanding of the Commodore 64. The creators show knowledge of how the 6510 handles interrupt, knowledge of what Ultimax mode does, knowledge how to retrieve values of registers than can normally not be read, knowledge about not so well known VIC-II features. It is all performed by machine code that needs to work under difficult conditions of using very little memory and be very careful what it touches.

The C64 is not controllable during the freezing process, and there were no modern tools like emulators, so debugging the code must have been a herculean effort.

All of this was used in an innovative way, with as a result, a simple, user-friendly button, that shows a pull-down menu in the look & feel of the Final Cartridge desktop.

When the FC3 came out in 1987, the Commodore 64 had been on the market for 5 years. For sure there was some time for the developers to study the machine, but there was no internet like it exists today to exchange information. There did exist books, but the documentation of the C64 wasn’t as extensive yet as it is today. There is absolutely no way the FC3 could have been created by just reading documentation.

The Commodore has always inspired people to explore how it works, push its limits and make it do more than it was originally designed to do. The creators of the Final Cartridge III were by no means the only C64 gurus at that time. However, they were some amazingly good gurus and for sure did succeed in making the C64 do more than it was originally designed to do.

64 Tips & Tricks [PDF]

Michael Steil — Sun, 25 May 2025 14:53:12 +0000

Michael Angerhausen, Lothar Englisch, Klaus Gerits, Frank Thrun:
64 Tips & Tricks
Düsseldorf: Data Becker, 1984. (4. erweiterte und überarbeitete Auflage)
ISBN 3-89011-001-0.
(386 pages, 26 MB)
Danke an Dirk Wagner für die Bereitstellung des Buchs.

Besonders interessant ist an diesem Buch das Kapitel über die C64 CP/M-Cartridge: Auf 71 Seiten gibt es eine Einführung in CP/M, eine Erklärung der Arbeitsweise der Cartridge, und ein vollständiges Listing der C64-Anpassungen.

Klappentext

DAS STEHT DRIN:
64 Tips & Tricks Bd. 1, mit weit über 70000 verkauften Exemplaren ein Bestseller aus dem Hause DATA BECKER, ist eine echte Fundgrube für jeden COMMODORE 64 Anwender. Mit POKEs und anderen nützlichen Routinen, interessanten Programmen sowie wichtigen Programmiertips & -tricks.

Aus dem Inhalt:

Definition eines eigenen Zeichensatzes
Tastaturbelegung und ihre Anderung
Dateneingabe mit Komfort
Simulation der Maus mit einem Joystick
BASIC für Fortgeschrittene
CP/M auf dem COMMODORE 64
Druckeranschluß über den USER-Port
Datenübertragung von und zu anderen Rechnern
Expansionport
Synthesizer in Stereo
Retten einer nicht ordnungsgemäß geschlossenen Datei
Erzeugen einer BASIC-Zeile in BASIC
Kassettenpuffer als Datenspeicher
Sortieren von Stringfeldern
Multitasking auf dem COMMODORE 64
POKE’s und die Zeropage
Repeat-Funktion für alle Tasten

und vieles mehr… .

UND GESCHRIEBEN HAT DIESES BUCH:
Das bewährte DATA-BECKER-Autorenteam mit Michael Angerhausen, Lothar Englisch, Klaus Gerits und Frank Thrun. Alle sind nicht nur begeisterte Programmierer, die ihren 64er in- und auswendig kennen, sondern auch bekannte Autoren vieler weiterer Bücher.

ISBN 3-89011-001-0

Inhaltsverzeichnis

1 Vorwort

2 Graphik für Fortgeschrittene
2.1 Graphik auf dem Commodore 64
2.2 3D Graphik – BASIC-Programm
2.3 Farbige Balkengraphik
2.4 Definition eines eigenen Zeichensatzes
2.5 Modifikation des Zeichensatzes mit dem Joystick
2.6 Der geteilte Bildschirm
2.7 Soft-Scrolling
2.8 Die Tastaturbelegung und ihre Änderung

3 Dateneingabe mit Komfort
3.1 Cursorpositionierung und Abfrage der Cursorposition
3.2 Cursor ein- und ausschalten
3.3 Repeatfunktion für alle Tasten
3.4 Der WAIT-Befehl: Warten auf einen Tastendruck
3.5 Die Belegung der Funktionstasten
3.6 Eine komfortable INPUT-Routine
3.7 Die “Maus” auf dem 64er: Simulation mit dem Joystick

4 BASIC für Fortgeschrittene
4.1 Oft versucht, selten gelungen: Erzeugen einer BASIC-Zeile in BASIC
4.2 Kopieren des BASIC-Interpreters ins RAM
4.3 Keine negativen Zahlen mehr bei der FRE-Funktion
4.4 Rückkehr ins BASIC-Programm nach LIST
4.5 GOTO, GOSUB und RESTORE mit berechneten Zeilennummern
4.6 Der MID$-Befehl
4.7 INSTR und STRING-Funktion
4.8 Automatische Zeilennummerierung
4.9 DEF FN das unbekannte Wesen
4.10 Ihr Commodore 64 spricht deutsch
4.11 Verwendung einer HARDCOPY-Routine für kommerzielle Programme
4.12 Mengenlehre auf dem CBM 64 am Beispiel der Berliner Ku’damm-Uhr unter Verwendung der Echtzeituhr und von Sprites
4.13 Ein kleiner Kopierschutz

5 Der CBM 64 kann nicht nur BASIC
5.1 Die Programmierung von FORTH
5.2 Vergleichsprogramm FORTH – BASIC
5.3 Weitere Sprachen: PASCAL, LOGO
5.4 ADA für den Commodore 64

6 CP/M auf dem Commodore 64
6.1 Das ist CP/M
6.2 Der Umgang mit den einzelnen CP/M Programmen
6.3 Die Anpassung von CP/M Standardsoftware an den 64er
6.4 Die Speicherverwaltung des Z80 Prozessors
6.5 Die Diskettenverwaltung unter CP/M
6.6 Die Zusammenarbeit der Prozessoren 6510 Z80
6.7 Kommentiertes BIOS-Listing
6.8 Implementierung eigener Ein-/Ausgabefunktionen ins BIOS
6.9 Übertragung von Programmen und Daten vom CP/M ins Commodore-BASIC und umgekehrt

7 Anschluß- und Erweiterungsmöglichkeiten des Commodore 64
7.1 Eine sinnvolle Anwendung des USER-Port am Beispiel eines Centronics-Druckers
7.2 Datenübertragung von und zu anderen Rechnern mittels USER-Port
7.3 Der Expansionport: Eine Fallstudie mit der CP/M-Cartridge
7.4 Synthesizer in Stereo

8 Dateiverwaltung: Kein Buch mit sieben Siegeln
8.1 Cassette – Diskette
8.2 Das Prinzip der Dateiverwaltung: Sequentielle Dateien
8.3 Kopieren von Dateien mit einem und zwei Laufwerken
8.4 So geht’s schneller: Relative Dateien
8.5 Eine andere Methode: Direktzugriff
8.6 Retten einer nicht ordnungsgemäß geschlossenen Datei
8.7 Der Blockverfolger

9 Poke’s und andere nützliche Routinen
9.1 Der Kassettenpuffer als Programmspeicher
9.2 Sortieren von Stringfeldern
9.3 Minimum und Maximum von numerischen Feldern
9.4 DUMP – Ausgabe sämtlicher Variablen und ihrer Werte
9.5 Modifizierte PEEK-Funktion
9.6 Multitasking auf dem Commodore 64
9.7 ΡΟΚΕ’s und die Zeropage
9.8 Kontrolle von Texteingaben über die Tastatur
9.9 Formatiertes Programmlisting
9.10 Retten von Variablen und Warmstart

6502 Illegal Opcodes in the Siemens PC 100 Assembly Manual (1980)

Michael Steil — Thu, 08 May 2025 12:17:06 +0000

The 6502’s “illegal” opcodes were of intense interest to home computer enthusiasts, and analyses were published in various magazines. But one would have never expected a company like Siemens to document illegal opcodes in a programming manual from 1980.

The PC 100 Assembly Manual

The Siemens PC 100 is basically a Rockwell AIM-65 single-board computer in a case, featuring a 6502 processor, integrated keyboard, LED display, and thermal printer, tailored for educational and development purposes with localized documentation and modified ROMs.

Siemens’ German-language manuals were largely based on Rockwell’s originals, but shuffled the contents. Of particular interest is the assembly manual:

Siemens Assembler-Handbuch Personal-Computer PC 100, Ausgabe 1980/1981
(124 pages, 17 MB)

It consists of:

Assembler-Handbuch	Rockwell	Description
Chapters 1–9	User’s Guide, Chapter 5	Assembler Reference
Chapter 10	Programming Manual, Appendix B	6502 Reference
Chapter 11	User’s Guide, Chapter 3	Monitor Reference
Chapter 12	–	Tables

“Special Instructions”

The 6502 Reference has three extra pages at the end describing “Sonderbefehle” (“special instructions”) that were not in the original MOS/Rockwell 6502 reference that this chapter is a translation of. We can assume this was original research by Siemens.

Here is the translated transcription:

10.3 Special Instructions

The microprocessor recognizes a number of special instructions that are largely unknown but can provide valuable assistance to the user in program development. The chosen mnemonics in the following tables are merely recommendations for effectively representing these instructions.

Note:
These instructions are not part of the specification and may be changed at any time without notice. The special commands cannot be decoded by the assembler program and must be programmed using the .BYT directive (see Chapter 10.2).

AAX Logical “AND” operation between the accumulator and X-register, with result storage.

Operation A ∧ X → M with Zero Page und (A) ∧ X ∧ $02 → M with Absolute

Flags

N	Z	C	I	D	V
–	–	–	–	–	–

Addressing Mode	Assembler Mnemonic	OP CODE	No. Bytes
Zero Page	AAX Oper	87	2
Zero Page, Y	AAX+16 Oper	97	2
X-Reg. ∧ $02 Absolute	AAX+23 Oper	9E	3
X-Reg. ∧ Accu ∧ $02 Absolute	AAX+24 Oper	9F	3

DCM Decrement memory location by one and compare result with accumulator.

Operation M – 1 → M and A – M

Flags

N	Z	C	I	D	V
V	V	V	–	–	–

Addressing Mode	Assembler Mnemonic	OP CODE	No. Bytes
Zero Page	DCM Oper	C7	2

LAX Load accumulator and X register

Operation M → A and M → X

Flags

N	Z	C	I	D	V
V	V	–	–	–	–

Addressing Mode	Assembler Mnemonic	OP CODE	No. Bytes
Immediate	LAX Oper	AB	2
Zero Page	LAX-4 Oper	A7	2

ISB Increment memory cell by one and subtract result from accumulator

Operation M + 1 → M and A – M → A

Flags

N	Z	C	I	D	V
V	V	V	–	–	–

Addressing Mode	Assembler Mnemonic	OP CODE	No. Bytes
Zero Page	ISB Oper	E7	2

Note:
The instructions LAX Immediate, AAX X reg $02 and AAX X reg accu $02 are not always processed correctly.

10.4 Programming in Assembly Language

Since the special instructions mentioned above cannot be decoded by the assembler program, they must be programmed using the .BYT directive.

Analysis

Opcodes	Siemens Mnemonic	Modern Mnemonic	Comment
87/97	AAX	SAX	Correct. Also exists with izx/abs addressing modes.
9E/9F	AAX	SHX/SHA	It’s aby, not abs. Also, their constant ($02) is actually the high-byte of the instruction address. (It should not be grouped with the other “AAX” opcodes.)
C7	DCM	DCP	Correct. Also exists with zpx/izx/izy/abs/abx/aby addressing modes.
AB	LAX	LAX	Basically correct. The documented unstability comes from bits of the A register sometimes bleeding into the calculation. (More info)
A7	LAX	LAX	Correct. Also exists with zpy/izx/izy/abs/aby addressing modes.
E7	ISB	ISC	Correct. Also exists zpx/izx/izy/abs/abx/aby addressing modes.

All in all:

They got the ones they designated as “stable” right, but were missing many addressing modes.
They got 9E/9F wrong (which they designated as “unstable”), probably by not testing enough inputs combinations.
They got AB right (actually unstable), but they did not research the root of the instability further.

We can assume that this is either original research by a Siemens author – or they copied it from some other source. If the information had come from MOS or Rockwell, they would surely have

listed all addressing modes.
not added a statement like “The instructions […] and […] are not always processed correctly.” – or better yet, omitted the unstable ones.

Credits

Thanks to Gerald Schiepeck for his AIM-65 and PC 100 exhibition at the VCFE 2025, and Marco Baye for bringing the section in the assembly manual to my attention.

Siemens Personal Computer PC 100 Bedienungsanleitung, Ausgabe 1981/1982

Michael Steil — Wed, 07 May 2025 15:44:39 +0000

The Siemens PC 100 was a version of the 6502-based “AIM-65” SBC in a case and with slightly modified ROMs. Siemens offered a set of German-language manuals, which included translated Assembler (MOS Resident Assembler) and BASIC (Microsoft BASIC) manuals, but also a general manual (“Bedienungsanleitung”).

This is the 1981/1982 version of the “Bedienungsanleitung”, describing the hardware and its interfaces, as well as the monitor, editor and BASIC software:

Hans Otten’s Retro Computing site has a lot more documentation, as well as ROM images. (This version of the manual was missing though, and has since been added.)

Siemens Personal Computer PC 100 Bedienungsanleitung, Ausgabe 1981/1982
(232 pages, 39 MB)

Brotkastenfreunde Interview

Michael Steil — Sun, 04 May 2025 12:58:02 +0000

In Folge 017 des C64-Podcasts Brotkastenfreunde bin ich diesmal Gast.

Das Titelbild vom 64’er Sonderheft 2/85

Michael Steil — Sun, 06 Apr 2025 19:48:34 +0000

Am 25. März 1985 kam das zweite 64’er Sonderheft mit dem Thema Abenteuerspiele raus. Genau 40 Jahre später ist es nun als HTML und PDF online verfügbar – im Kontext des Projekts 64er-magazin.de. Der Drachenreiter auf dem Original-Titelblatt ist der damals 18jährige Boris Schneider-Johne – wer genau schaut, wird auf der Website erkennen, dass wir das Bild aktualisiert haben:

(Das gilt nur für die HTMLs; in der PDF befindet sich das Originalbild.)

Als Bonus hier die Entstehungsgeschichte zu diesem Titelbild in Boris’ eigenen Worten:

Das Foto entstand Anfang 1985, zu dem Zeitpunkt war ich noch freier Autor bei 64’er und gleichzeitig Schüler im 13 Jahrgang vorm Abitur. Statt zu lernen war ich etwa zweimal in der Woche einfach in der Redaktion und hab dort mit den Redakteuren abgehangen (Pitstop II) und an Artikeln gearbeitet oder die lektorierten Artikel nochmal überarbeitet. Der Workflow war damals: Vizawrite 64, dann mit ca. 28 Zeichen / Zeile und doppeltem Zeilenabstand ausdrucken, damit zum einen ungefähr die Zeilenlänge des Heftes erreicht wurde und von einem Redakteur mit Stift Anmerkungen und Korrekturen gemacht wurden. Die Artikel wurden erneut als Ausdruck in den Lichtsatz gegeben und erneut abgetippt.

Der Markt und Technik-Verlag hatte damals zwei Fotografen, Janos Feitser und Jens Jancke, der sich mehr um die “Heimcomputer”-Redaktionen kümmerte, während Janos für die seriösen Hefte zuständig war. Neben dem “Haupthaus” waren die Redaktionen von 64’er und Happy Computer im dritten Stock der Hans-Pinsel-Straße 10 über einem bekannten Büromöbel-Geschäft. Dort gab es zwischen den Büros fensterlose Mittelgänge, eigentlich als Lagerraum gedacht, und in einem besonders großen von diesen hatte Jens sein Fotostudio unterbringen dürfen. Ein kleinerer von diesen Räumen war unsere Screenshot-Kammer, in der wir Monitore abfotografiert hatten, immer die lästigste Arbeit gerade bei den Spieleheften.

Jens kam bei einem meiner Besuche auf mich zu, ob ich Lust hätte, als Modell für ein Titelblatt des Adventure-Sonderheftes zu arbeiten. Ich fand die Idee total witzig (das war alles noch vor Power Play und Co, wo mein Gesicht dann intensiv zu sehen war). Ich bin an einem Samstag für die Fotosession ins Büro gekommen, das wurde dann alles ganz klassisch von Jens ausgeleuchtet, mehrere Polaroids gemacht, auf Mittelformat belichtet und dann vom Hausgrafiker René Nestler mit dem Drachen versehen. Soweit ich mich erinnere gab es auch 150 Mark Honorar für diesen Auftritt.

Mit Jens haben wir dann viel am Look der späteren Spielesonderhefte gearbeitet. Er war einer der ersten, der Deluxe Paint eingesetzt hat, um die als Modelle doubelnden Redakteure in virtuelle Welten zu kopieren. Von ihm stammt, per Selbstauslöser, auch das schöne Porträt von Heinrich Lenhardt, ihm und mir im ersten Spiele-Sonderheft auf Seite 3.

Kultpower Archiv: Komplettscan Happy Computer Spielesonderheft 1 (1985)

64'er Magazin – mit 40 Jahren Verzögerung jetzt monatlich im Web

Michael Steil — Wed, 20 Mar 2024 08:15:35 +0000

Zum 40jährigen Jubiläum des 64’er Magazins präsentieren wir das Kunstprojekt www.64er-magazin.de: eine Website, die so tut, als wäre 1984. Exakt 40 Jahre nach der ursprünglichen Veröffentlichung erscheint hier jeden Monat eine neue Ausgabe:

4/84: 20. März 2024 (Erstausgabe)
5/84: 19. April 2024
6/84: 18. Mai 2024
usw.

Auf der modernen Homepage gibt es

durchsuchbare PDF-Dateien der einzelnen Ausgaben
alle Artikel im Web-Format mit Kommentar-Funktion
alle Listings zum Download statt zum Abtippen
Übersichtsseiten für alle Tests, alle Listings etc. über alle Ausgaben hinweg
eine Suche über den Text aller Artikel
einen RSS-Feed, der ab Veröffentlichung jeden Tag zwei Artikel liefert
die Funktion, einen Artikel auf Mastodon zu teilen

Alle Artikel sind mit dem Text im gedruckten Magazin identisch, Schreibfehler und sachliche Fehler sind also unverändert. Errata aus späteren Ausgaben (“Fehlerteufelchen”) werden den Artikeln allerdings angehängt, und später dokumentierte Fehler in Software sind in den Downloads bereits behoben.

Verbleibende Unterschiede im Text sowie Verbesserungen der Website sind auf GitHub willkommen.

Silo S01E06: 38911 BYTES FREE

Michael Steil — Fri, 21 Jul 2023 20:53:22 +0000

[Ankündigung] Vortrag “Apollo Guidance Computer” an der Embedded Computing Conference in Winterthur

Michael Steil — Tue, 16 May 2023 12:44:05 +0000

This post is about an upcoming talk in German.

Am Dienstag, den 06. Juni 2023 um 9:00 gibt es auf der Embedded Computing Conference in Winterthur (bei Zürich) meinen Vortrag zum Thema “Apollo Guidance Computer im Kontext von modernen Embedded Systems und Echtzeit-Betriebssystemen”.

Beim Vortrag handelt es sich um eine abgewandelte und erweiterte Version des Ultimate Apollo Guidance Computer Talk von Christian Hessmann und mir. Diese Version vertieft sich in die Software-Architektur und betrachtet den AGC aus dem Blickwinkel von modernen Echtzeitsystemen – schließlich handelt es sich um eine Konferenz über Embedded Systems!

Die eintägige Konferenz besteht aus 30 weiteren Vorträgen auf 3 bis 4 Tracks. Der Eintritt ist nach vorheriger Anmeldung kostenlos.

The Easter Egg in the “Schrott-Tornado” at the Deutsches Museum

Michael Steil — Mon, 26 Dec 2022 19:21:05 +0000

The Deutsches Museum in Munich (Germany) has a new art installation as part of the reopened Electronics exhibition: The “Schrott-Tornado”, a tornado-shaped sculpture made from scrap electronics. There is (at least) one item in it that is most definitely not trash.

Here is a picture of the full sculpture:

Let’s zoom into the interesting part:

And this is a high-res photo of the detail. You might recognize it.

And now look closely at what’s written on the cartridge port shield.

Related: Here’s the Schrott-Tornado Making-Of video:

darmok.com: Memes in the Tamarian Language

Michael Steil — Sun, 25 Dec 2022 13:30:43 +0000

I have created darmok.com, a website that lets you share common memes in the Tamarian language.

Try it out, and fork at github.com/mist64/darmok!

PostScript Cartridge for HP LaserJet

Michael Steil — Sat, 24 Dec 2022 20:00:31 +0000

We have recently dissected and dumped the Level 2 “Plus” version of HP’s PostScript cartridge series. This time, we will look at the earlier Level 1 “PostScript Cartridge”.

Article Series

There are two cartridges for the HP LaserJet series that add Adobe PostScript support. They differ in the PostScript Level:

Year	Name	Description
1991	HP LaserJet PostScript Cartridge	PostScript Level 1
1991	HP LaserJet III PostScript Cartridge Plus	PostScript Level 2

Note that the article on the Level 2 “Plus” cartridge is the main one. This article only describes the differences of the Level 1 cartridge to the Level 2 cartridge.

Cartridge

The cartridge is about 9×14 cm in size.

The front says

HEWLETT PACKARD

PostScript* Cartridge

ITC Avant Garde Gothic
ITC Bookman
Courier
Helvetica
Helvetica-Narrow
New Century Schoolbook
Palatino
Times
ITC Zapf Chancery
TIC Zapf Dingbats*
Symbol
33439Q ©Hewlett-Packard 1989, 1990, 1991

HP
LASERJET
POSTSCRIPT®

The back says

Adobe and PostScript are registered trademark of Adobe Systems
Incorporated in the U.S, and other countries. Helvetica, Palatino and
Times Roman are registered trademarks of Linotype AG and/
or its subsidiaries in the U.S. and other countries. IT Avant Garde
Gothic, ITC Bookman, ITC Zapf Chancery and ITC Zapf Dingbats are
registered trademarks of International Typeface Corporation in the
U.S. and other countries.

Board

This is the very same board as used by the later “Plus” cartridge, except that it is fitted with only three instead of four ROM chips:

The ROM chips are:

two 512 KB Toshiba TC534200P mask ROM chips
one 512 KB Toshiba TC574200D-150 EPROM

Both types conform to the 27C400 pinout. The mask ROMs are marked with

© 1989-90 HP BOISE
© 1984-90 ADOBE
© 1981 LINOTYPE

and the EPROM is marked with

© 1986-91 HP BOISE
© 1984-91 Adobe
© 1981 Linotype AG

ROM

These are the verbatim dumps (adjacent bytes are swapped):

1818-4788, MD5 e2d740f3b15bfdf837b9385490e8dafd
1818-4789, MD5 1fe13fabb47f0719b9a840df8e5844e6
33439-60115, MD5 520d4f2e0ccf7f143fcdbe62f31792a2

This is the combined (byte-swapped) 1.5 MB ROM image:

HP PostScript Cartridge 33439Q ROM, MD5 929ba4050a8a48c3ed4761c3f9267837

The ROM image starts with a signature of “SYST” and the following messages at 0x30:

C V6.20 PSCRIPT
06.20
Copyright © Hewlett-Packard Company, 1991. All rights reserved.

(This part comes from the EEPROM.)

Just like the “Plus” cartridge, the ROM contains Adobe’s PostScript rasterizer (level 1 in this case) compiled for the 68000 CPU, the PostScript base fonts as well as some LaserJet-specific software (messages, errors and settings texts for the 15 char display in several languages).

PostScript Files

Like in the “Plus” cartridge, there is some PostScript source code in the ROMs:

(The missing %! file header has been added to the downloads.)

FONTPAGE

“FONTPAGE” is identical to the file in the “Plus” cartridge, see there.

TEST PAGE

“TEST PAGE” prints various internal printer settings which are unsupported by computer-based PostScript rasterizers, so the file has been patched to work with the the macOS 12.6 PSNormalizer.framework rasterizer (based on Adobe Acrobat Distiller 5.0):

STARTUP PAGE

Manuals and Extras

Future Work

Are there versions of this cartridge with different EPROM contents?

Scanntronik Manuals

Michael Steil — Fri, 23 Dec 2022 18:00:18 +0000

The German company “Scanntronik” offered a lot of high-quality hardware and software for the Commodore 64 series computers, most in the space of graphics and desktop publishing. They are well-known for their Pagefox and Printfox software as well as their Handyscanner 64 hardware. This page offers most of the German-language manuals from across their product range as searchable PDFs.

If you have any additional manuals, or manuals in different languages, please reach out to me and we can get them added.

Pagefox

PAGEFOX

Tips & Tricks für den PAGEFOX

Printfox

Printfox Errata

Character Fox; Erweiterungsdisk 1 zum Printfox

Die Fox-Bibel zum Printfox-Basar

Videofox

VIDEOFOX

VIDEOFOX II

MOVIES FÜR DEN VIDEOFOX

COLOUR MOVIES; Eine Kollektion von Farbbildern und Vorspännen für den Videofox II

Eddison & Eddifox

EDDISON

EDDIFOX

Cheese

Bedienungsanleidung Malprogramm Cheese

Cheese Add-on

Colourprinter

Handyscanner 64

Catalogs

Scanntronik-Katalog 12/90

Scanntronik Katalog [ca. 1993+]

Katalog-Beilage [ca. 1993+]

Scanntronik-Katalog 12/92

Scanntronik Bestell-Postkarte

The Commodore AUTOMODEM (Model 1650)

Michael Steil — Thu, 22 Dec 2022 17:06:57 +0000

The Commodore 1650, also known as the “AUTOMODEM”, is Commodore’s first full modem directly connected to the phone line. It supports pulse dialing in software and 300 baud duplex connections.

Historical Context

Year	Name	Model	Description
1982	VICMODEM	1600	connected to phone’s handset connector; manual dialing through phone; Motorola MC14412
1982	AUTOMODEM	1650	connected to phone line, pulse dialing in software; Motorola MC14412
1985	MODEM/300	1660	added tone dialing support by feeding SID output into modem; Texas Instruments TMS99532A
1987	MODEM/1200	1670	Hayes command set; pulse and tone dialing in hardware; 300/1200 baud support; U.S. Robotics chipset

Photos

On the front, there is the VIC-20/C64 user port connector and an activity LED.

On the left, there is
* the “LINE” jack that is supposed to be connected to the telephone network
* the “PHONE” jack for connecting an existing telephone
* a D/T switch. In D mode, the modem takes over the phone line, in T mode, “LINE” gets passed through to the telephone.
* an A/O switch. When making a modem call, it is to be put into the “O” (originate) position, and when answering a call, it is to be put into the “A” (answer) position.

On the right, there is an H/F switch to select Half Duplex (“H“) or Full Duplex (“F“).

The label on the bottom says:

FCC ID: B4V8N2AUTOVIC
Commodore Business Machines, Inc
Made in USA

Certified to comply with the limits for a Class B
computing device pursuant to Subpart J of
Part 15 of FCC Rules. See instructions if
interference to radio reception is suspected.

Complies with Part 68, FCC Rules; FCC
Registration Number B4V8N2-70317-
DM-R; Ringer Equivalence 0.1B; Jack
(USOC) RJ11

Model 1650
Serial Number: 018208

The only text on the board is “00312A” on the back. The core component is the Motorola MC14412 modem chip at the bottom center.

The design of the AUTOMODEM is basically the same as its predecessor’s, with the extra circuitry added to allow it work with the phone line directly instead of acting as the handset of an existing telephone.

Box

Manual

A 1960s Children's Book about Computers

Michael Steil — Wed, 21 Dec 2022 21:43:22 +0000

The 1963 book “Robots and Electronic Brains” (by Robert Scharff) from the “How and Why Wonder Books” series is an early children’s book about computers. Let’s look at some of the interesting contents – and how the German translation “Was ist was: Roboter und Elektronengehirne” from 1967 changed some details.

Historical Context of the Books

Published in 1963, this might very well be one of the first children’s books on computers ever. To put this into context, this was:

less than 20 years after the first workable computers
four years after the introduction of the IBM 1401, one of the first commercial computers based on transistors (as opposed to tubes), weighing five tons and with programs on punch cards
six years before the moon landings

With this context, the somewhat weird title “Robots and Electronic Brains” might be more understandable:

Computers are called “electronic brains”, since “computers” might not have been a common enough term (especially for children!) at the time.
Even though the book is mostly about computers, “robots” are mentioned first in the title, because they are a familiar concept. Robots like “Robby” (“Forbidden Planet”, 1956) were part of the pop culture.

While the original German version went with a literal translation, later editions updated the title to “Computer und Roboter” – computers and robots. They kept updating it until 1999, but it’s now out of print.

Differences in the German Version

The German translation was done by Käte und Heinrich Hart. While the book retains the same chapters and the identical layout, the text was slightly adapted. Here are some examples:

Who invented computers?

This question has no easy answer, as both versions correctly state. Yet the original version claims it’s an American invention – this part has been removed from the German version.

English	German (translated back)	German
The electronic computer, among the foremost American inventions of this century, was not an overnight discovery. It is the fruit of the practical science of mathematics and has its roots far in the past.	Electronic computing systems, one of the most significant technical achievements of this century, were not accidental inventions. They are the result of developed modern technology and applied mathematical science. The origin of mathematics lies far in the past.	Die elektronischen Rechenanlagen, eine der bedeutendsten technischen Leistungen dieses Jahrhunderts, waren keine Zufallserfindungen. Sie sind das Ergebnis der entwickelten modernen Technik und der angewandten mathematischen Wissenschaft. Der Ursprung der Mathematik liegt weit in der Vergangenheit.

The translation also adds an explicit credit to the German Leibniz for his calculator:

English	German (translated back)	German
The first adding machine, invented in 1642, was followed by a four-operation arithmetic machine composed of a difference engine that performed calculations, a mechanical tabulator, a punch-paper control system, and a differential analyzer. Although these inventions increased computation speeds, they failed to fulfill the needs of our complex world.	The first simple adding machine was invented in 1642, the first calculating machine for all four basic arithmetic operations around 1672 by the German philosopher and mathematician Leibniz.	Die erste einfache Addiermaschine wurde im Jahre 1642 erfunden, die erste Rechenmaschine für alle vier Grundrechnungsarten um 1672 von dem deutschen Philosophen und Mathematiker Leibniz.

The two versions of the book heavily diverge on the topic of the first “workable” computer:

English	German (translated back)	German
In 1936, a young Harvard physicist, Professor Howard Aiken, happened upon some of the writings of Dr. Babbage. Like Babbage, Dr. Aiken saw the possibility of a robot that could do the thinking of hundreds of men in a fraction of the time it took any one of them to work out routine mathematical problems. Aiken teamed up with other researchers and, by 1944, they built the first workable computer.	An electromechanical calculator was first built in Germany in 1941 by K. Zuse. At the same time, similar types were being worked on in North America.	Eine elektromechanische Rechenmaschine wurde zuerst im Jahre 1941 in Deutschland von K. Zuse gebaut. Zur gleichen Zeit wurde in Nordamerika an ähnlichen Typen gearbeitet.

Some historical context from today’s perspective: There were actually several electronic or electromechanical and more or less general-purpose computers in the 1940s. The original version of the book picked the Harvard Mark I by the team around Howard Aiken as the first “workable” computer. The German adaptation replaced this with the Z3 of the German Konrad Zuse, which pre-dated the Mark I as the first programmable computer by 3 years.

To be fair to the original book, Zuse’s work had been largely unknown outside of German-speaking countries until at least the late 1970s. The talk “The Early Development of Digital Computing In Central Europe” given by Friedrich L. Bauer at the 1976 First International Research Conference on the History of Computing in Los Alamos, NM presented Zuse’s work to an international audience. Zuse himself also had a talk at the conference.

In fact, even the Colossus Mark 1 at Bletchley Park predated the Harvard Mark I (by 3 months), but the UK’s codebreakers efforts were still secret at the time the books were written – in fact, they were revealed at the very same conference in 1976.

Here is what the two versions say about ENIAC:

English	German (translated back)	German
Two years later, the first general-purpose, all-electronic computer, called the ENIAC computer (from Electric Numerical Integrator and Calculator), was built. ENIAC was the grandfather of today’s electronic brains, room-size robots who answer to the unlikely names as UNIVAC, STRETCH, MANIAC, UNICALL, MINIVAC, SEAC, and BIZMAC.	In 1946, there was also the first real electron computer in the USA, named ENIAC (Electronic Numerical Integrator And Computer). ENIAC was, so to speak, the ancestor of today’s electron brains, the room-sized robots. Some American manufacturers give them names like UNIVAC, MANIAC, UNICALL, MINIVAC and BIZMAC; others, including the German manufacturers, refer to their various computer types only by numbers.	1946 gab es auch in den USA den ersten wirklichen Elektronenrechner, ENIAC genannt (Electronic Numerical Integrator And Computer). ENIAC war sozusagen der Ahnherr der heutigen Elektronengehirne, der zimmergroßen Roboter. Einige amerikanische Hersteller geben ihnen Namen wie UNIVAC, MANIAC, UNICALL, MINIVAC und BIZMAC; andere, auch die deutschen Hersteller, bezeichnen ihre verschiedenen Computertypen nur mit Nummern.

The German version downgrades ENIAC into an “also-ran”. In all fairness, ENIAC should be credited as the first working computer designed to be Turing-complete.

Finally, the German text clarifies that the UNIVAC-style naming scheme does not apply to all computers, especially non-US ones.

Does an electronic brain ever fail?

Thankfully (and surprisingly), the German version removes the sexism from the garbage-in/garbage-out chapter:

English	German (translated back)	German
A computer, of course, gives wrong answers if given wrong information. One experiment with the decision-making ability of computers was a failure. A television quiz program used a computer to select the ideal wife for a contestant. To accomplish this, the programmer fed into the machine all facts known for a perfect marriage – likes and dislikes, interests in various hobbies, movies, music, food, etc. When the computer compared the qualifications of many women with those of the male contestant, it recommended one as ideal. But, when the two got to know each other, they decided they were mismatched and should not marry each other. Whose fault was this? The machine programmer’s? Perhaps it only proves that even a computer cannot understand a woman’s mind.	Of course, if a computer is fed the wrong information, it will give a wrong answer, but other attempts to use a computer’s capability can also lead to failure. Example: In a television program, a computer was used to find out the ideal wife for a certain man. The programmer gave the electronic computer all the characteristics desired for an ideal marriage – likes and dislikes, interests in various hobbies, in cultural values, and so on. After the computer compared the characteristics of many women with those of the man in question or according to his wishes, it declared a particular woman to be the ideal partner. But when the two got to know each other, it turned out that they did not like each other. Whose fault was that? The programmer’s? Perhaps the experiment only proves that the human heart and its inclinations are not calculable.	Natürlich gibt ein Computer, wenn er mit falschen Angaben gefüttert wurde, eine falsche Antwort. Aber auch andere Versuche, die Fähigkeit eines Computers zu nutzen, können zum Mißerfolg führen. Ein Beispiel: In einem Fernsehprogramm wurde ein Computer dazu benutzt, jeweils für einen bestimmten Mann die ideale Ehefrau herauszufinden. Der Programmierer gab dem Elektronenrechner alle für eine ideale Ehe gewünschten Eigenschaften auf – Neigungen und Abneigungen, Interessen an verschiedenen Hobbies, an kulturellen Werten usw. Nachdem der Computer die Eigenschaften vieler Frauen mit denen des betreffenden Mannes oder gemäß dessen Wünschen verglichen hatte, erklärte er eine bestimmte Frau zur idealen Partnerin. Aber als die beiden dann einander kennenlernten, stellte sich heraus, daß sie sich nicht sympathisch waren. Wessen Fehler war das? Des Programmierers? Vielleicht beweist der Versuch nur, daß das menschliche Herz und seine Neigung nicht berechenbar ist.

Complete Comparison

Here are both books side by side. If you find any more interesting details (or differences), please add them in the comments of this article!

Digitizing Analog Video through a Digital Camcorder

Michael Steil — Sat, 19 Nov 2022 22:06:04 +0000

This article explains a setup and workflow for digitizing analog video (e.g. VHS, Beta, Video 2000, LaserDisc, …) using a Mac and digital camcorder – in high quality and with interlacing intact; optimized for archival. We will use a old-school digital camcorder (they are cheap!) to convert the analog signal to a high-quality digital “DV” stream and then record the DV stream on a Mac using a FireWire connection.

The Problem with Interlaced Video

Standard definition analog video is either

576i50 (PAL/SECAM): 25 full frames per second of 720×576 pixels, interlaced
480i60 (NTSC): 30 full frames per second of 720×480 pixels, interlaced

Interlaced means that every full frame is split into two “fields”, one with the picture’s 288 (PAL; NTSC: 240) odd lines, and one with the 288 (240) even lines. These two fields can represent one moment in time: When combining them, they will give 25 (30) full 720×576 (720×480) frames. Or they can be recorded 1/50 (1/60) second apart, giving motion at a resolution of 50 (60) Hz, but at half the vertical resolution, and with the set of lines alternating every time.

Interlaced video was natural for old CRT displays, but in order to show interlaced video on modern displays or encode them into modern video compression formats, they need to be converted into progressive format, i.e. deinterlaced.

The two fields are always transmitted one after the other, and it is unclear whether every two fields should be combined (“comb filter”) for a 25 (30) Hz video or whether the 50 (60) fields should be vertically upscaled for a 50 (60) Hz video. Using the wrong method means either combing artifacts or flickering.

The worst part is when the pairing of fields is inconsistent, like in this example of Futurama or whenever two scenes are composited in the original SD version of Star Trek TNG.

So deinterlacing is hard, especially if you want to do it well. When digitizing analog video, it is best to keep the interlacing intact. When playing the file, VLC for instance will already do pretty good deinterlacing by default – and tomorrow’s VLC will certainly do a better job. And if you come back to the footage years later to share it or reuse parts in an HD video, you can use the best deinterlacer available then!

So all in all, if you want to archive analog video, you should keep it interlaced. Many simple solutions won’t do this, but this solution does.

Digital Video and DV

A full, uncompressed digital representation of a PAL signal is 50 times a second a 720×576 image at 24 bits per pixel, which is 25x720x576x24: almost 240 Mbits/sec.

This implies “4:4:4”, meaning every 2×2 pixels have four brightness values (Y’) and four values each for the two color components (Cb and Cr). Most digital formats use chroma subsampling, meaning that a 2×2 pixel grid has fewer chroma values. Since the human visual system is less sensitive to color than it is to brightness, 4:2:2 (half horizontal chroma resolution) is practically indistinguishable from 4:4:4.

The 1986 D-1 format uses 4:2:2 chroma subsampling and thus reduces the data rate by a factor of 1.5 to about 160 MBits/sec. Sony’s 1993 Digital Betacam additionally uses lossy compression to reduce the data rate by a factor of 2.34:1 down to about 70 MBits/sec. Both formats were meant for professional use.

The consumer format for digital SD camcorders is the 1994 DV (“Digital Video”). It uses 4:2:0 chroma subsampling (one color value per 2×2) for PAL and 4:1:1 (one color value per 4×1) for NTSC, which reduces 4:4:4 data by a factor of 2. The resulting data is lossily DCT-compressed by a factor of 5, leading to a data rate of 25 MBit/sec.

DV embeds an uncompressed 48 kHz 16 bit stereo PCM audio stream, which adds 1.5 MBit/sec (or alternatively, two 32 kHz 12 bit stereo streams with the same total bitrate).

Unlike today’s common compression methods (e.g. MPEG-2, H.264, H.265), DV compresses images individually, so in the stream, there are no dependencies between images. You can imagine DV as a stream of 64 KB JPEG images. This allows frame-exact editing and allows for simpler encoder and decoder hardware, but means that a higher data rate is required for the same quality. A 25 MBit/sec DV stream is roughly equivalent to a 10 MBit/sec MPEG-2 stream and a 3 MBit/sec H.264 stream.

DV is an excellent match as an intermediate digital representation or even as an archival format for pretty much all analog media, it even surpasses LaserDisc and matches DVD in luma and chroma resolution (PAL values; NTSC are similar):

Format	Luma	Chroma
VHS	320×576	40×288
Betamax	333×576	40×288
S-VHS	560×576	40×288
Broadcast	440×576	120×288
LaserDisc	560×576	120×288
DVD	720×576	360×288
DV	720×576	360×288¹

DV’s DCT-based compression is lossy but quite gentle, and given the increased resolution of DV compared to all analog media, it will capture virtually all data from the analog media.

Digital SD Camcorders

There are two types of digital standard definition camcorders:

MiniDV (1995) was the industry standard. It uses a proprietary cassette format.
Digital8 (1999) by Sony re-uses the same tapes as the analog Hi8 format.

Both Digital8 and MiniDV camcorders store a DV stream on tape and all devices come with a 4-pin FireWire/IEEE-1394/i.LINK connector that allows losslessly copying the DV stream from the camcorder to a second device or to a computer, or copying a DV stream to the device.

In addition to an analog video output (for connecting it to a TV), many camcorders also have an analog composite or even S-Video input. Unless the firmware has this feature disabled to avoid the European tax on video recorders (check the manual!), these camcorders can record analog video from an external input, or output a live digitized DV stream over FireWire (“DAC”).

This is what we will be using, so the tape format of the camcorder does not matter, since we won’t be using tape.

Setup

You need

a high quality player for your VHS, Beta, Video 2000, LaserDisc etc. media
a composite or S-Video cable plus audio to connect the player to the camcorder¹
a Digital8 or miniDV camcorder that supports digitizing external sources (“DAC” functionality).
an Apple Mac with a FireWire or Thunderbolt port, so:
- any iMac, Mac mini, Mac Studio, Mac Pro or MacBook Pro
- MacBook Air since Mid 2011
- MacBook up to (!) Mid 2009, except Aluminum 2008

Depending on what kind of Port your Mac has, you need the following cables and adapters:

Port on Mac	Cables	Images
FireWire 400 (1999-)	4-pin (DV) to 6-pin (FW 400)
FireWire 800 (2009-)	4-pin (DV) to 9-pin (FW 800)
Thunderbolt 1/2 (2011-)	4-pin (DV) to 9-pin (FW 800) Thunderbolt to FireWire adapter
Thunderbolt 3/4 (2016-)	4-pin (DV) to 9-pin (FW 800) Thunderbolt to FireWire adapter Thunderbolt 3 (USB-C) to Thunderbolt 2 Adapter

The older the Mac, the fewer adapters you will need, but the more hassle it will be installing the necessary software. The sweet spot seem to be late FireWire 800 Macs (model years 2011-2012) or Macs with Thunderbolt 1/2 (model years 2011-2015).

Installing Tools

macOS supports DV video over FireWire natively, so you can open QuickTime Player, and select “File -> New Movie Recording…” to preview what is being transmitted by the camcorder. While QuickTime Player can record, the resulting video will already be deinterlaced using the low-quality “blend” method.

Apple’s iMovie has a dedicated DV/FireWire import function that will save MOV-encapsulated DV video, but has the habit of silently stopping recording when there is an empty area of the source tape.

The open source ffmpeg tool not only allows grabbing the original DV bits from FireWire, it can also convert video in all kinds of formats.

If you are running macOS 11 (Big Sur) or later, install Homebrew, and then install ffmpeg with this Terminal command:

brew install ffmpeg

Homebrew should also work but is unsupported on 10.11 (El Capitan) through 10.15 (Catalina). If Homebrew does not work on your version of macOS, you may want to find an ffmpeg binary through other means or consider upgrading to a later version of macOS – maybe even through OpenCore Legacy Patcher.

Digitizing

Make sure the camcorder’s audio is configured to 48 KHz 16 bit mode (as opposed to 32 KHz 12 bit). Connect the camera to the Mac and switch it into “PLAY” or “DAC” mode. Then run the following Terminal command to list the available capture devices:

ffmpeg -f avfoundation -list_devices true -i ""

On my MacBook Pro, this prints:

AVFoundation video devices:
[0] DCR-TRV520E
[1] FaceTime HD Camera
[2] Capture screen 0
AVFoundation audio devices:
[0] Speaker Audio Recorder
[1] MacBook Pro Microphone

It detected the Sony DCR-TRV520E. We will have to pass this device name whenever we want to read DV data with ffmpeg.

To capture the video tape into a DV stream, enter the following command (replacing the device name) and press PLAY on the VCR immediately after.

ffmpeg -f avfoundation -capture_raw_data true -i "DCR-TRV520E" -c copy -map 0 -f rawvideo video.dv

Once the tape is finished, press Ctrl+C to stop recording. If you want to automatically stop the recording after a certain time, you can add something like -t 4:10:00 (4h 10m) to the command line.

You can monitor the progress by looking at the camcorder’s viewfinder or LCD, or by using this command in a different Terminal window:

tail -f video.dv | ffplay -i -

Compressing

While DV is an excellent archival format, it’s also big: about 11 GB per hour. MPEG-2 can slash this by a factor of three (4.5 GB per hour). The following line recompresses the DV into DVD-quality MPEG-2, leaving the interlacing intact.

ffmpeg -i video.dv -b:v 10M -flags +ildct+ilme video.vob

Here is the corresponding line to encode the video in H.264 at 3 MBit/sec, which is about the same quality, and also with interlacing intact. This will be a little more than 1 GB per hour – 10 times smaller than DV.

ffmpeg -i video.dv -b:v 3M -flags +ildct+ilme video.mp4

While H.265 has some basic support for interlaced video, neither ffmpeg nor VLC support it without tricks, and AV1 does not support interlaced video at all. Consequently, H.264 is effectively the latest compression format that you should use to store interlaced video.

On-the-fly Compression

Any Intel or Apple Silicon Mac can also encode MPEG-2 in real-time. The following line skips the intermediate DV file:

ffmpeg -f avfoundation -capture_raw_data true -i "DCR-TRV520E" -c copy -map 0 -f rawvideo pipe:1 | ffmpeg -i - -b:v 10M -flags +ildct+ilme video.vob

And this is the line to encode straight into H.264. You will need a 2015 or newer Mac for this, otherwise it won’t be able to keep up with the incoming data.

ffmpeg -f avfoundation -capture_raw_data true -i "DCR-TRV520E" -c copy -map 0 -f rawvideo pipe:1 | ffmpeg -i - -b:v 3M -flags +ildct+ilme video.mp4

Deinterlacing

If you need the video in progressive format, you can use the following commands to deinterlace the video. You should try the 50/60 Hz command first, which will retain motion smoothness. If the resulting video contains every frame twice (verify by single stepping with the arrow keys in QuickTime), the original material was 25/30 Hz, so you can to re-do the deinterlacing with a frame rate of 25/30.

MPEG-2, 10 MBit, 50/60 frames per second:

ffmpeg -i video.dv -vf yadif=1:-1:0 -b:v 10M video.vob

MPEG-2, 10 MBit, 25/30 frames per second:

ffmpeg -i video.dv -vf yadif=0:-1:0 -b:v 10M video.vob

H.264, 3 MBit, 50/60 frames per second:

ffmpeg -i video.dv -vf yadif=1:-1:0 -b:v 3M video.mp4

H.264, 3 MBit, 25/30 frames per second:

ffmpeg -i video.dv -vf yadif=0:-1:0 -b:v 3M video.mp4

H.265, 1.5 MBit, 50/60 frames per second:

ffmpeg -i video.dv -vf yadif=1:-1:0 -c:v libx265 -b:v 1.5M -tag:v hvc1 video.mp4

H.265, 1.5 MBit, 25/30 frames per second:

ffmpeg -i video.dv -vf yadif=0:-1:0 -c:v libx265 -b:v 1.5M -tag:v hvc1 video.mp4

While the YADIF filter in ffmpeg does a pretty good job, there are now also machine-learning based tool like Topaz Video AI for deinterlacing.

Remember though that you should archive the original interlaced data, since deinterlacing is lossy.

Limitations

This method only captures the 720×576 (720×480) video signal and one stereo audio track of your media. Depending on the media, there may be information that is not captured:

PAL broadcast recordings usually contain teletext, albeit with lots of errors, because of the insufficient bandwidth of tape.
NTSC broadcast recordings and pre-recorded media usually contain closed captioning.
VHS contains a mono track and an optional HiFi stereo track. VHS players pick the HiFi stereo track if it exists, so this solution will not capture the mono track which, in theory, could contain entirely different audio.
LaserDisc can contain digital audio (PCM, Dolby Digital or DTS). If you want to capture the audio losslessly, you need to record it in parallel using an S/PDIF connection.

In general, the recording will only be as good as the player can decode the media. If you don’t have access to a good player or if the media has defects, you may want to talk to a company that specializes in digitization services.

Simpler Solutions

If the solution described in this setup seems overkill, there are simpler solutions as well:

A DVD recorder (or a VHS/DVD combo device like the Panasonic DMR-EX98V and DMR-EX99V) can convert analog sources (or VHS directly) into high-quality interlaced MPEG-2 files written to a recordable DVD. The downside is that these devices are usually limited to two (DVD-5), maybe four (DVD-9, i.e. dual-layer) hours of recording at 10 MBit/sec because of the limitied capacity of a DVD.
There are devices that connect to the analog signal on one side and to a computer’s USB port on one side. The PC/Mac software will usually create a deinterlaced MP4 file.
Technology Connections describes a solution where you connect the video player to an analog-to-HDMI box, and its output in turn to a device that records HDMI onto an SD card. This will also deinterlace the video.

Dissecting a Dummy Promo MiniDisc

Michael Steil — Thu, 17 Nov 2022 23:08:35 +0000

Many pre-recorded MiniDiscs are rare and expensive. An extra rare special case is the dummy promo copy of Michael Jackson’s “Dangerous”, which we will dissect in this article.

Just looking at the case, the dummy promo copy looks like the regular retail MiniDisc:

Instead of the inner pages with the song lyrics, the booklet only contains a sheet with the MiniDisc logo:

The front of the MiniDisc looks inconspicuous:

But instead of the track listing, the back only contains a sticker saying “NOT PLAYABLE”:

If you look closely, you can see that the actual disc behind the shutter has “DUMMY” etched into it. The full text is “DUMMY-CDF01-1/100”.

When trying to play the MiniDisc, any player will complain that it cannot read the TOC (table of contents). But a closer look reveals that the data area is not in fact empty:

Like audio CDs, MiniDiscs are played from the inside to the outside. The innermost area (outside of the readable text) looks uniform, which usually indicates an empty area. Then, there are three areas of what looks like random data, with smaller regular stripes in between – this reminds of tracks on a vinyl record. For comparison, here is a proper pre-recorded MiniDisc:

The empty area is at the end of the MiniDisc, that is, on the outside. The data is in one large area – individual tracks are in fact not visible.

The random areas do look like they contain some data, so we have to try to read it! But how do you play a MiniDisc without a valid TOC? By hot-swapping a good MiniDisc with this one! I used a toothpick on the door detection switch of a SHARP MD-MT15 so I could open the door without the player noticing, as described in my article about dumping MiniDiscs:

No matter which of the 11 tracks (i.e. offsets) of the good MiniDisc I picked, the player always shows a READ ERROR after a few seconds. It seems the “data” area does not in fact contain any correctly formatted data sectors.

Yes, it’s disappointing, but I chose to publish this anyway to counter publication bias, and to maybe motivate someone else to come up with a different method to analyze what’s on these dummy MiniDiscs!

Here’s a question though: What was the point of these dummy promo MiniDiscs? What were they used for?

References

Toutes les versions de “Dangerous” en MD by macaddict77, 2020-08-12
Discogs: Dangerous (Promo, Not Playable – Mock-Up)
Discogs: Dangerous (Promo)

The Commodore VICMODEM (Model 1600)

Michael Steil — Mon, 27 Jun 2022 15:25:28 +0000

The Commodore 1600, also known as the “VICMODEM”, is Commodore’s very first modem (1982): It supports 300 baud duplex connections, and is connected to an existing telephone’s handset connector instead of the phone line. This kept the price down, but required the user to dial manually through the phone.

Historical Context

Year	Name	Model	Description
1982	VICMODEM	1600	connected to phone’s handset connector; manual dialing through phone; Motorola MC14412
1982	AUTOMODEM	1650	connected to phone line, pulse dialing in software; Motorola MC14412
1985	MODEM/300	1660	added tone dialing support by feeding SID output into modem; Texas Instruments TMS99532A
1987	MODEM/1200	1670	Hayes command set; pulse and tone dialing in hardware; 300/1200 baud support; U.S. Robotics chipset

Photos

On the back, there is the RJ11C handset connector. On the side, there is an LED indicating that indicates when the phone is transmitting or receiving, and a switch that can be put into the “A” (answer) or “O” (originate) position, depending whether a phone call is supposed to be made or accepted.

On the front, there is the VIC-20/C64 user port connector.

The label on the bottom says:

FCC ID: B4V8N2VIC 20
Commodore Business Machines, Inc.
Made in USA

Certified to comply with the limits for a Class B
computing device pursuant to Subpart J of
Part 15 of FCC Rules. See instructions if
interference to radio reception is suspected.

COMPLIES WITH PART 68, FCC RULES FCC
REGISTRATION NUMBER B4V8N2-68331-
KX-N RINGER EQUIVALENCE O.0B JACK
(USOC) N.A. (KX)

Model 1600
Serial Number: 093333

The only text on the board is “PWB 00201 B” on the back. The core component is the Motorola MC14412 modem chip (bottom, second from the left).

The RJ11 connector on the back must be connected to an existing phone, instead of its handset, like this:

The VICMODEM does not talk on the phone line level, but on the handset level, this reusing the hardware in the existing phone. This allowed the modem to be produced for under $33, so it could be the first modem to be sold for under $100¹.

The flip side of this design was that the modem could not dial or answer by itself. To dial a number, it had to be keyed into the telephone with the handset connected, and once one would hear that the remote side picked up, switch the cable to the modem. To wait for a call and answer it, the modem has to be connected and the (unconnected) handset has to be in the cradle. Once it rings, the handset has to be picked up and set aside.

This is exactly how one would operate an acoustic coupler. After all, this modem is more of an acoustic coupler with a direct audio connection than a full modem.

Box

In late 1983, the modem was already sold for under $70.

Manual

Tape

64 TERM.PRG (2577 bytes)

Michael Tomczyk: Commodore VIC-20 Developer, Computer Pioneer ↩

PostScript Cartridge Plus for HP LaserJet III

Michael Steil — Tue, 21 Jun 2022 06:31:49 +0000

The HP LaserJet III laser printer from 1990 used the “Printer Command Language” PCL 5 by default, but could be upgraded with the “HP PostScript Cartridge Plus” cartridge, which contained 2 MB of ROM with Adobe’s PostScript Level 2 rasterizer. Let’s look at the ROM contents and some of its hidden gems.

Cartridge

The cartridge is about 9×14 cm in size.

The front says

HEWLETT PACKARD

PostScript Cartridge Plus

ITC Avant Garde Gothic®
ITC Bookman®
Courier
Helvetica®
Helvetica-Narrow
New Century Schoolbook
Palatino®
Times®
ITC Zapf Chancery®
TIC Zapf Dingbats®
Symbol
C2089A ©Hewlett-Packard 1989, 1990, 1991

HP
LASERJET III
POSTSCRIPT®

The back says

Adobe and PostScript are registered trademark of Adobe Systems
Incorporated in the U.S, and other countries. Helvetica, Palatino and
Times Roman are registered trademarks of Linotype AG and/
or its subsidiaries in the U.S. and other countries. IT Avant Garde
Gothic, ITC Bookman, ITC Zapf Chancery and ITC Zapf Dingbats are
registered trademarks of International Typeface Corporation in the
U.S, and other countries.

Board

Here is the front without the components:

The board contains 6 74-series logic chips:

1x SN74ALS139N: Dual 2-to-4 Decoder/Demultiplexer
4x SN74ALS244BN: Octal Buffer and Line Driver with 3-State Output
1x SN74LS32N: Quadruple 2-Input Positive-Or Gates [marked as HP part number 1820-1208]

and four 512 KB mask ROM chips of the type Fujitsu MB834200B-15 (27C400 pinout). They are all marked with

© 1991 HP-BOISE
© 1984-90 ADOBE
© 1981 LINOTYPE AG
© 1991 FUJITSU

ROM

These are the verbatim dumps (adjacent bytes are swapped):

1818-5336 / 16A AA, MD5 258464faa19a1ff78bbb57270eec8835
1818-5318 / 03A AA, MD5 da113b6c6c53e21858b30a71c7be017c
1818-5319 / 04A AA, MD5 f6e368806aa8caf22b4c28d235a2df1d
1818-5320 / 05A AA, MD5 ed1700895daeac733f80ce20278c4a64

This is the combined (byte-swapped) 2 MB ROM image:

HP PostScript Cartridge Plus C2089A ROM, MD5 8a5d1f66ab1624e7188fc07154f4224d

The ROM image starts with a signature of “SYST” and the following messages at 0x30:

V9H-18f PSCRIPT
09.H
Copyright © Hewlett-Packard Company, 1991. All rights reserved.

The ROM contains Adobe’s PostScript Level 2 rasterizer compiled for the 68000 CPU, the PostScript base fonts as well as some LaserJet-specific software (messages, errors and settings texts for the 15 char display in several languages).

PostScript Files

There is also some PostScript source code in the ROMs!

(The missing %! file header has been added to the downloads.)

Since this is printer-specific PostScript code, it may not work with computer-based rasterizers, so let’s go over them one by one.

FONTPAGE

This is “FONTPAGE” converted to PDF using GPL GhostScript:

fontpage.pdf

The PostScript code contains the product operator, which returns the name of the printer, so the second line – “GPL Ghostscript printer” – would read “HP LaserJet III printer” on an actual LaserJet.

TEST PAGE

“TEST PAGE” prints various internal printer settings which are unsupported by computer-based PostScript rasterizers, so some lines had to be removed for the file to work. These are the files hacked for different rasterizers:

Adobe Acrobat Distiller 5.0 (2001)

test_page_acrobat.ps

test_page_acrobat.pdf

macOS 12.4 PSNormalizer.framework

GhostScript 9.56.1

(The almost identical contents of Acrobat Distiller 5.0 and Apple’s PS to PDF converter built into macOS (down to the internal version number!) is no coincidence: Apple’s converter is in fact a licensed “Adobe Normalizer 5.0”¹, the same engine powering Distiller 5.0.)

There is a second page which only prints if the product is “HP LaserJet IIP” or “HP LaserJet IIIP”:

STARTUP PAGE

There is no such device as a “LaserJet IIx” – this is what the PostScript code falls back to if the product is none of these:

“HP LaserJet IID”
“HP LaserJet IIP”
“HP LaserJet III”
“HP LaserJet IIID”
“HP LaserJet IIIP”

Tests for these can be found across all PostScript code in the ROM. The IIID (“duplex”) and IIIP (“personal”) and variants of the LaserJet III. HP never offered PostScript for the LaserJet II series, so it is unknown why these product names show up in the ROM.

Future Work

There are several open questions that might be interesting:

What’s up with PostScript for the LaserJet II?
Did the cartridge extend the printer’s internal ROM or replace it?
What other PostScript features are supported that are not official API?
What is the computer inside the LaserJet III like? Can we emulate it and run this rasterizer on a computer?
What is the pinout of the cartridge connector?

strings /System/Library/PrivateFrameworks/PSNormalizer.framework/Versions/A/Resources/PS.VM | grep -i Adobe↩

CCGMS Future 0.2

Michael Steil — Sun, 20 Mar 2022 17:56:09 +0000

CCGMS Future 0.2 was just released. It adds 80 columns support, a true ASCII charset (in 80c mode), and bug fixes.

The Punter C1 Protocol

Michael Steil — Tue, 15 Mar 2022 01:17:43 +0000

The Punter file transfer protocol (“New Punter”/“Punter C1”) is an alternative to the XMODEM family of protocols, which was and still is very popular on BBSes for Commodore computers. It is notorious for being badly documented. Let’s fix that.

I quickly developed quite a distaste for the badly designed punter protocol. Especially with the awful and rather smug “documentation”.

— MagerValp

The Protocol

The protocol allows the transmission of one file from the “sender” to the “receiver”. No file name, but a 1-byte file type is transmitted (0=PRG, 1=SEQ).

For handshaking, the three-byte ASCII strings GOO, BAD, ACK, S/B, SYN are used.

The transmission of a file consists of two almost identical phases:

Phase A transmits the file type, and consists of a lot of handshaking and overhead to transmit a single block with a single payload byte (the file type).
Phase B transmits the file data in one or more blocks. “B2” is repeated for each block.

A File Type

B File Contents

A1 Start

	Sender	Receiver
1	`GOO`
2		`GOO`

B1 Start

	Sender	Receiver
17		`GOO`

A2 Block

	Sender	Receiver
3	`ACK`
4		`S/B`
5	block data
6		`GOO`*

B2 Blocks(repeated)

	Sender	Receiver
18	`ACK`
19		`S/B`
20	block data
21		`GOO`*

A3 End-Off

	Sender	Receiver
7	`ACK`
8		`S/B`
9	`SYN`
10		`SYN`
11	`S/B`
12	(wait 1s)	(ignored)
13	`S/B`
14	(wait 1s)	(ignored)
15	`S/B`
16	(wait 1s)	(ignored)

B3 End-Off

	Sender	Receiver
22	`ACK`
23		`S/B`
24	`SYN`
25		`SYN`
26	`S/B`
27	(wait 1s)	(ignored)
28	`S/B`
29	(wait 1s)	(ignored)
30	`S/B`
31	(wait 1s)	(ignored)

*Step 6/21: After receiving the block data, if the checksum is incorrect, the receiver sends BAD, moving the protocol to step 3/18 with the same block.

Blocks

Every block has a 7 byte header:

offset	size	description
0	2	“additive” checksum
2	2	“cyclic” checksum
4	1	size of next block
5	2	block index

The two checksums are calculated over all bytes of the block, starting with index 4.

The “additive” checksum is calculated by adding all bytes.
The “cyclic” checksum is calculated by XORing all bytes, rotating the 16 bit result left after each byte.

Block sizes can be freely chosen and can vary from block to block. The maximum size is 255. This includes the 7 header bytes, so the payload is up to 248 bytes. Every block contains the size of the following block. The size of the first block is fixed (per phase). (For the last block in a sequence, this field is unused.)

The block index starts with 0. The last block (or only block) in a sequence has index -1 ($FFFF), though it is enough to only check the hi byte. This allows file sizes of almost 16 MB.

File Type Transmission

File type transmission only consists of a single 8 byte block. Its index is -1, since it is the last block. The single payload byte is the file type.

File Data Transmission

File data transmission starts with a 7 byte header block that contains no payload. Its purpose is to communicate the size of the first block that actually contains file data.

Bugs

Checksum

The two checksums are 32 bits total, yet CRC16 would probably provide better integrity guarantees.

End-Off Sequence

Steps 11-16 and 26-31 in the “End-Off” sequence in both phases are buggy. In the original implementation

The sender repeats the following three times: transmit S/B, then keep reading a byte from the modem about 5000 times, discarding the data. This takes about one second on a C64.
The receiver expects just one S/B, then proceeds to the next step. The GOO sent in step 17 will be ignored by the sender.

Because of the resends in the original implementation, the protocol can recover from this.

(Lines 5070-5120 in the original source are responsible for this. The call to accept with an argument of 0 will continue reading bytes from the modem until it times out, because it can’t match any of the codes. Maybe this was meant to match one or any specific code at some point.)

New implementations should expect three S/B and then wait for two seconds before continuing.

Multi-Punter

Multi-Punter is a very simple extension to the protocol that can transmit several files at a time and includes the file names and types. It has no error detection or re-send capability on a file level whatsoever.

Every file is transmitted like this:

16× code 0x09 (TAB)
FILENAME,P (PRG) or FILENAME,S (SEQ)
code 0x0D (CR)

…followed by the transfer using the Punter protocol.

After the last file, the following is sent:

16× code 0x09 (TAB)
16× code 0x04 (EOT)
code 0x0D (CR)

Documentation

punter_c1.txt: The original documentation by Steve Punter with additional comments by Geoffrey Welsh and Matthew Desmond
Thread on CSDb, mostly by Oliver VieBrooks (Six), about reverse engineering the protocol

Implementations

The original 6502 source by Steve Punter was posted to net.micro.cbm in 1985. Almost all software supporting the Punter protocol was based on this implementation.
- The binary and a BASIC program to control it are #1797 and #1798 in the GEnie Commodore File Library
CCGMS Future contains a cleaned-up and heavily commented version of the original code.
C*Base contains an independent 6502 implementation. (Source is available in the downloads.)
There is a C implementation as part of CGTerm by Per Olofsson (MagerValp). It only supports downloading and interoperates with the C*Base implementation, but not the original implementation. There is a hacked version in the CCGMS Future repo that works with the original code (and CCGMS).
CBMTerm by Oliver VieBrooks (Six) contains a C# implementation, including Multi-Punter.
There is also a Rust implementation called punter-server by Steven Walter.

UP9600: How to Bit-Bang 9600 Baud RS-232 on the C64

Michael Steil — Mon, 07 Mar 2022 18:11:14 +0000

The user port of the Commodore 64 exposes a TTL-level RS-232 serial port that supports up to 1200 baud¹. In 1997, Daniel Dallmann came up with a very sophisticated trick that allowed sending and receiving at 9600 baud², using slightly different wiring and a dedicated driver. This “UP9600” wiring has become the de-facto standard for all modern accessories, like C64 WiFi modems. Let’s see how UP9600 works.

History

Before diving into the details of RS-232 and the UP9600 solution, let’s look at some historical context.

MOS 6551 ACIA

MOS made an RS-232 chip for the 6502: the 6551 ACIA (“Asynchronous Communications Interface Adapter”), which, per specification, can support up to 19200 baud. Commodore used it in the SuperPET (1981), the CBM-II series (1982) as well as the business-oriented Plus/4 (1984). It is exposed through a KERNAL driver as device #2.

6551 Emulator

For cost saving reasons, the VIC-20 and its successors, the C64 and C128³, did not contain a 6551 chip. Instead, Commodore included a bit-banging driver in the KERNAL that emulated the 6551 and exposed it as device #2. This emulator supports up to 2400 baud, but due to DMA from the VIC-II video chip (“badlines”), only speeds up to 1200 are stable on the C64 (and the C128 in 40 columns mode).

Software 2400 baud

In the article “Toward 2400” in Transactor volume 9, issue 3 (Feburary 1989), George Hug presented software to achieve 2400 baud reliably on the C64 without any hardware modifications⁴.

SwiftLink-232

The primary use for RS-232 on the C64 was for modems, and as modems faster than 2400 baud became available, Dr. Evil Laboratories released the $30 SwiftLink-232 cartridge for the C64 expansion port in 1990. It contained a 6551 chip that could even reach 38400 baud, thanks to doubling the rate of the external oscillator.

UP9600

In 1997, Daniel Dallmann created UP9600, a solution that allowed 9600 baud on the user port. The idea is to use the hardware shift registers of the two CIA 6526 I/O controllers to do the timing-critical part of the transfer.

RS-232 Pins on the User Port

Every Commodore 8 bit computer except for the C16/C116 has a user port, and all user ports except for the original PET support TTL-level RS-232 through eight dedicated pins.

The following table describes the C64 user port. The RS-232 pins are marked in red:

Pin	Description	Pin	Description
1	GND	A	GND
2	+5V	B	/FLAG2
3	/RESET	C	PB0: RS-232 RXD
4	CNT1	D	PB1: RS-232 RTS
5	SP1	E	PB2: RS-232 DTR
6	CNT2	F	PB3: RS-232 RI
7	SP2	H	PB4: RS-232 DCD
8	/PC2	J	PB5
9	SER ATN IN	K	PB6: RS-232 CTS
10	9 VAC	L	PB7: RS-232 DSR
11	9 VAC	M	PA2: RS-232 TXD
12	GND	N	GND

RXD is the receive line, and TXD is the transmit line. The remaining lines deal with flow control, among other things, and are optional.

RS-232

Before we can talk about UP9600, we first need some understanding of the RS-232 serial protocol.

To send serial data from one sender to one receiver, only a single data wire is needed. Beforehand, the two devices need to agree on the baudrate (e.g. 1200 bits/sec) and the format of each unit of data: how many data bits, whether there is an added parity bit, and how many stop bits. The most common parameter setting (and in fact the only one supported by the original UP9600 driver) is 8-N-1, meaning 8 data bits, no parity, and a single stop bit.

The transmission of one byte in 8-N-1 mode will look like this on the wire:

“1” on the wire signals an idle line: the sender has no new data to send.
The transmission of a byte is indicated by sending a “0” for the duration of one bit.
The 8 data bits are transmitted afterwards, LSB first.
After that, a “1” is sent for at least the duration of one bit.
If there is no additional data, the line stays at “1”.

Data can be sent back to back if there are multiple bytes available for sending:

As usual, the 8 data bits are followed by a stop bit (“1”), then immediately by a start bit (“0”) and then the next 8 data bits.

The point in time when the sender sets the wire from “1” to “0” for the start bit is the synchronization point for the transmission of the following byte. The receiver will have to sample the line at the right intervals counting from the sync point for the following 8 data bits and the stop bit. The next start bit will re-sync sender and receiver again, so the clocks of sender and receiver only have to be matching well enough for the duration of the transmission of a single byte.

After the line has been idle, a start bit can be sent at any time, it does not have to fall into the time raster of the bit rate. As just described, time starts anew with each start bit for both sender and receiver.

If the receiver starts listening while the sender is already in the middle of a transmission, incorrect data will be received. The first “0” bit it encounters will be understood as the start bit, and the next 8 bits will be the data byte. If the following bit is not a “1”, the receiver will discard the byte, and wait for the next “0” bit, which hopefully is the correct start bit this time. In the worst case, it will detect a few garbled bytes, but statistically, the receiver will sync itself over time.

The CIA 6526 Shift Register

The core idea of UP9600 is to use the two otherwise unused hardware shift registers in the C64, which can automatically send or receive a byte bit-by-bit over a single wire, faster than the CPU would be able to do it.

The C64 has two CIA 6526 I/O controllers, each of which contains a serial shift register that can transfer bytes using a clock and a data line to a peer that speaks the same protocol, e.g. the CIA in another computer⁵. They are unused by a stock C64, but exposed on the user port. Here is the table again, this time with the shift register wires marked in red:

Pin	Description	Pin	Description
1	GND	A	GND
2	+5V	B	/FLAG2
3	/RESET	C	PB0
4	CNT1	D	PB1
5	SP1	E	PB2
6	CNT2	F	PB3
7	SP2	H	PB4
8	/PC2	J	PB5
9	SER ATN IN	K	PB6
10	9 VAC	L	PB7
11	9 VAC	M	PA2
12	GND	N	GND

CNT1 and SP1 are the clock and data lines of CIA#1, and CNT2 and SP2 are the clock and data lines of CIA#2.

In order to send a byte, we need to set timer A so it fires at the correct interval between bits⁶

    timer = (system_freq) / (2 * bit_output_freq) - 1

and set it to continuous mode, set the serial port to output, and then write the byte to the serial data register. On every timer underflow, the CNT line will be toggled, and on every falling edge, the next bit is put on the SP line – this is why the timer has to fire twice per bit, explaining the extra factor of 2 in the divisor of the formula.

Once all 8 bits are shifted out, a bit in the interrupt control register is sent, which can optionally trigger an interrupt. The SP line will retain the value of the last bit.

To send a continuous stream of bytes, you have to write the first two bytes to the serial data register immediately after one another – the CIA will start sending out the first byte and cache the second one – and after each interrupt, a new byte can be written to the shift register, while the previous one is still being shifted out.

To receive a byte through the shift register, you have to set it to input mode, and it will sample a bit on every rising edge of CNT. Again, after 8 bits, it will set a bit in the interrupt control register and optionally trigger an interrupt. Now, the 8 bits can be read from the serial data register.

CIA#1 and CIA#2

The two 6526 CIA chips in the C64 are identical, but connected differently.

Most pins of the user port, and all RS-232 pins go to CIA#2.
The serial ports of both CIAs are exposed on the user port.
The interrupt output of CIA#1 is connected to the CPU’s IRQ line, while the interrupt output of CIA#2 is connected to the CPU’s NMI line.

Receiving data is more timing-critical than sending, so we will use CIA#2 for receiving, since NMIs are useful for tighter timing requirements.

Sending RS-232 Data

Now how do we use the shift register to send data in the RS-232 transmission format? After all, it was not exactly designed with RS-232 in mind:

The user port wiring of the serial port is different than what the KERNAL driver uses.
The order of the bits in a byte is reversed in the CIA compared to RS-232.
The shift register works on 8 bits at a time, while RS-232 deals with groups of 10 bits.

TXD Wiring

The data output of the CIA#1 shift register is wired to SP1 (pin 5) on the user port, but the old RS-232 KERNAL driver outputs data on PA2 (pin M). This means that existing user port RS-232 hardware (which uses pin M) wouldn’t be compatible. But new hardware that is aware of UP9600 can just bridge pin M to pin 5, and it will be compatible both with the original pinout and with UP9600.

The additional clock output of the CIA is not a problem: It will appear at pin 4 (CNT1) on the user port, and existing user port RS-232 hardware wouldn’t have it connected, and there is no reason to connect it on new hardware.

Bit Order

The shift register sends data with the most significant bit first, while RS-232 sends the least significant bit first. The 8 data bits will have to be reversed in order before sending them. The fastest solution is a 256 byte lookup table. The UP9600 code actually uses a 128 byte lookup table and patches the remaining bit into the resulting byte, trading some execution speed for memory.

Sending 10 Bits

The trickiest problem in sending is that the shift register works on 8 bits at a time, while for RS-232, each data byte results in a total of 10 bits: one start bit, 8 data bits and one stop bit. If all the data to be sent were known beforehand, one could shift the data bytes into a set of bytes that corresponds to the RS-232 bit stream and continuously write them to the shift register:

Unfortunately, software that wishes to send data often doesn’t have all the data available, or data arrives from software in bursts with pauses in between. In a terminal program, text typed by the user arrives one byte at a time, with significant pauses. During file transmission (e.g. XMODEM), a number of bytes will be sent together, but without additional communication, the RS-232 driver wouldn’t know at what point it needs to flush the accumulated data. It’s possible to do it this way, but it is quite complicated.

Instead, the UP9600 software sends every data byte as 16 bits, the 10 RS-232 bits (one stop bit, 8 data bits, one stop bit), and 6 filler bits with a value of “1” – you can see them either as additional stop bits or as a signal that the line is idle.

While this only achieves 62.5% of the peak data rate – 600 bytes/sec instead of 960 bytes/sec, assuming 9600 baud – this is completely legal and will be understood by any receiver. It does not look any different than a sender that had to add a little extra pause between bytes.

So in practice, the UP9600 code always sends one data byte at a time, like this (using CIA#1):

Set the serial port to output.
Enable serial port interrupts.
Set timer A to 1022727 / (2 * 9600) – 1 = 52 (NTSC) or 985249 / (2 * 9600) – 1 = 50 (PAL)⁷.
Set the timer to continuous and start it.
Write a “0” (start bit) and bits 0 through 6 to the serial data register – note the reversed bit order!
Write the remaining bit and seven “1” bits (stop bits) to the serial data register. Again, note the bit order.

The next byte can be written once two interrupts have been triggered by the CIA.

(There is a little added complication: Serial timing has to come from the timer A, but the C64 KERNAL has timer A of CIA#1 set up as the 60 Hz interrupt source that deals with updating the TI$ clock, scanning the keyboard and blinking the cursor. We have to migrate the 60 Hz interrupt to timer B if we want to continue using these KERNAL services.)

Let’s look at the timing of this code: We have to make sure that the second byte is written before the first byte is fully transmitted. Otherwise, the last bit of the first byte (i.e. second-to-last data bit) would stay on the line too long, leading to incorrect data transmission. With 9600 baud, we have a window of about 100 clock cycles.

Every 8 raster lines, the VIC-II takes control from the CPU for 40 cycles (“badlines”).
We could be interrupted by an NMI caused by the receiving part of the UP9600 software – see below.

As long as the receiver NMI code takes less than 60 cycles, we’re good at 9600 baud, even when a badline and an NMI occur at the same time. In fact, even higher speeds could be possible by avoiding VIC-II badlines – after all, we control when to start the transmission of a byte. But since the baud rate is the same for sending and receiving, it’ll be the receiving part that will limit the maximum bit rate.

Receiving RS-232 Data

Receiving is also tricky because:

Again, the user port wiring of the serial port is different than what the KERNAL driver uses.
A start bit and thus the transmission of a data byte can happen at any time, and at 9600 baud, the window for a single bit is just 100 cycles.
The shift register has no way to shift in a bit every n cycles.

RXD Wiring

The data input of the CIA#2 shift register is wired to SP2 (pin 7) on the user port, but the old RS-232 KERNAL driver receives data on PB0 (pin C). As with the output wire, new hardware that is aware of UP9600 can just bridge pin C to pin 7 and be compatible with both drivers.

Detecting the Start Bit

To detect a start bit, we have the following options:

Busy waiting. This way, there will be no way to do anything else on the system.
A timer interrupt that fires every 100 cycles, so we can check the line. This would be a lot of interrupts, slowing down the system significantly.
Finding a way to have the stop bit cause an interrupt.

In fact, it is possible to have an external pin on the user port generate an interrupt – a falling edge on FLAG2 (pin B) will make CIA#2 trigger an NMI. So UP9600-aware hardware needs to bridge PB0 (pin C), which normally carries RDX (receive line) to FLAG2 (pin B).

The UP9600 software thus enables the FLAG interrupt on CIA#2 and hooks the NMI vector. Once the NMI fires, it sets up the CIA#2 to read the next 8 bits into the shift register.

Using a CIA Timer for the Clock

The trickiest problem in receiving is the fact that for data input, the shift register has to be clocked from the CNT line, and we don’t have such a signal! There is no way to tell the shift register to just sample the data wire every n clock cycles.

The trick is to have the CIA generate a matching clock signal, output it on the user port, and use a bridge on the UP9600-aware device to feed this signal back into the CIA through the CNT2 pin.

The CIA has a way to send a pulse every n cycles on pins PB6 and PB7. When enabled, an underflow of timer A will pulse PB6, and an underflow of timer B will pulse PB7. And both these CIA outputs are available on the user port! UP9600 uses timer B for this, so we need a bridge from PB7 (pin L) to CNT2 (pin 6).

So inside the NMI handler for the start bit, we have to

Disable FLAG interrupts – we are not listening for a start bit any more.
Set the serial port to input
Enable serial port interrupts.
Set timer B to 1022727 / (9600) – 1 = 106 (NTSC) or 985249 / (9600) – 1 = 102 (PAL).
Set the timer to continuous, set it to pulse on PB7 and start it.
Change the NMI vector to the second NMI handler.

The second NMI handler will then

Read the data byte from the serial data register and reverse the bit order.
Disable timer B.
Enable FLAG interrupts for the next start bit.
Change the NMI vector to the first NMI handler.

Note that we never read the stop bit. This means that the UP9600 does not plausibility check of the data. If the receiver starts listening while the sender is already in the middle of a transmission, the UP9600 driver will pass several garbled bytes to the receiving software and take longer than necessary to eventually sync itself to the signal.

UP9600 Wiring

So for a C64 user port RS-232 device to be UP9600-aware, all it needs to do is bridge the following pins:

Pin 1	Pin 2	Description
M (PA2)	5 (SP1)	RS-232 output to CIA#1 Serial Port
C (PB0)	7 (SP2)	RS-232 input to CIA#2 Serial Port
C (PB0)	B (/FLAG2)	RS-232 input to CIA#2 FLAG, for NMI on start bit
L (PB7)	6 (CNT2)	CIA#2 timer B pulse to CIA#2 Serial Port clock

UP9600 does not work on a C128 in 128 mode, since it uses the CIA#1 shift register for communication with Fast Serial disk drives (1571, 1581, …). It will even hang on boot if a Fast Serial device is attached. To work around the latter, a jumper can be installed to disconnect M from 5.

Why is 9600 the Maximum Baudrate?

The highest baudrate that is possible on a C64 with the described algorithm is 9600 baud.

At this speed, a bit arrives roughly every 100 cycles. It takes about 30 cycles from the moment of the start bit NMI to disable the FLAG NMI, enable the timer and hook the serial port NMI. 50 cycles after the start bit NMI would be the center of the start bit and the optimal time to sample every 100 cycles, so starting the timer 30 cycles into the start bit is fine.

The interesting part on the C64 are VIC-II badlines, which can stall the CPU for 40 cycles at practically any time. If a badline happens just as the NMI is taken, setting up the CIA to read the 8 data bits will be finished 40 cycles later, i.e. at 70 cycles after the moment of the start bit NMI. This is still fine, because in either case, we will fall into the window where the start bit is valid.

The next highest standard baudrate would be 19200, with a bit every 50 cycles. With the same algorithm, the following will happen: If there is no badline, we will set up the CIA at 30 cycles again, and it will sample every 50 cycles, so pretty much in the center of each bit. Great. But if there is a badline, we’re already at 70 cycles, which is in the middle of the first data bit. The first bit would have to be read by the serial port immediately, 50 cycles later would be too late.

We can’t detect whether a badline just happened, but since the window to read a bit is 50 cycles and the fuzz introduced by badlines is just 40 cycles, we have a chance to time it just right to read the bit within the window.

Depending on whether there is a badline, the latency from the moment of the start bit NMI to the point where we can access the CIA is 30 to 70 cycles. The process would be to:

Wait 25 cycles, so we fall into the 55 to 95 cycle range, in the center of the window of bit #0. These cycles can be used to set up the shift register and disable the FLAG NMI.
Output a manual pulse on PB7 to sample the first bit into the shift register immediately.
Set up the timer to pulse PB7 every 50 cycles and start it.

The implementation of UP19200 is left as an exercise to the reader.

Source Code

The original UP9600 source code is available as part of an email from Daniel Dallmann to the developer of Novaterm, dated 30 Nov 1997. It is designed as a library that can be used from BASIC or assembly.
Bo Zimmerman maintains a version of UP9600 as part of the ZiModem repository that hooks itself into the KERNAL vectors for the Channel I/O calls, so UP9600 can be used from BASIC or assembly using the existing KERNAL interface.
The CCGMS terminal program contains a version adapted by alwyz. It is the only version to also support, in addition to 9600, baudrates of 300, 1200, 2400 and 4800. The current version in the repository contains fixes, cleanpus and comments by myself.

over5 by Daniel Kahlin can do 38400/8N2 with the screen off.
Retroterm (CSDb) by Jorge Castillo can do 57600/8N1 with the screen off, and – using RTS/CTS flow control – an effective throughput of 1500 (PAL) or 1800 (NTSC) baud equivalent by only receiving in the border area.

It is supposed to support 2400 baud as well, but this speed does not work reliably in practice. ↩
Sending won’t achieve the full data rate though; we’ll cover this in the article. ↩
Development machines of the C128 did contain a 6551. ↩
A modern version of this software exists in the form of a cc65 driver. ↩
This is the same shift register that was supported, but broken in the MOS 6522 VIA chip, which is why the 1541 disk drive was so slow. ↩
The timer value has to be one less than the quotient, because in continuous mode, automatic reloading of the timer takes one extra cycle. ↩
1022727 and 985249 are approximations of the respective system clock rates in Hz. The exact numbers are 4/14 * 315/88 for PAL and 4/18 * 4433618.75 for NTSC. The two constants are the respective colorbust frequencies.↩

The Commodore Modem/1200 (Model 1670)

Michael Steil — Sat, 05 Mar 2022 19:23:11 +0000

The Commodore 1670, also known as the “Modem/1200”, is Commodore’s first Hayes-compatible modem: It connects directly to the phone line and supports pulse and tone dialing for 1200 and 300 baud duplex connections. There were two revisions, the original 1670 and the “new” 1670, a.k.a. CR-1670. (The unit in this article is the later revision.)

Historical Context

Year	Name	Model	Description
1982	VICMODEM	1600	connected to phone’s handset connector; manual dialing through phone; Motorola MC14412
1982	AUTOMODEM	1650	connected to phone line, pulse dialing in software; Motorola MC14412
1985	MODEM/300	1660	added tone dialing support by feeding SID output into modem; Texas Instruments TMS99532A
1987	MODEM/1200	1670	Hayes command set; pulse and tone dialing in hardware; 300/1200 baud support; U.S. Robotics chipset

Photos

On the front, there is the user port connector.

On the back, there are

two phone line connectors. “LINE” is connected to the telephone network, and an existing telephone can be connected to the “PHONE” line.
four DIP switches (default to down):

DIP	Description
1	Auto Answer Enable	down: auto-answer on second ring disabled
2	Carrier Detect Enable	down: carrier detect on pin H-K of edge connector (needed for Plus/4)
3	Speed Indicate Enable	down: speed indicate on pin J of edge connector
4	Data Terminal Ready	down: DTR always on (instead of computer controlled)

The label on the bottom says:

C= COMMODORE

MODEL NO. 1670

SERIAL NO. CA1111714

COMPLIES WITH PART 68, FCC RULES; FC
REGISTRATION NUMBER BR98YV-19442-MDE
RINGER EQUIVALENT 0.4A 0.6B; JACK (USOC) RJ11

CERTIFIED TO COMPLY WITH CLASS B LIMITS,
PART 15, SUBPART J OF FCC RULES. SEE
INSTRUCTIONS IF INTERFERENCE TO RADIO
RECEPTION IS SUSPECTED.

FCC ID BR98YV1670
MADE IN USA 310476-03

The board is marked “COMMODORE CR-1670 MODEM” and “A/W 311956 REV 4”.

The 1670 is based on a U.S. Robotics (“USR”) chipset:

U2: © USR'85 U2J2 / OKI C49-387 / JAPAN 7X2033: This is an OKI MSM80C49-387RS, a 80C49 microcontroller with 2KB of mask ROM and 128 bytes of RAM. “387” indicates which ROM image it contains.
U3: © USR'85 U3J2 / OKI C49-388 / JAPAN 7Y2013: This is an OKI MSM80C49-388RS, another 80C49 microcontroller, with a different ROM image (“388”).
U4: © USR86 / USR101 16-249 / S8749 / 35561: This socketed IC is probably the ROM that holds the bulk of the modem’s firmware.

There is a speaker glued to the top shell that allows monitoring the audio data on the line.

The modem comes with a phone cable.

Box

Manual

QuantumLink Material

Disk

The Hayes/AT Interface

ATI/ATI0 (product code) prints
```
  121

  OK
```
ATI1 (ROM checksum) prints
```
  1003

  OK
```
The manual defines registers (ATSn=a, ATSn?) 0-8, 10-12 and 16. Except for 16 (self test), they are identical to other U.S. Robotics modems – as late as the 1997 Sportster Flash x2!
Register 9 is undocumented. Later U.S. Robotics manuals document it as:

Sets the required duration, in tenths of a second, of the remote modem’s carrier signal before recognition by your modem. (default: 6)

On the 1670, register 9 reads back as 6, so it may as well be the same feature.

The registers just seem to map to the 128 bytes of RAM of one of the 80C49 microcontrollers, they wrap around at 128, e.g. 130 is the same as 2. Here is a complete dump after power-on; much of the data may be random, but note the current AT command (S19?) starting at register 0x11 (17):

  00000000  00 00 2b 0d 0a 08 02 1e |..+.....|
  00000008  02 06 07 46 32 00 02 00 |...F2...|
  00000010  00 53 31 39 3f ff ff 00 |.S19?...|
  00000018  12 01 00 00 00 00 02 04 |........|
  00000020  20 00 00 04 00 00 20 a4 | ..... .|
  00000028  00 80 00 00 00 00 50 01 |......P.|
  00000030  00 00 00 00 00 00 04 00 |........|
  00000038  20 80 00 00 02 00 00 00 | .......|
  00000040  98 02 00 00 00 40 02 80 |.....@..|
  00000048  05 00 10 00 00 7f 47 81 |......G.|
  00000050  00 00 0a 00 86 10 02 63 |.......c|
  00000058  d4 23 30 24 bd 63 6e 63 |.#0$.cnc|
  00000060  00 08 00 00 00 00 04 00 |........|
  00000068  6b 00 00 00 e5 66 00 08 |k....f..|
  00000070  80 60 00 65 01 3b 00 07 |.`.e.;..|
  00000078  01 80 65 00 00 0a 01 00 |..e.....|

Like all Bell 212A-compatible modems, when on a 1200 baud call, the data rate between the modem and the C64 is actually 1219 bits/sec. The manual states that it is necessary to specify this manual bitrate when using the KERNAL’s software RS232 implementation:
```
OPEN2,2,2,CHR$(0)+CHR$(0)+CHR$(61)+CHR$(1)
```
Modern versions of CCGMS won’t work with the 1670 for this reason.

Open Questions

How to dump the ROM chip? I had no success reading it as a 2764..27512.
How to dump the ROM of the 80C49 microcontrollers?

The Commodore Modem/300 (Model 1660)

Michael Steil — Wed, 02 Mar 2022 00:55:00 +0000

The Commodore 1660, also known as the “Modem/300”, is Commodore’s first full-featured modem: It connects directly to the phone line and supports pulse and tone dialing for 300 baud duplex connections.

Historical Context

Year	Name	Model	Description
1982	VICMODEM	1600	connected to phone’s handset connector; manual dialing through phone; Motorola MC14412
1982	AUTOMODEM	1650	connected to phone line, pulse dialing in software; Motorola MC14412
1985	MODEM/300	1660	added tone dialing support by feeding SID output into modem; Texas Instruments TMS99532A
1987	MODEM/1200	1670	Hayes command set; pulse and tone dialing in hardware; 300/1200 baud support; U.S. Robotics chipset

Photos

On the front, there is the user port connector. On the left, there is a switch that can be put into the “A” (answer) or “O” (originate) position, depending whether a phone call is supposed to be made or accepted.

On the back, there are

two phone line connectors. “LINE” is connected to the telephone network, and an existing telephone can be connected to the “PHONE” line.
an RCA audio connector. The circuitry in the modem only does the 300 baud data transmission part once the telephone connection is established – tone dialing is done by using software to generate the audio using the C64’s sound chip, which is looped into this connector! (Pulse dialing is also done in software, by timing on-hook and off-hook events.)

The label on the bottom says:

C= commodore

MODEL NO. 1660

SERIAL NO. 158297

COMPLIES WITH PART 68, FCC RULES; FC
REGISTRATION NUMBER BR 9608-15671-DM-E
RINGER EQUIVALENT 0.4A 0.6B; JACK (USOC) RJ11

CERTIFIED TO COMPLY WITH CLASS B LIMITS,
PART 15, SUBPART J OF FCC RULES. SEE
INSTRUCTIONS IF INTERFERENCE TO RADIO
RECEPTION IS SUSPECTED.

FCC ID BR 9608-1660
MADE IN: HONG KONG 310476-01

The board is marked “MAGIC MODEM TI-1660” and “ARTWORK NO. 310484 REV 4”. The “TI” might stand for “Texas Instruments”, the manufacturer of the TMS99532A modem chip on the top left of the board, which is the central component of the device.

There is a speaker glued to the top shell that allows monitoring the audio data on the line.

The modem comes with one phone cable.

And there are two cables that connect the C64’s audio output to the modem:

The DIN to RCA cable takes the audio signal from the C64/C128 AV connector. It is used if the C64 is connected to a TV through the RF connector, so the AV connector is available.
The RCA Y-cable takes the audio signal from the monitor cable. This is used if the C64 is connected to a monitor using the AV connector.

Page 8 in the manual visualizes this:

Box

Manuals

Disk

Announcing CCGMS Future 0.1

Michael Steil — Fri, 25 Feb 2022 15:32:06 +0000

The CCGMS Terminal Program for the Commodore 64 is maintained again, and there is a new version: CCGMS Future 0.1, with bug fixes and new features.

History

CCGMS has a rich history: It was originally written in 1985-1988 by Craig Smith, then binary patched by many people over the years, and finally maintained again by alwyz from 2016 to 2020, based on the rediscovered source code.

Cleanup

As a first step, I cleaned up the source of the last version (v2021), splitting it into multiple files, renaming symbols and adding comments. The resulting source uses cc65/ca65 to build and will generate a byte-for-byte identical v2021 PRG file – you can find this version in a branch.

Fixes

Then I started implementing fixes. In v2021, the standard user port driver was broken for PAL systems because of a bug in the lookup of the timings. Similarly, the UP9600 driver had a timing issue on PAL, but it was minor; but the fix may improve data transfer stability.

Features

Finally, I added features to the XMODEM transfer protocol:

The XMODEM-1K protocol has been added. This increases the block size to 1 KB (instead of 128 bytes) and will significantly increase throughput. Both regular checksum and CRC are supported with XMODEM-1K, and since the protocol specifies that the sender decides on the block size, CCGMS will accept 128 bytes and 1 KB blocks on receive, no matter the setting.
The XMODEM protocol specifies that the receiver decides whether a simple checksum or CRC16 should be used. The original code would only accept its own settings on uploads. For example, if CCGMS was set to regular “XMODEM” (i.e. no CRC16) and the sender used the XMODEM-CRC protocol, the transfer would fail. This has been changed to always accept the sender’s choice.

Because of the added flexibility, the upload and download prompts are now a little clearer about the current settings:

XMODEM/XMODEM-CRC Upload: forces 128 B blocks, will accept checksum or CRC (more compatible)
XMODEM-1K Upload: forces 1 KB blocks, will accept checksum or CRC (faster)
XMODEM Download: forces checksum, will accept 128 B or 1 KB blocks (more compatible)
XMODEM-CRC/XMODEM-1K Download: forces CRC16, will accept 128 B or 1 KB blocks (more reliable)

Download

The .PRG files for this release are available on GitHub.

Future

I set up a GitHub repository for the project, which is licensed under the terms of the 3-clause BSD license.

While I am working on further features, I am also more than happy to accept pull requests for features, bug fixes as well as clean up work!

Free Joystick Extension Cable to Build Your DB9 Competition Pro

Michael Steil — Wed, 02 Feb 2022 15:07:32 +0000

This article explains how to convert a “Competition Pro Extra USB” (which you can still buy new) to work with a C64, Amiga or Atari. For the conversion, you need a joystick extension cable like this:

I have a huge amount of them, with the wire colors described in the article, and I am happy to mail one (or more) to you for free within Europe. Reach out to mist64@mac.com/@pagetable.

[UPDATE] Converting the “Competition Pro Extra USB” to C64/Amiga/Atari DB9

Michael Steil — Thu, 06 Jan 2022 08:53:47 +0000

I updated the instructions to a USB Competition Pro to DB9, so you can use it with a C64, Amiga etc. They now include the new “V3” and “V04T” pinouts and were updated with the use of a joystick extension cable.

The Ultimate Commodore 1541 Disk Drive Talk [video]

Michael Steil — Thu, 16 Sep 2021 05:57:10 +0000

This is the video recording of “The Ultimate Commodore 1541 Disk Drive Talk” at VCF West 2021. As always, if you think it’s too fast, try watching it at 0.75x speed!

I will post the slides in Apple Keynote format later.

If you enjoyed this, you might also like my talks

[Announcement] The Ultimate Commodore 1541 Disk Drive Talk @ VCFW 2021

Michael Steil — Fri, 06 Aug 2021 16:25:38 +0000

After

The Ultimate Commodore 64 Talk (2008)
The Ultimate Game Boy Talk (2016)
The Ultimate Apollo Guidance Computer Talk (2017)

my fourth talk from the “Ultimate” series will take place at the Vintage Computer Festival West 2021 in Mountain View on 2021-08-08 at 12:00.

This talk discusses floppy disk drives, with the 5,25” Commodore 1541 as a case study. We discuss the history of magnetic recording formats, how data is represented on a disk and how it gets from the drive to the computer. We also talk about fast loaders, alternate recording formats, copy protection schemes, and how to preserve disks using modern tools.

Commodore's Assemblers: Part 5: 6502ASM

Michael Steil — Sun, 13 Jun 2021 18:32:44 +0000

In the series about the assemblers Commodore used for developing the ROMs of their 8-bit computers, this article covers the 1989 “Commodore 6502 Assembler” (6502ASM), a cross-assembler written in C that ran on VAX and PC.

Series Overview

History

To build the ROMs for their 8-bit computers, Commodore originally used an assembler that ran on their own 6502-based machines (part 2 of the series). From 1984 on, they used the “Boston Systems Office” (BSO) cross-assembler running on VAX/VMS (part 3).

In 1989, Commodore started working on the C65 project, a much enhanced successor to the C64. The C65 had a 4510 CPU, which supported the extended 65CE02 instruction set.

In order to be able to use the new 65CE02 instructions, they had several options:

Write a set of macros that wrapped the new instructions: This worked well for some instructions, but required non-standard syntax for other cases. Dennis Jarvis, who developed the C65 DOS, started out with this approach while developing using the Merlin 128 assembler. Here is an example for the INW instruction:

    INW MAC
     DFB $E3 ;INW BP
     DFB ]1
     <<<

Add 65CE02 support to an existing assembler: Three years earlier, Commodore had written the HCD65 assembler for C128 (part 4), and they did in fact add 65CE02 support to it. Unfortunately, developing with HCD65 was slow and cumbersome.
Write a new assembler from scratch: Using modern tools on a modern platform, writing a new assembler would not be too much work.

The “Commodore 6502 Assembler” (6502ASM") is a 6502/65CE02 cross-assembler written in C, written by Bob Norby of Commodore Semiconductor Group (CSG, formerly MOS). It aimed at full compatibility with the “BSO” cross-assembler on VAX (part 3), which Commodore had been using for all their projects before. It was developed on VAX (using VAX C V2.4-026) to allow for easy comparisons of its outputs with BSO’s.

In June 1989, Fred Bowen ported the source to MS-DOS. An Amiga version was mentioned in the 1991 C65 specification, but it is unclear whether the source was ever actually ported to an Amiga C compiler.

Usage

The 6502ASM was closely modeled after the BSO assembler, so it used the same command line interface. The user could pass the arguments either as command line options, or type it into a prompt, if no command line arguments were specified:

C65>PROG.OBJ,PROG.LST=PROG.SRC

C65> is the prompt printed by the assembler. It matches the prompt of the BSO assembler, and symbolizes “Cross assembler for 6502” – it has nothing to do with the Commodore C65 project.

This is the help text that is printed if /H is passed:

the assembler command line consists of the file names and switches

C65> [object],[listing]=source[,source]...[,source][/switch]...[/switch]

Items enclosed in brackets [...] are optional.

The default file extentions are .obj , .lst and .src

Switches  (either upper or lower case)
/A absolute assembly (default)
/Cn cpu instruction set /C0 for NMOS 6502   /C1 for CMOS 6502
     /C2 for CMOS 6502 w/bit instructions   /C3 for Commodore 4502 (default)
/Dpath specify path for intermediate file ( usually RAM disk on PCs).
/H help - prints this message
/L assume long branches on pass1. (default= assume short branches)
/Mnnn maximum macro nesting depth (default=50, limits=2-999).
/Pnn maximum number of passes to try (default=15, limits=2-99).
/N don't print errors to console during assembly
/R relocatable assembly - illegal since this is an absolute assembler only
/S narrow list format
/T don't print symbol table
/V don't print cross reference
/X print cross reference (default)

Details on the usage as well as the supported syntax are described in the unofficial manual.

Differences

6502ASM is very similar to the BSO assembler. It supports the same directives, and uses the same syntax for local symbols, conditional assembly and macros.

The only real difference seems to be the lack of the .IF directive, just like with HCD65. .IF seems to have been undocumented on the BSO assembler, which would explain its omission from both HCD65 and 6502ASM. That BSO supported it and treated it as a synonym for .IFN can be seen from its use in the TED KERNAL source.

Versions

6502ASM B0.0 (1989)

This is the MS-DOS binary of version B0.0, built with Microsoft C 5.00 for MS-DOS (1987):

6502ASM B0.0 (1989-06-14, 67233 bytes, from DJ 4502-asm-for-pc-7-89.img)

This version has a few bugs:

A ; character in a string terminates it.
If an operand is 8 bits wide and the instruction does not support the zero-page addressing mode (i.e. LDZ on the 65CE02), the assembler does not support falling back to absolute addressing. The following code would fail:

ldz $80

This issue prevents some C65 source code from building.
If a macro is used within an .INCLUDEd file, assembly is stopped after the current file. serlib.zip on zimmers.net contains an LST file generated by 6502ASM B0.0 from the original 1581 source that shows this problem. The LST file is incomplete; it ends after the included file “mrout”, and the remaining LST therefore shows 150 undefined symbols.
Projects like RAMDOS (a solution for exposing a RAM Expansion Units (REU) as a disk), the HCD65 assembler and its editor (EDT_C128) as well as DOS_SHELL use BSO assembler features extensively. 6502ASM fails to build it for various reasons.

The source code of 6502ASM B0.0 has been preserved through DJ 4502-asm-for-pc.img.

6502ASM V1.0 (1990)

Version V1.0 fixes a few bugs, including the ; and LDZ problems.

LST files of the source are included in c65_src.tar.gz. The reconstructed source is available as part of the cbmsrc repository.

Current Version (2021)

I adapted version V1.0 to current compilers so that it builds and runs on modern Unix systems. It is available on GitHub:

https://github.com/mist64/cbm6502asm

.IF has been added and the bug with macros within .INCLUDEs has been fixed. It can currently compile all known Commodore BSO source code, with the exceptions mentioned above. Contributions are welcome!

Use at Commodore

At the beginning of the C65 project, three different assemblers had been in use at Commodore:

Dennis Jarvis (DOS) originally had used Merlin 128, with added 65CE02 macros.
Fred Bowen (KERNAL, BASIC) had originally used the BSO assembler, possibly also with added 65CE02 macros.
The external contractor Walrus Software Inc. (graphics extensions for BASIC) had used HCD65.

All development was switched to 6502ASM in mid-1990.

The A2232 7-port serial card for the Amiga also contained a 65CE02 CPU. The project started in 1988, using the BSO assembler. The final version was built with 6502ASM on VAX¹.

This marks the end of the 5-part series on the assemblers used by Commodore.

The Amiga makefile in the driver source does not build the 65CE02 source, but only links an already built .OBJ file, which is a strong indication that 6502ASM did not in fact exist on the Amiga, and the source was built on VAX and transferred to the Amiga instead.↩

Commodore's Assemblers: Part 4: HCD65

Michael Steil — Sun, 06 Jun 2021 04:31:41 +0000

In the series about the assemblers Commodore used for developing the ROMs of their 8-bit computers, this article covers the 1987 “HCD65” assembler that ran on the C128.

Series Overview

History

To build the ROMs for their 8-bit computers, Commodore originally used an assembler that ran on their own PET machines (part 2 of the series). From 1984 on, they used the “Boston Systems Office” (BSO) cross-assembler running on VAX/VMS (part 3).

In 1986, Commodore started working on a clone of the BSO cross-assembler that was supposed to run on the C128. An internal document (c65.doc) in the original C128 source archive states:

HCD65XX is a powerful macro assembler syntactically identical to the assembler used to orginally develop the C128 source code running on a VAX-8600 under VMS. This tool is capable of assembling the same source files on a C128.

They announced the project in a usenet post to net.micro.cbm from April 1986:

There is a package developed in house that includes a macro assembler and a program editor that both take advantage of the C128 hardware. […] the object was to make it do everything that the VAX based cross assemblers can do, and to allow truely large assemblies. The editor is similar to the DEC EDT keypad editing mode.

It is too soon to know when or if this package will be available, but I am sure that it will be publicized on the various networks…

It was mentioned again in a post from October 1986:

We have a developers package containing a new macro assembler and editor that we hope to have available by year end. The assembler is compatible with the BSO 6502 Assembler that has been used by Commodore for software development of late.

The new assembler, called “HCD65” (in the manual) or “HCD65XX” (in the software), named after its developer Hedley Davis, was released in 1987 as part of the Commodore 128 Developer’s Package (manual, internal, internal (65CE02 version)).

Usage

The assembler consists of two components: A BASIC program to query the arguments from the user, and the non-interactive assembler binary. The BASIC frontend presents the following menu:

(C)1986 COMMODORE ELECTRONICS, LTD.
ALL RIGHTS RESERVED     V3.5
ENTER FILE NAME FOR SOURCE = KERNAL.SRC

PICK A CONFIGURATION
0) NO LISTING         : SOURCE,OBJ ON 8
1) LISTING TO SCREEN  : SOURCE,OBJ ON 8
2) LISTING TO UNIT 4  : SOURCE,OBJ ON 8
3) ERRORS ONLY        : SOURCE     ON 8
4) LIST/XREF TO UNIT 4: SOURCE,OBJ ON 8
5) NO LISTING         : SOURCE,OBJ ON 9
6) LISTING TO SCREEN  : SOURCE,OBJ ON 9
7) PRINT LISTING ON 4 : SOURCE,OBJ ON 9
8) ERRORS ONLY        : SOURCE     ON 9
9) LIST/XREF TO UNIT 4: SOURCE,OBJ ON 9

It allows entering a filename (the extension .SRC is added automatically) and configuration options that control whether to hide/show/print the LST, whether to generate a cross-reference and what drive to read the sources from. Pressing RETURN in the menu will allow the user to configure all options manually.

It then asks for the date string, since the C128 does not have a real-time clock, and asks the user to verify the settings:

(C)1986 COMMODORE ELECTRONICS, LTD.
ALL RIGHTS RESERVED     V3.5
ASSEMBLY OF     KERNAL.SRC
SOURCE FILES ON UNIT 9

LISTING OUTPUT = PRINTER
   NO CROSS REFERENCE
   DATE STRING ="2021-05-08"

ERROR OUTPUT = SCREEN
OBJECT OUTPUT = KERNAL.OBJ ON UNIT  9

PRINTER DEVICE NUMBER =  4
LIST AND ERROR FILE WIDTH = 80 COLUMNS


RUN THIS CONFIGURATION ? (Y/N)

Pressing Y will start the assembly.

Differences

HCD65 was designed to be as similar to BSO as possible, but there are a few differences:

HCD65 accepts both ASCII and PETSCII source files. When using existing BSO source files, it is important that the source files use CR line breaks and that the filenames referenced by .INCLUDE have the right case.
HCD65 is missing the .IF directive that BSO understands as a synonym to .IFN.

Versions

There are four known versions of the HCD65 assembler:

HCD65XX V3.1

Little is known about this version. The LST files that shipped with the C128 Devpack were built with this version.

HCD65XX V3.5

This is the version that shipped as part of the C128 Devpack.

HCD65XX V3.5 (BAS: 6613 bytes, BIN: 10498 bytes, from DJ devpack1of4.d64)

HCD65XX 65CE02 V0.1

In 1989, Commodore started writing the ROM for the upcoming C64DX/C65. The CPU of the machine was a 4510, which implemented the extended 65CE02 instruction set, so in order to use the additional CPU features, they needed an assembler that supported them.

The BSO/VAX cross-assembler that they had been using for all development at the time was written by a third party and Commodore did not have its source, so Fred Bowen added support to the C128 HCD65.

HCD65XX 65CE02 V0.1 (BIN: 11166 bytes, built from the source using HCD65XX V3.5)

HCD65XX 65CE02 V0.2

Version 0.2 is changes the AUG/MAP opcode from 65CE02 to the 4510 semantics, and adds an alternative syntax to the BBR/BBS mnemos.

HCD65XX 65CE02 V0.2 (BAS: 6769 bytes, BIN: 11156 bytes, from DJ asm45.d81)

Source

The sources of the V3.5 and 65CE02 0.1 and 0.2 versions are available as part of the Commodore Source collection:

https://github.com/mist64/cbmsrc

HCD65_3.5
HCD65_65CE02_0.1
HCD65_65CE02_0.2

Use at Commodore

It is unclear why Commodore developed HCD65 in the first place.

It is unlikely that they developed HCD65 mainly for a commercial release. There were plenty of assemblers on the market, and BSO compatibility is not something users were looking for.

Another hypothesis would be that they wanted to replace the BSO cross-assembler for in-house use.

In 1986, they were using BSO on a 12.5 MHz VAX-8600. The original PDP-10 version of BSO’s tools were implemented in hand-written machine code, but the later PDP-11/VAX/ULTRIX versions were probably written in a high-level language. There might have been a chance that a new assembler written in well-optimized machine code for the 2 MHz C128 could have matched the speed of the BSO assembler, but the C128’s I/O system (even using the new Fast Serial bus) would have certainly ruined that.

So they wouldn’t have replaced BSO for a speed benefit or added convenience. Original LST files that have been preserved show that Commodore used BSO, not HCD65, for all 6502-based projects until 1990 (C64GS).

One realistic reason for developing HCD65 would be as an insurance policy: to have an in-house alternative to BSO, so they would not be dependent on a third party. The in-house solution could also be altered and extended, especially in the context of the upcoming 4510/65CE02 CPU. The 65CE02 project, an effort to re-design the 6502 in CMOS, started in late 1985. And in fact, 4510/65CE02 support was added to HCD65 in 1989.

The graphics extensions to BASIC 10 by the external contractor Walrus Software Inc. were developed using HCD65 65CE02, and so was the C65 DOS ROM for a while, before all development switched to the “Commodore 6502ASM” cross-assembler in 1990.

The next article will discuss this new cross-assembler that Commodore developed in 1989.

Commodore's Assemblers: Part 3: BSO CY6502

Michael Steil — Sat, 29 May 2021 17:37:59 +0000

In the series about the assemblers Commodore used for developing the ROMs of their 8-bit computers, this article covers the 1984 “Boston Systems Office” (BSO) cross-assembler running on VAX/VMS.

6502 relocating cross assembler version 20.53.12
Copyright, The Boston Systems Office, Inc. 1978
(617) 894-7800   TWX: 710-324-0760
Type /H for help

Series Overview

History

Commodore bought the chip-maker MOS in 1976 and with it its development tools: They used the so-called MOS “Resident Assembler” (part 2 of the series) – first on the MDT650 and later on the PET – to develop for machines like the VIC-20 and the C64, and for disk drives like the 8250 and the 1541.

In 1984, they switched to a cross-assembler by the company “Boston Systems Office” (BSO).

Boston Systems Office

Boston Systems Office had been specializing on cross-development tools since 1975, offering dozens of cross-assemblers for Tymshare systems that were hand-written in PDP-10 assembly for maximum speed, and that aimed at full compatibility with the CPU-makers’ own tools.

Since 1976, they had been offering a 6502 assembler. The 1979 version of the BSO 6502 cross-assembler, called “CA6500” (documentation, PDP-10 binary) was highly compatible with the original MOS assemblers, and had added a few features.

According to internal documentation, Atari had used CA6500 on a PDP-11 for their in-house 6502 development as early as mid-1980.

In early 1984, BSO released a version for PDP-11 and VAX/VMS named CY65XX¹, which was then licensed by Commodore. This version had many additional features added.

No binary of the assembler seems to be archived anywhere, but we do have the manual of the last version Commodore used:

CY6502 Version 20.53.12 Manual (1985)

Usage

When run, the BSO assembler shows the following prompt:

6502 relocating cross assembler version 20.53.12
Copyright, The Boston Systems Office, Inc. 1978
(617) 894-7800   TWX: 710-324-0760
Type /H for help
C65>

The output file (ABS) file, the LST file and the SRC file would be specified like this:

C65>PROG.ABS,PROG.LST=PROG.SRC

The file name extensions are optional. If they are omitted, the defaults are used.

The output file is in BSOs’ own “ABS” format, and needs another conversion step using the tool OBJCNV:

Object file converter  Version 3.25
Copyright, The Boston Systems Office, Inc. 1978
TEL: (617) 894-7800 TWX: 710-324-0760
Type /H for help
>

To convert the resulting ABS file into the MOS “OBJ” format, you would type this at the prompt:

>PROG.OBJ/F:MTK=PROG.ABS/F:BSO

MTK stands for MOSTEK, a chipmaker unrelated to MOS Technology, Inc. BSO seems to have had confused the two companies: ABS files also contain the string 6502,MOSTEK.

In practice, one would use a DCL script to invoke both tools with the right arguments.

Differences to Original MOS Spec

The BSO assembler diverged in two aspects from the original MOS syntax:

Labels have to start at column 1, so lines with labels cannot have any leading whitespace. But it is still legal for statements to also start on column 1: The assembler can tell them apart, since all mnemos are reserved keywords, which can’t be used as labels.
The .OPT directive is not supported. Several of the options are supported through dedicated directives though. See below.

Differences to Updated Resident Assembler

The syntax of both the Resident Assembler and the BSO cross-assembler are based on the original MOS syntax, but some features that are supported by later versions of both have diverged.

Include Syntax

The Resident Assembler uses the .LIB directive to include source files. The BSO assembler uses the .INCLUDE directive:

.include disclaimer

Macro Syntax

Macros for the BSO assembler look like this:

dpinc  .macro arg    ;double precision increment
       inc arg
       bne 1$
       inc arg+1
1$     .endm

The name of the macro is defined as a label. Arguments can be any string of characters after the .MACRO directive, and are blindly searched and replaced in the macro’s body. .ENDM marks the end of the body.

Conditional Assembly Syntax

Conditional assembly on the BSO assembler uses an .IF/.ELSE/.ENDIF construct, like this:

.ifn gdebug
       jsr grbpri      ;garbage debug
.endif

New Features

There are a number of additional features supported by the BSO assembler compared to the original MOS assemblers.

Text Format

Source files can now use the complete ASCII character set, as opposed to just uppercase. This means that comments can use mixed case and that TABs can be used for indenting.

In addition, labels, statements and operands are now case insensitive. In practice, all Commodore BSO source code uses lower case for those.

Local Labels

Local labels are in the form of a number from 1 to 255 followed by a $ sign, e.g.:

100$    ; this is legal
001$    ; this is the same as the next
1$      ; this is the same as the previous
999$    ; this is illegal (1-255).

The range of local labels is delimited by global labels as well as the .LOCAL directive.

Additional Directives

The BSO assembler supports a number of additional directives:

.NAME/.TITLE: set name of project
.SUBTTL: set name of section (like .PAGE on MOS)
.SKIP/.SPACE: skip lines in LST
.FORMLN: set number of lines per page
.LIST/.NLIST: enable/disable LST (like .OPT LIST)
.CLIST/.NCLIST: show/hide disabled conditional assembly in LST
.MLIST/.NMLIST/.BLIST: control macro expansions in LST
.GEN/.NOGEN: enable/disable extra lines in LST (like .OPT GENERATE)
.INCLUDE: include file
.MACRO/.ENDM: macro definition (existed on MOS, but different syntax)
.REPT/.ENDR: repeat
.IRP/.IRPC: extended repeat
.IFE conditional assembly if == 0 (existed on MOS, but different syntax)
.IFN/.IF conditional assembly if != 0 (existed on MOS, but different syntax)
.IFGE conditional assembly if is >= 0
.IFGT conditional assembly if is > 0
.IFLT conditional assembly if is < 0
.IFLE conditional assembly if is =< 0
.IFB/.IFNB conditional assembly if (not) blank
.IFIDN/.IFNIDN conditional assembly if strings are (not) identical
.IFDEF/.IFNDEF conditional assembly if symbol is (not) defined
.LOCAL: delimit the range of local labels
.MESSG: print message during pass 2
.RMB: reserve memory byte
.RADIX: change default radix for literals without a prefix

All these features are also documented in the “Commodore 128 Developer’s Package”, which cloned the BSO assembler for the C128. (More details in part 4 of the series.)

Relocating

It was possible to create relocatable object files with the BSO assembler. Commodore never made use of this feature. The following directives control this:

.AORG/.RORG/.ZORG
.SECT/.ASECT/.RSECT/.ZSECT
.EXTERN/.INTERN
.LINK

LST Files

Listing files had an updated format. Here is an example:

1581 DOS v10  318045-01  (c)1987 CBM    CR6502/11 version 20.53.12  19-Mar-87  20:17:11 Page 26
"copy"   COPY.SRC

Error Addr  Code          Seq   Source statement

                         1755   ; copy file(s) to one file
                         1756
     87A2   20 82B9      1757   copy    jsr  lookup     ; look ip all files
     87A5   AD 022F      1758           lda  f2cnt

The header distinguishes between the project name (.NAME, first field of first line), the section name (.SUBTTL, first field of second line) and the filename (second field of second line), and contains the assembler’s name and the current date.

The body lines have a new first column that can contain one or more characters that indicate errors in the line, e.g. U for undefined symbol. This is instead of added lines with error messages.

Use at Commodore

From the headers of the Commodore LST that have been preserved, we can see that Commodore was using at least the following versions:

version 10.36.6 since at least July 1984 (TED KERNAL and Char ROM)
version 20.51.10 since at least August 1984 (TED BASIC, 1570, original C128 ROM)
version 20.53.12 since at least December 1985 until July 1990 (later C128 ROM, 1551, later 1541/1541C/1541-II, 1571, 1571CR, 1581, C64GS)

CY6502 was available for VAX/VMS and PDP-11 systems. Everything points towards Commodore using VAX hardware. The original C128 source archive contains a VAX reference dated October 1984 (file monitor.sum). A different file (c65.doc) in the C128 source archive mentions that the C128 ROM was developed on a VAX-8600 (12.5 MHz, 4+ MB RAM; released in October 1984).

According to a usenet post to net.micro.cbm from December 1985, Commodore had multiple VAX systems at least as early as December 1985:

[…] which consists of the cbmvax 11/750 and a host of VMS Vaxen, Sun’s and developmental systems at both Commodore and Commodore [Amiga].

(cbmvax was the UUCP-connected machine at Commodore. Usenet identifiers and (later) email addresses of Commodore employees contained this machine name.)

Another document (howto.doc) in the source archive describes how the result was transferred from the VAX to the Commodore computer: VAX computers were multi-user, and each developer had their own VT100-like terminal connected through RS-232. Each terminal had a second AUX/printer RS-232 connection that was connected to the Commodore computer (at 9600 baud if it had an ACIA chip). The tool “DOWNLOAD” on the VAX then sent the contents of the OBJ file through the terminal to the Commodore computer, which used a tool named “RSX” to receive the data.

The next article will discuss the HDC65 assembler on C128 that Commodore built in 1986 as a clone of the BSO assembler.

The BSO 6502 cross-assembler seems to have many names. The 1976 documentation called it CA6500: Back then, the 6502 was part of the 6500-series that consisted of the 6501 and the 6502. The version Commodore used 8 years later was called CY6502 in the documentation, but in LST files, it called itself CR6502/11: The “11” must have stood for either PDP-11 or VAX-11, or both. The “R” could stand for “relocatable”, which seemed to have been a recent feature.↩

Commodore's Assemblers: Part 2: MOS Resident Assembler

Michael Steil — Sat, 22 May 2021 06:19:07 +0000

In the series about the assemblers Commodore used for developing the ROMs of their 8-bit computers, this article covers the 1976 “MOS Resident Assembler” that ran on a variety of 6502-based computers.

Series Overview

Part 0: Overview
Part 1: MOS Cross-Assembler
Part 2: MOS Resident Assembler ← this article
Part 3: BSO CY6502
Part 4: HCD65
Part 5: 6502ASM

History

MOS Technology, Inc. released two assemblers for the newly introduced 6502 architecture: the “Cross-Assembler” (1975), available for various mainframes and minicomputers, and the “Resident Assembler” (1976), running natively on 6502 systems.

The Resident Assembler was written by Michael Corder (of MOS contractor COMPAS Microsystems) by hand-assembling the Cross-Assembler FORTRAN code to native 6502 assembly.

Consequently, both assemblers were compatible in that they understood the same source format, with the same math features¹ and the same directives and options. That way, they defined the basic format supported by all future Commodore assemblers.

This means that the Resident Assembler took uppercase ASCII source files and created ASCII-encoded OBJ output files. It could even create the same LST output, but it was sent to the screen or the printer instead of a file.

Versions and Platforms

The Resident Assembler ran on almost all 8-bit computer systems by MOS and Commodore.

MDT650

MOS built two computers for demonstration of and development for their new 6502 CPU: the MDT650 and the KIM-1.

The very first version of the Resident Assembler targeted the MDT650 (“Microcomputer Development Terminal”; 1976), a development computer by MOS that contained two 6502 CPUs: One to develop software, and one to test it on. Its ROM came with an editor, the Resident Assembler and a disassembler. Source was saved on tape, and later on 8″ disks.

According to the manual, the feature list of this version of the Resident Assembler was the same as that of the Cross-Assembler, except:

no multiplication or division
no error files
no cross references
no symbol table sorting
no error messages; but errors numbers instead (1-23)

No ROM dumps are known.

KIM-1

The KIM-1 (1976) was a single-board computer originally intended as a demonstration board for the 6502, but with additional hardware, it could also be used as a full computer. For this, MOS released hardware like the KIM-2/KIM-3 RAM extension and the KIM-4 motherboard.

In 1977, Commodore announced the Resident Assembler for the KIM-1, which was supposed to ship in the form of the KIM-5 ROM board with three 6540 ROM chips (6 KB total) containing the assembler and a simple editor. The same year, they announced that the project was being postponed indefinitely, but it did appear in a price list in 1979.

There is a 1977 manual, which does not mention the .DBYTE, .PAGE and .SKIP directives. These were probably just undocumented, since the smaller AIM-65 version (see below) does support them.

The assembler could read source from cassette tape or paper tape, but it could not write OBJ files; instead, it would write the binary to RAM.

No ROM dumps are known.

(A company named “ARESCO” offered a 6 KB RAM-based assembler for the KIM-1 that may have been based on the MOS Resident Assembler. According to the documentation of the Apple II version, the supported directives and error codes matched the MOS Resident Assembler, and some sources call it the “MOS/ARESCO” assembler. Unfortunately, no dumps are known.)

AIM-65

The AIM-65 (“Advanced Interactive Microcomputer”, 1978) was a single-board computer similar to the KIM-1, sold by the 6502 second source Rockwell. A very stripped down and optimized 4 KB version of the MDT650 version of the Resident Assembler was available for this platform.

The AIM-65 is not a MOS/Commodore device, and it was not used internally at Commodore, but its version of the Resident Assembler is presumably based on the (lost) KIM-1 version.

Chapter 5 of the AIM-65 User’s Guide covers the assembler.

AIM-65 Resident Assembler R3224 (1978)

PET

After the release of the CBM 2040 disk drive for the PET in 1978, John Feagans (of KERNAL fame) ported the MDT650 version of the Resident Assembler to the new platform.

The oldest known version is undated:

PET Resident Assembler (undated, 6696 bytes, from DJ old-dos-sources.d81); for BASIC 2

The next known versions are the only ones publicly released:

PET Resident Assembler V112779 (1979-11-27, 7426 bytes); for BASIC 2
PET Resident Assembler V121579 BASIC4 (1979-12-15, 7546 bytes); for BASIC 4

These two binaries were published as part of the Commodore PET Assembler Development System in 1980.

They add multiplication and division support, as well as textual error messages. So except for the missing cross-reference support, this version basically has feature parity with the MOS Cross-Assembler. The manual is based on the KIM-1 Assembler Manual, and adds descriptions of the OBJ format as well as the added features. Conditional compilation (.IFN/.IFE) is undocumented, but supported by the BASIC 4 version of the assembler.

There are more versions, all of which have only been used only internally:

PET Resident Assembler V121579 BASIC2 (1979-12-15, 7546 bytes, from RB 6502ASM/MASTER.D80, UTIL001.D80, UTIL003.D80, UTIL004.D80, UTIL008.D80, UTIL009.D80); for BASIC 2: only direct calls into the PET ROM differ
PET Resident Assembler V090580 A (1980-09-05, 7858 bytes, from cbmhardware.de); adds cross-reference support
PET Resident Assembler V090580 B1 (1980-09-05, 7938 bytes, from DJ b128-editor-1of1.d64); later (!) version with optimizations
PET Resident Assembler V090580 B2 (1980-09-05, 7822 bytes, from RB UNKN003.D80, UTIL008.D80, UTIL009.D80); identical to B1, but for 72 lines per page instead of 66
PET Resident Assembler V102780 (1980-10-27, 8049 bytes, from DJ c64kernal.d64); detects BASIC 2/4 at runtime

These two versions are non-mainline:

PET Resident Assembler V050482 (1982-05-04, 7624 bytes, from RB UTIL008.D80); binary-patched version of V121579 BASIC4, asks “PAPER LENGTH (CR/66 OR 72)?”
CBM 6502 Copyright Assembler V26MAR82RR (1982-03-26, 7982 bytes, from DJ b128-kernal-2of2.d64); based on V090580 B, calls itself “CBM 6502 COPYRIGHT ASSEMBLER”; changes to LST format

C64

There is one known version of the Resident Assembler for the C64:

CBM Resident Assembler V080282 (C64) (1982-08-02, 9714 bytes)

It was released on disk as part of the Commodore 64 Macro Assembler Development System. The manual is based on the PET version, and adds documentation of the macro (see below) and cross-reference features, but does not document conditional compilation. An earlier version of the manual, which is part of the Commodore 64 Development Kit internal document, does document conditional compilation, but not macros.

TED Series (C16, C116, Plus/4)

Commodore also ported the assembler to the TED series, but never released it publicly:

Commodore Assembler V2.0 (TED) (1984-11-23, 10550 bytes, from DJ cbm-assembler.d64)

This binary was built from the original source code. Unlike all earlier and later versions, this one does not interactively ask for its arguments, but relies on a BASIC program to place the arguments in memory before calling the machine code.

C128

And finally, there is a C128 version:

C/128 6502 ASSEMBLER V022086 (1986-02-20, 10498 bytes, from DJ testing-c65-dos.d81)

It was based on the TED version, but went back to being standalone and asking for the arguments itself.

This version, too, was not released publicly.

New Features

While the MDT650 version of the Resident Assembler had no additional features compared to the Cross-Assembler, new features were added to later versions.

KIM-1/AIM-65

The AIM-65 version (and presumably the KIM-1 version) of the Resident Assembler added one new directive:

.FILE: switch to different source file, do not return

The size of the source of a more complex program could easily exceed the amount of RAM installed in a KIM-1/AIM-65. For instance, the 4 KB version of the Resident Assembler itself was, without comments, 24 KB in size, much larger than the typical amount of RAM in a KIM-1 or AIM-65.

While the powerful computers that the Cross-Assembler ran on had text editors that could handle files larger than the available RAM, the KIM-1/AIM-65 editors could not do this.

Therefore, the assembler allowed splitting the source into multiple files, which were then concatenated during the build.

PET V112779 (1979-11-27)

Version V112779 of the PET Resident Assembler had the following additional features:

.LIB: switch to a different source file (and unlike .FILE, return to the original source file after the end of the included file)
.OPT SYMBOL – NOSYMBOL: add symbol table to the end of the LST

PET V121579 (1979-12-15)

Version V121579 added conditional assembly:

.IFE: conditionally assemble if expression == 0
.IFN: conditionally assemble if expression != 0

The code guarded by the condition has to be between < and >:

.IFN GDEBUG <
       JSR GRBPRI      ;GARBAGE DEBUG
>

C64 V080282 (1982-08-02)

The C64 version adds support for macros. The manual contains the following example:

       .MAC DPINC    ;DOUBLE PRECISION INCREMENT
       INC ?1
       BNE ?2
       INC ?1+1
?2     .MND

The argument of .MAC is the name of the macro. The macro contents are defined between .MAC and .MND. Arguments are named ?1 etc., and local variables can be declared from the same namespace.

No known Commodore source code used this feature.

TED (1984-11-23)

The TED version adds the following .OPT arguments:

LONG/NLONG: allow long labels
MLI/NMLI: enable expanding macros in LST

Also, expressions can contain & (bitwise and), . (bitwise or) and ! (bitwise exclusive or).

C128 V022086 (1986-02-20)

The C128 version removes support for mnemo statistics (.OPT COUNT/CNT/NOCOUNT).

Source Code

The source code of the TED and C128 versions was preserved as part of the TED and C128 Source archives. it was added to the Commodore Source collection:

https://github.com/mist64/cbmsrc

The sources of the AIM-65 version, all PET versions as well as the C64 version have been reconstructed from the TED source and are also available in this repository:

ASSEMBLER_AIM65_REC
ASSEMBLER_PET_REC
ASSEMBLER_PET_V112779_REC
ASSEMBLER_PET_V121579_REC
ASSEMBLER_PET_V090580_A_REC
ASSEMBLER_PET_V090580_B_REC
ASSEMBLER_PET_V102780_REC
ASSEMBLER_PET_V26MAR82RR_REC
ASSEMBLER_C64_REC
ASSEMBLER_TED
ASSEMBLER_C128

Bugs and Quirks

The C64 version has at least two known bugs:

Commodore Disk User 1990-08 contains an article that fixes some problems with macros.
A thread in comp.sys.cbm discusses and fixes a problem with multiplication.

It remains to be researched whether the TED source and the C128 binary contain these bugs.

Also, there is a confusing error message:

**RAN OFF END OF CARD

This means that the assembler was expecting more input, but encountered the end of the line. Back in the mainframe days, lines were also called “cards”, since one punch card stored a single input line.

The Cross-Assembler used to call lines “cards”, as can be seen in the LST headings:

 CARD # LOC     CODE        CARD
   10  0009  B1 0E      NEXT    LDA (SAVIL)Y

The Resident Assembler had dropped the “card” nomenclature on the PET though:

LINE# LOC   CODE        LINE
   10  0009  B1 0E      NEXT    LDA (SAVIL)Y

…except for the single remaining case of the error message.

Use at Commodore

Commodore used the Resident Assembler on the MDT650 (with an 8″ disk drive) to develop the ROM of the original PET and the CBM 2040 (dual 5.25″) disk drive. At the time, the two MDT650 systems at Commodore were a scarce resource, so as soon as the CBM 2040 drive was available, the Resident Assembler was ported to the PET, and thus 6502 software development was switched to PETs.

The software Commodore built on the PET was:

KERNAL/EDITOR and BASIC of later PETs, the CBM2, VIC-20 and C64
the DOS ROMs of the 4040, 8061/8062, D9060/D9090, 8050/8250/1001, 2031, 1540 and the original 1541
the ROMs of the 6502-based printers
Commodore-developed applications and games, like Gorf and Omega Race

A large amount of Commodore source since 1980 (VIC-20, 1540, CBM 4040, …) has been preserved, and all source until 1984 (just before the TED series) is in a format that can be built with the Resident Assembler (V121579 or above).

The next article will discuss the BSO cross-assembler on VAX that Commodore switched to in 1984.

Multiplication and division were missing from the original version.↩

Commodore's Assemblers: Part 1: MOS Cross-Assembler

Michael Steil — Sat, 15 May 2021 05:30:19 +0000

In the series about the assemblers Commodore used for developing the ROMs of their 8-bit computers, this article covers the 1975 “MOS Cross-Assembler”, which was available for various mainfraimes of the era.

Series Overview

Part 0: Overview
Part 1: MOS Cross-Assembler ← this article
Part 2: MOS Resident Assembler
Part 3: BSO CY6502
Part 4: HCD65
Part 5: 6502ASM

History

MOS Technology, Inc. released two assemblers for the newly introduced 6502 architecture: the “Cross-Assembler”, available for various mainframes and minicomputers, and the “Resident Assembler”, running on natively on 6502 systems.

According to Norm Farrington, MOS contracted with the company COMPAS to write the Cross-Assembler. COMPAS specialized in 6502-related software and hardware and developed some of the official MOS peripherals for the KIM-1.

The Cross-Assembler was written in FORTRAN¹ and used a 6-bit (i.e. all uppercase) character encoding. On a CDC Cyber 175 system, it required 120K (!) 60-bit words of memory to run and had “generally acceptable” response times.

The Resident Assembler (part 2 of the series) was then developed (also by COMPAS) using the Cross-Assembler, and designed to be compatible in that it understood the same source format, with the same math features and the same directives and options. This way, MOS defined the basic format supported by all future Commodore assemblers.

Platforms

Today, you would expect a cross-assembler to run on Linux, Windows or Mac. But these were different times: When the 6502 was released, microcomputers were just appearing – and the 6502 was a part of this shift – and even the first fridge-sized minicomputers like the PDP series had only been introduced 10 years earlier. Most companies that used computers at the time used terminals that dialed into mainframes hosted in computing centers. And the market was very fragmented.

The 6502 Programming Manual (6500-50A) states:

The Cross Assembler is available on various time share systems or for batch use on the user’s system.

According to the 6502 Cross Assembler Manual (6500-60P), the supported platforms as of August 1975 were:

GE Timesharing, running on GE/Honeywell mainframes
National CSS time-sharing, running VP/CSS on IBM System/360 and System/370 mainframes

And the 1975 MOS Technology marketing brochure:

Current plans involve having the software available on several of the more popular Time Sharing services.

In addition, it will be available for deck sales. Batch decks for the CDC, IBM, and PDP-11 class machines are available and we will support several other popular mini and major computer systems in the near future.

Furthermore:

The 1975 MCS6500 Microprocessor Software Support brochure shows the Cross-Assembler running on the United Computing Systems (UCS) time-sharing service.
A 1976 dissertation shows that the Iowa State University ran the Cross-Assembler locally on an IBM 360/370.
A 1977 magazine article mentions support for PDP-8, PDP-10 and PDP-11.

Basic Syntax

Back then, not all computer systems used the ASCII encoding, and some computers didn’t even support lower case. The encoding of source files is therefore specific to the platform the Cross-Assembler runs on, and only uppercase characters were allowed.

Here is an example from the manual:

;
; 650X CROSS ASSEMBLER SAMPLE PROGRAM.
;
 *=$C000     DEFINE ORIGIN.
 LDX #$FF    SET UP STACK.
 TXS         LOAD STACK POINTER.
 LDA #$F0    LOAD A WITH HEX F0.
 STA ASAVE   SAVE A IN ASAVE.
;
; ALLOCATE SAVE AREA.
;
 *=$0000
ASAVE *=*
 .END

Lines starting with ; are comments and will be ignored.
Labels start at the first column, everything else is indented.
Comments may follow the operand or the operand-less mnemonic. No ; is necessary.
The Cross-Assembler uses the *= syntax to define the current assembly address.

The rule about the start columns of labels and assembly statements is actually more relaxed, as this very compressed example from the C64 KERNAL shows:

;COMMAND SERIAL BUS DEVICE TO LISTEN
;
LISTN ORA #$20 ;MAKE A LISTEN ADR
JSR RSP232 ;PROTECT SELF FROM RS232 NMI'S
LIST1 PHA

Since all mnemos and register names are reserved keywords and cannot be used for labels, the assembler does not enforce indenting for assembly statement. The JSR RSP232 starting at the first column is legal. In fact, even labels may be indented. This example prepends comments after statements with ;, which is legal because the assembler ignores everything after the statement anyway.

If you want to go for maximum readability (and don’t care about the size of the source), you could also indent the above example like this:

; COMMAND SERIAL BUS DEVICE TO LISTEN
;
LISTN  ORA #$20       ; MAKE A LISTEN ADR
       JSR RSP232     ; PROTECT SELF FROM RS232 NMI'S
LIST1  PHA

Labels can be up to 6 characters in length, so one could use 7 character indents for statements so that they always line up. That’s also how the LST output is formatted, which will be described later.

Assembly Statements

The accepted syntax of assembly statements matches the one in the 6502 Programming Manual. This includes the syntax for statements that take the accumulator as the argument:

ASL A
LSR A
ROL A
ROR A

Modern assemblers usually allow omitting the A; the Cross-Assembler does not.

One additional feature is an alternative syntax to the indirect, y-indexed addressing mode. In addition to

LDA (PNT),Y

the following syntax is accepted:

LDA (PNT)Y

Expressions

Operands of assembly statements can use hexadecimal ($), octal (@), binary (%) and decimal (no prefix) constants. Mathematical expressions using +, -, * and / are possible, but they are always evaluated left-to-right with no operator precedence and no parenthetical grouping. Character/string literals are prefixed with '.

Directives

The Cross-Assembler understands the following directives:

.BYTE: store one or more bytes
.WORD: store one or more words (little endian)
.DBYTE: store one or more words (big endian)
.PAGE: optionally set a section title, and cause an LST page break
.SKIP: insert a number of blank lines into the LST
.END: stop assembly; not required but suggested at end of file
.OPT: set or clear a list of options
- XREF – NOXREF: add a cross-reference to the end of the LST
- ERRORS – NOERRORS: write errors into separate file
- COUNT/CNT – NOCOUNT: add mnemo statistics to the end of the LST
- LIST – NOLIST: enable LST output
- MEMORY – NOMEMORY: enable writing object file
- GENERATE – NOGENERATE: verbose printing of character strings in the LST

Only the first three characters after the period are actually checked.

LST File

Like most development tools from the 1970s, the assembler can create a so-called listing file (suggested file extension .LST) during assembly that shows the source and the generated bytes side-by-side and is meant to be printed on paper. Here is a example from the manual:

 CARD # LOC     CODE        CARD
    1                   CR=15
    2                   LF=12
    3                   ; LOW CORE DATA AREAS
    4  0000  E7 06      TEMTBL   .WORD G3TEM, G1TEM
    5  0002  E7 05
    6                   GROUP=B10
    7  0004  00         THI     .BYTE 0
    8  0005  00         TLO     .BYTE 0
    9  0006  00 00 00   3PER    .WORD 0
***** ERROR ** LABEL DOESN'T BEGIN WITH ALPHABETIC CHARACTER - NEAR COLUMN 1
   10  0009  B1 0E      NEXT    LDA (SAVIL)Y
[...]
  269  07C9  C9 3B              CMP #';
  270  07CB  00 00              BEQ DONE
*****  ERROR ** UNDEFINED SYMBOL - NEAR COLUMN 18
  280                           .END

END OF MOS/TECHNOLOGY 650X ASSEMBLY VERSION 4
NUMBER OF ERRORS =    2,   NUMBER OF WARNINGS =    0

The first column (CARD #) is the line number in the source. The LOC field is the memory address, which is followed by the output bytes (CODE) and the source line (CARD), which the assembler re-indented for readability.

The CARD nomenclature stems from 1960s mainframes, where each line of text was represented by one punch card.

Error messages are shown as extra lines after the line that caused the error. Note that the assembler will keep working through the file no matter what, and will output placeholder bytes for lines with errors.

The original LST printout of the KIM-1 ROM, which was part of the user manual, is a real-world example of a 1200-line LST file, with the symbol table and the mnemo statistics (.OPT COUNT) at the end.

OBJ File

The main output of the assembler is the binary program, which is in the form of a so-called “interface file” with a suggested file extension of .OBJ.

The diverse set of platforms that the Cross-Assembler ran on all had different word sizes, and many of them measured memory only in words and did not even have a concept of (8-bit) “bytes”. Therefore, the assembler could not output a binary file, but instead wrote a portable, hex-encoded text file, like this:

;18E500A200A0DC60A228A01960B00786D684D3206CE5A6D6A4D3600D8C
;18E51820A0E5A9008D910285CFA9488D8F02A9EB8D9002A90A8D890C62
 [...]
;06FFFA43FEE2FC48FF0665
;0001200021

Every line consists of the following characters:

;: first character of each record
2 chars: number of data bytes to follow
4 chars: load address of first byte of line
2 chars (repeated): data byte
4 chars: checksum

The last line is
* ;00: identifier for last line
* 4 chars: number of preceding lines
* 4 chars: checksum

The checksum is calculated by adding all preceding bytes of the line together.

This text-only file is platform-independent and can easily be transferred between different computer systems, and e.g. downloaded from a time-sharing system in order to write it to an EPROM.

Use at Commodore

The Cross-Assembler was used at MOS to create the very first 6502 code, like the KIM-1 ROM (shown above), or the TIM ROM (MCS6530-004).

A large amount of the original Commodore source code has been preserved, and all code before 1984 is in a format very similar to the original MOS definition supported by both the Cross-Assembler and the Resident Assembler. So at first sight, it is not so clear which assembler Commodore used for developing ROMs of the PET, VIC-20 etc. and the disk drives.

The next article in the series will discuss this topic further.

An earlier version of the article stated that the original MOS Cross-Assembler had been implemented by a grad student at the University of Illinois at Urbana-Champaign. In fact, the linked paper is only about the COMPAS/MOS Cross-Assembler as it was used on the university’s CDC computer.↩

Commodore's Assemblers: Overview

Michael Steil — Sun, 09 May 2021 11:06:01 +0000

Commodore used 5 different assemblers, most of them in-house tools, to build the ROMs for their Computers like the PET, the C64 and the C128. Nevertheless, all Commodore source files, from 1975 to 1990, share a common format and use the same assembly directives. This series of articles describes each of these assemblers.

Year	Company	Assembler	Platform	Encoding
1975	MOS	6502 Cross-Assembler	GE, NCSS time-sharing, …	various (upper case)
1976	MOS	Resident Assembler	MDT650, KIM-1, PET, C64, CBM2, TED, C128	ASCII (upper case, CR)
1984	BSO	CY6502	VAX	ASCII (mixed case, CRLF)
1986	Commodore	HCD65	C128	PETSCII (mixed case, CR)
1989	Commodore	6502ASM	VAX, Amiga, PC	ASCII (mixed case, LF/CRLF)

Series Overview

Cross-Assembler and Resident Assembler

In late 1975, MOS Technology, Inc. introduced the 6502 CPU and in 1976, they released the KIM-1, a demonstration/development platform for the 6502. Commodore bought MOS in November 1976, and the 6502 and the KIM-1 became Commodore products.

MOS also developed two assemblers for the 6502:

The “Cross-Assembler” (1975), available for various mainframes and minicomputers.
The “Resident Assembler” (1976), running on 6502 systems. It was ported to all Commodore 8-bit computers. The C64 version was sold as the “C64 Macro Assembler” in 1982.

Both assemblers were compatible in that they understood the same source format, with the same math features and the same directives and options.

BSO CY6502 (VAX)

In mid-1984, Commodore switched to “CY6502” by the company Boston Systems Office (BSO), a cross-assembler running on VAX/VMS systems that was highly compatible to the MOS assemblers, but more advanced.

HCD65 (C128)

In 1986, Commodore wrote a new assembler named “HCD65” for the C128 that aimed at full compatibility with the BSO assembler. They sold it as part of the Commodore 128 Developer’s Package in 1987. In 1989, as Commodore worked on the ill-fated C65, they added support for the extended 65CE02/4510 instruction set.

6502ASM (VAX, Amiga, PC)

Also in 1989, and also for the C65 project, they wrote a new cross-platform assembler from scratch to replace the BSO one on VAX/VMS. It was supposed to be fully backwards-compatible and support the 65CE02/4510 instruction set from the start.

Others

There are two more assemblers that were used to develop the ROMs of Commodore computers that don’t really count as in-house tools:

MACRO-10 (PDP-10)

All Commodore 8-bit computers shipped with a version of Microsoft BASIC. Microsoft had used a PDP-10 mainframe for cross-developing the BASIC interpreter. Instead of writing a cross-assembler from scratch, they reused the MACRO-10 assembler that came with the PDP-10 and defined a set of macros that emitted 6502 opcodes. The article Microsoft BASIC for 6502 Original Source Code [1978] has more information.

For the first two versions of the PET ROM, Microsoft delivered the BASIC binary together with the source to Commodore. After BASIC V2, Commodore adapted it to their own assemblers and built it themselves – so Microsoft’s development tools were never used by Commodore.

Merlin 128 (C128)

The Merlin 128 Macro Assembler by Glen Bredon was a commercial assembler for the C128. It was used by Dennis Jarvis while he worked on the DOS of the Commodore 65. Jarvis had used Merlin for his personal projects before, and it had become the tool of his choice.

He started out with the source of the CBM 8250 disk drive ROM, converted it from Commodore’s format to Merlin (PETSCII) format, and developed on top of it. The 65CE02/4510 extensions were used through a set of macros.

Towards the end of the project, the C65 DOS code was ported from Merlin to the cross-platform Commodore 6502ASM.

Reverse-Engineered geoWrite 2.1 for C64 Source Code

Michael Steil — Mon, 23 Nov 2020 14:26:48 +0000

geoWrite is a WYSIWYG rich text editor for the Commodore 64 GEOS operating system. I created a reverse-engineered source version of the geoWrite 2.1 for the C64 (English and German) for the cc65 compiler suite:

https://github.com/mist64/geowrite

The source compiles into the exact same binaries as the English and German versions of geoWrite 2.1 included with GEOS 2.0.

Not all code has been commented yet, contributions are welcome.

The pagetable.com article series on geoWrite internals is based on the results of this reverse-engineering effort. Here is the list of articles in the series again:

Diskettenlaufwerke am Beispiel der Commodore 1541 [video]

Michael Steil — Tue, 20 Oct 2020 16:30:15 +0000

Dieser Vortrag wurde am 11. Oktober 2020 auf dem (virtuellen) Vintage Computing Festival Berlin gehalten.

Diskettenlaufwerke am Beispiel der Commodore 1541

Aus der Homecomputer- und frühen Personal-Computer-Zeit sind Disketten nicht wegzudenken. Dieser Vortrag beschäftigt sich mit Diskettentechnologie am Beispiel des 5,25-Zoll-Laufwerks “Commodore 1541”, bekannt als das Laufwerk zum Commodore C64. Nach einer historischen Einordnung (Bänder, Platten, 8-Zoll-Disketten) besprechen wir den Aufbau von Laufwerken und Disketten, sowie das Low-Level-Aufzeichnungsformat (Spuren, Sektoren, SYNC-Marker, GCR-Codierung) und dessen Implementierung in der Laufwerks-Firmware. Danach behandeln wir das Dateisystem-Format und die Datenübertragung zwischen dem Laufwerk und dem C64. Wir thematisieren außerdem Schnelllader, welche die Laufwerks-Firmware durch optimierteren Code zum Lesen und zur Datenübertragung ersetzen, sowie Kopierschutz-Systeme, die Nonstandard-Formate mit verschleierten Leseroutinen verbinden. Schließlich sprechen wir noch über Lösungen zum fehlerfreien Auslesen von antiken Disketten mit moderner Hardware.

Ankündigung: Vortrag "Diskettenlaufwerke am Beispiel der Commodore 1541" am VCFB 2020

Michael Steil — Sat, 10 Oct 2020 09:49:27 +0000

This post is about an upcoming talk in German.

Update: Mitschnitt verfügbar!

Am Sonntag, den 11. Oktober 2020 um 17:00 gibt es auf dem (virtuellen) Vintage Computing Festival Berlin meinen Vortrag Diskettenlaufwerke am Beispiel der Commodore 1541.

Der Vortrag wird live gestreamt; ein Mitschnitt wird danach verfügbar sein.

Diskettenlaufwerke am Beispiel der Commodore 1541

Inside geoWrite – 9: Keyboard Handling

Michael Steil — Thu, 24 Sep 2020 14:19:10 +0000

In the series about the internals of the geoWrite WYSIWYG text editor for the C64, this article discusses how the app consolidates keyboard input to keep up with fast typists.

Article Series

GEOS Keyboard Basics

GEOS does not use the keyboard driver in the C64’s ROM, but has its own implementation that uses ASCII-based 8 bit key codes: Codes $20 through $7F are the regular ASCII printable characters. All control keys (e.g. cursor keys) as well as non-ASCII keys (like “£”) are mapped into the control code space $00 through $1F:

Code	Key	Comment
$00		no key
$01	F1
$02	F2
$03	F3
$04	F4
$05	F5
$06	F6
$07	NO SCROLL	C128 only
$08	CRSR←	C64: SHIFT+CRSR→
$09	TAB	C64: Ctrl+I
$0A	LF	C128 only
$0B	ENTER	C128 only
$0C		unused
$0D		unused
$0E	F7
$0F	F8
$10	CRSR↑	C64: SHIFT+CRSR↓
$11	CRSR↓
$12	HOME
$13	SHIFT+HOME	“CLR”
$14	←
$15	UPARROW
$16	STOP
$17	SHIFT+STOP	“RUN”
$18	£
$19	HELP	C128 only
$1A	ALT	C128 only
$1B	ESC	C128 only
$1C	SHIFT+DEL	“INST”
$1D	DEL
$1E	CRSR→
$1F		used internally

All this defines the codes $00-$7F, which accounts for seven bits. The uppermost bit (value of $80) indicates whether the “Commodore” modifier key has been pressed with the key. The Commodore key is used in GEOS for keyboard shortcuts, much like the Command key on the Macintosh.

Keyboard Buffer

The keyboard driver runs in the interrupt context, which is triggered 60 times per second by a timer. It scans the keyboard and puts new key codes at the end of a 16 byte queue: This way, no keys will be lost if the application can’t get to handling incoming keys immediately – as long as it’s not more than 16 keys behind.

The application can get the next key from the queue by calling GetNextChar. If there is a key in the queue, it will be removed from the queue and returned. Otherwise, a value of 0 will be returned.

All this is pretty much what’s going on in any operating system – including the C64’s original ROM in BASIC mode.

Callback

GEOS is an event-driven environment: The system’s “main loop” controls all execution and calls back the app based on events, like mouse clicks, key presses and timers.

The idiomatic way to handle the keyboard on GEOS is therefore to register for keyboard callbacks. On every main loop iteration, the system will check whether there is at least one key in the queue. If this is the case, it will dequeue the oldest key, store it in the system variable keyData and call the function pointer keyVector, which the app can set. If the vector is 0 (the default), no callback will happen and the key code will be discarded.

The app can then read the key code from keyData, handle it and return. If there are more keys in the queue, the main loop will call the vector again, with the next key code, in the next iteration.

Alternatively, the callback function can read more keys from the queue by repeatedly calling GetNextChar.

geoWrite

The core function of geoWrite is text input, so it needs to make sure it is responsive, and no key presses get lost.

And that’s tricky: On a 1 MHz CPU, redrawing several lines of proportional fonts can take seconds, so it is quite common that the app falls behind the user’s typing. So if the user types a character that leads to a few lines being redrawn, and the user types three characters in the meantime, these three characters may cause three redraws, in which time the user can type another nine characters. The app would fall more and more behind, and eventually drop characters.

To avoid this, the app has to catch up to the user’s typing. The good news is that once the app is behind on keys, it has several key codes to work with, so there are some optimizations that can be applied:

Consolidating Character Inserts

When inserting a single character, the text data after the cursor is moved up by one byte and the new character is added. Then, the screen is updated: The part of the current line to the right of the cursor is redrawn, and if the last word of the line overflows to the next line, that one has to be redrawn as well and so on.

The following animation shows this in action:

Instead of doing this work for every character in the keyboard buffer separately, a lot of work can be saved by doing it for all characters in one go:

Only move the buffer up once, by the number of characters in the buffer.
Copy the characters into the buffer.
Redraw only once.

This is basically the same as what happens when pasting text from a text scrap.

You can see the effect of this strategy on the animation above. The “a” key was held down on the keyboard, generating a fast stream of “a” key codes.

The first character is inserted, which triggers a redraw of just the line.
In the meantime, two more characters came in, which are added in one go. This triggers the redraw of the whole paragraph, in this case, since the word “Commodore” had to be moved to the next line.
This redraw was so expensive that seven more characters came in in the meantime. The line gets redrawn, and it does not overflow this time.
Because this redraw was cheap, only two more characters came in. Again, it causes a redraw of only the current line.

We see in practice that whenever more text needs to be redrawn, more keypresses are buffered, but the system catches up quickly.

All printable characters as well as line break and TAB characters can be combined into a single string to be inserted. All other keys have to be special cased.

DEL Characters

If the user types a character, and then hits the DEL key, the document will effectively be unchanged, but the text in memory was first moved up by one byte, then moved down by one byte again, and the screen was updated twice.

So if the keyboard buffer contains a DEL char, geoWrites removes the preceding character from the buffer. The two characters cancel each other out and no more work has to be done.

If a DEL is encountered while the buffer is empty, no character can be removed, but it’s not a character that can be inserted either. geoWrite then increments the count of DEL keys it has seen in the keyboard buffer that remained. There are some examples after the next paragraph.

Control Keys and Keyboard Shortcuts

Non-printable key codes like cursor keys and keyboard shortcuts are also special. If the queue contains “abc”, followed by C=T (Commodore Key + T; paste text) and “def”, geoWrite has to insert “abc”, paste the text in the text scrap, and then insert “def”. It can not combine the two text strings.

This is why whenever a control key or shortcut is encountered, geoWrite stops processing the queue. The DEL characters, the string so far and the control key will be evaluated, and the remainder of the keyboard queue will be handled once the keyVector is called the next time.

Examples

Here are some examples of contents of the keyboard queue, the number of DEL characters at the beginning, the string to be inserted, the control key that was detected, and the contents of the keyboard queue after this processing:

Kbd Queue Before	# DEL	Insert String	Control Key	Kbd Queue After
`abc`	0	`abc`
`abc`	0	`ab`
`abcd`	0	`abd`
`abcDEL>`	0
`abcDEL>`	1
`abc`	1	`abc`
`abc`	2	`abc`
`abc`	1	`bc`
`abc`	0	`a`	`C=T`	`bc`
`abcde`	1	`bc`	`C=T`	`de`

The examples show that DEL keys that follow a character will effectively remove that key, but if the number of DEL key codes exceeds the number of characters before it, the extra DEL keys will be counted. And if there is a control key or keyboard shortcut, processing of the keyboard queue stops at this point.

Code

This is the code that processes the keyboard queue and returns the number of characters to delete, the string to insert, and the detected control key:

processKbdQueue:
        lda     #0
        sta     delCount                ; no excess DEL keys so far
        sta     kbdStringCnt            ; kbd string empty
        sta     curControlKey           ; no control key

        lda     keyData                 ; first character, as passed into keyVector

@again: bmi     @ctrl                   ; "C=" shortcut, so stop processing

        cmp     #KEY_DELETE             ; DEL key?
        beq     @del                    ; yes
        cmp     #KEY_INSERT             ; SHIFT+DEL?
        beq     @del                    ; yes (same as DEL in GEOS)

        cmp     #CR                     ; return key?
        beq     @nctrl                  ; yes, does not count as a control key
        cmp     #TAB                    ; TAB key?
        beq     @nctrl                  ; yes, does not count as a control key

        cmp     #$20                    ; below $20, i.e. non-printable
        bcc     @ctrl                   ; yes, it's a control key, stop processing

@nctrl: ldx     kbdStringCnt
        sta     kbdString,x             ; add to kbd string
        inc     kbdStringCnt
@next:  jsr     GetNextChar             ; are there more characters in the kbd queue?
        tax
        beq     @rts                    ; no, return
        bne     @again                  ; yes, repeat

@del:   ldx     kbdStringCnt            ; are there characters in the kbd string?
        beq     @excss                  ; no, no characters to delete

        dec     kbdStringCnt            ; remove previous character
        bra     @next                   ; and continue

@excss: inc     delCount                ; count excess delete characters
        bne     @next                   ; and continue

@ctrl:  sta     curControlKey           ; save control key
@rts:   rts

geoWrite’s keyVector handler calls this, and then applies the three outputs (DELs, string, control key) to the document:

First, if there are excess DELs, it deletes characters at the current cursor position according to the number of DELs. It just moves the remainder of the buffer down by the number of DELs.
Then, if there is a string, it inserts it in one go by moving the buffer up by the length of the string, and copying the string into the text.
If there were DELs or a string, the screen is updated.
If there is a control key, it gets evaluated.

There are two special cases: If there are more DELs than characters on the current page, the extra DELs have to be handled differently. And if there are DELs and a string, there is an optimization performed in the first two steps: There is no need to move the buffer twice. For example

if there is one DEL and two characters in the string, the buffer has to be moved up by one byte, and the insertion position is one byte before the cursor.
if there are two DELs and one character in the string, the buffer has to be moved down by one byte, and the insertion position is two bytes before the cursor.
if there are two DELs and two characters in the string, the buffer does not have to be moved at all, and the insertion position is two bytes before the cursor.

Conclusion

When designing software, a slow CPU can be made up for if there is a lot of memory, e.g. by caching data to avoid calculating it. And too little memory can be made up for by a fast CPU. GEOS and geoWrite had neither. They were written for an 8-bit CPU with 64 KB of RAM.

This scenario makes all design decisions interdependent:

The memory constraint requires all code to be optimized for size, which makes it slower, and requires geoWrite to be split into 9 parts that are loaded from the slow disk drive.
The font renderer could have been much faster, had it had the space for fastpath code and caches.
Slow font rendering requires geoWrite to include lots of extra logic to be able to keep up with fast typists.
The extra code increases the memory pressure, requiring other code to be written more densely or pushed into different records.

The GEOS engineers seem to have found a reasonable tradeoff. Yes, geoWrite is slow on an unexpanded C64, but it is a useful and very powerful tool for a 64 KB computer.

Inside geoWrite – 8: Copy & Paste

Michael Steil — Tue, 22 Sep 2020 15:18:45 +0000

In the series about the internals of the geoWrite WYSIWYG text editor for the C64, this article discusses its efficient cross-application cut/copy/paste implementation.

Article Series

GEOS Scrap Architecture

Just like modern operating systems, GEOS supports cut, copy and paste, within one app as well as between multiple apps. But this is about where the similarities end.

First of all, there is no single clipboard/pasteboard: There is one for every type of data, and they are called “scraps”. There can be a “text scrap” and a “photo scrap” (image data) at the same time, for example. When the user selects “paste” in the “edit” menu, geoWrite asks what type should be pasted:

The GEOS KERNAL has no concept of scraps – they are purely a convention between apps. Text and photo scraps are specified by the GEOS reference manual and apps like geoWrite, geoPaint and geoPublish implement this specification.

Since scraps can be rather large, they are stored as files on disk. Copying text between two apps basically means that one app writes a file with a defined file name in a standardized format, and the other app reads it. As a side effect, scraps are persistent even across reboots of the operating system. Here is a text scrap on disk:

Text and photo scraps are the two types specified by GEOS, but since there is nothing special about scrap files, any application can use its own scrap format: geoCalc uses calc scraps, for example, which allows spreadsheet cells to be copied between documents.

By convention, scraps are sequential (non-VLIR) files with a name of “* Scrap” and a type of “* Scrap Vn.n”. “*” represents the type of scrap, space-padded to 5 characters:

App	Filename	Type String
geoWrite 2.1	`Text Scrap`	`Text Scrap V2.0`
geoPaint 2.0	`Photo Scrap`	`Photo Scrap V1.1`
geoCalc 1.0	`Calc Scrap`	`Calc Scrap V1.1`

As you can see from the type string, scraps are versioned, just like apps and documents. Apps should contain conversion code to accept older versions, and refuse to load versions that are newer than supported.

GEOS comes with the Text Manager and Photo Manager desk accessories, which are simple databases (called “albums”) for text and photo scraps, respectively. The following screenshot is Text Manager showing a plain-text preview of one text scrap in its album:

Text Scraps

Format

A text scrap is a sequential file on disk with a name of “Text Scrap” and a type of “Text Scrap V2.0”. It supports all geoWrite features (fonts, styles, rulers) except embedded images.

The first two bytes of the file are the size of the data that follows, so the data can be up to 65535 bytes. geoWrite can paste any size, but cannot create scraps larger than a page.

The remainder is regular geoWrite page data. It must start with a NewCardSet escape sequence to define the font and style of the text that follows. Here is an example:

00000000  10 00 17 8c 00 40 48 65  6c 6c 6f 20 57 6f 72 6c  |.....@Hello Worl|
00000010  64 21                                             |d!|

10 00 ($0010) is the scrap’s length, 16 bytes.
17 is the ESC_NEWCARDSET escape code.
8c 00 ($008C) specifies the California font at 12 pt.
40 is the style byte, and means bold face.
The remainder is the ASCII text “Hello World!”

The text can contain the following control codes:

Code	Description
$09	Tab
$0C	Page Break
$0D	Line Break
$11	Ruler Escape
$17	NewCardSet Escape

See part 7 for more information on the geoWrite escape sequences. For the following sections, the background discussed in that article may be generally useful.

geoWrite Implementation

By convention, the text scrap needs to be a file on disk. But disk is slow, so geoWrite uses a 300 byte cache in memory. When copying and pasting within the app, and if the text is small enough to fit in the cache, the scrap is not written to disk until necessary.

Copy

If the user selects some text and then clicks on “copy” in the “edit” menu, the current selection gets copied into a text scrap.

First, geoWrite looks through the selected range of geoWrite data to see whether it contains an embedded image (ESC_GRAPHICS). If this is the case, it shows an error, since text scraps cannot contain images.

Then it trims the selection so that it doesn’t contain unnecessary data: If the very end of the selection is a ruler data structure, it gets removed, since all its properties (margins, tab stops, alignment and spacing) only apply to the paragraph following it, which is not part of the selection.

The same would be true for a NewCardSet (font and style) structure at the end of the selection, but the text selection logic has already stripped it from the end.

All text scraps have to start with the NewCardSet structure to set font and style. geoWrite does not have the space to work on a copy of the selected text and needs to create the scrap in-place: Therefore it saves the four bytes preceding the text range and overwrites them with the NewCardSet structure. It will restore these bytes after saving the text scrap.

If the resulting scrap data is small enough, it will be copied into the buffer in memory. Otherwise, it will be saved to disk. If there is already a text scrap on disk, it will be overwritten.

Paste

The “paste/text” function in the “edit” menu inserts the text scrap at the cursor position. If there is currently text selected, it gets deleted before inserting the scrap, effectively replacing the selection with the scrap.

To insert text into a page, the text between the cursor position and the end of the buffer is moved up in memory to make space for the data to be inserted. If there is a text scrap in the memory buffer, geoWrite just copies the scrap data verbatim into the page.

Disk

If the text scrap is on disk, it is more complicated. geoWrite cannot just make space in the buffer and read the scrap into it: It might not fit. The app’s memory manager generally keeps about one page in the buffer, and pages data from and to disk to work with bigger amounts of data.

geoWrite therefore reads and inserts the text scrap block by block, every time moving the page data up by a block. If this would overflow the buffer, the memory manager will do a repagination run to move some data to disk and therefore reduce the size of the data in the buffer.

But there is now another problem: Inserting the text scrap is no longer an atomic operation. geoWrite must be able to abort the insertion process after any block and still have a consistent document at the end. There are several reasons the insertion might be aborted:

There is an I/O error when reading the scrap.
The document exceeds 61 pages.
The disk is too low on space to continue.

Ruler (27 bytes) and NewCardSet (4 bytes) escape sequences may cross a block boundary, so if insertion is aborted, it could happen that an incomplete sequence is added to the document, which would leave the document in an illegal state.

Therefore, geoWrite interprets the scrap data and stops before escape sequences that span two blocks. It then only inserts the data before this incomplete escape sequence. Then, when the next block is loaded, it inserts the whole escape sequence in one go.

With this strategy, if an insertion has to be aborted, the text up to the error will be cleanly added to the document.

Style

All text scraps define the font and style at the beginning, and they can contain a ruler definition. By just concatenating the scrap with the document’s existing text after the insertion point, these font/style/ruler changes would continue to apply to existing text after the inserted scrap. Therefore, geoWrites inserts after the scrap a copy of the ruler or cardset that was active before the insertion point, in order to keep the style of the original text the same.

There is no need for this though:

if there is already a ruler/NewCardSet escape at this position.
if the paste happens at the end of the document.
if the paste happens at the end of a page. The next page always starts with explicit paragraph and font/style escapes.

Lazy Logic

Copying and pasting text that is less than 300 bytes within geoWrite happens without disk access, but for interoperability with other apps, the buffer in memory will have to be written to disk in two cases:

If geoWrite runs a desk accessory. Desk accessories are like apps, but they are smaller, launched from a regular app and they return to the app when they quit. Examples would be the calculator, note pad and the text and photo managers. These desk accessories may want to access the text scrap.
It geoWrite quits. If geoWrite is launched again, or any other app is run, this app may want to read the text scrap.

Photo Scraps

Format

A photo scrap is a sequential file on disk with a name of “Photo Scrap” and a type of “Photo Scrap V1.1”. It is generally a rectangular monochrome image.

The first three bytes of the file are the dimensions of the image. The width is one byte and is measured in units of 8 pixels. The next two bytes are the height in pixels. The theoretical maximum dimensions of a photo scrap are therefore 2040 (255*8) * 65535, with the width divisible by 8.

The bitmap data uses 8 horizontal pixels per byte (the leftmost pixel is bit 7, 1-bits are black). Consecutive bytes describe a pixel line from left to right. These lines are stored row by row starting from the top.

This bitmap data is stored compressed using a format based on runlength-encoding (RLE). GEOS KERNAL’s “BitmapUp” can decode this format, so applications can just pass the data to the KERNAL’s APIs without having to care about the specific encoding.

BitmapUp-compressed data is a sequence of packets. Every packet starts with a “count” byte:

count Value	Description
$00	Reserved
$01-$7F	Repeat: Repeat the following byte count times
$80	Reserved
$81-$DB	Unique: Use the next count – $80 bytes literally
$DC	Reserved
$DD-$FF	Bigcount: Followed by a bigcount byte. Repeat the following count – $DC bytes bigcount times, interpreting the resulting bytes using the Repeat and Unique rules.

As an example, let’s look at a 16×16 rectangle:

****************
*              *
*              *
*              *
*              *
*              *
*              *
*              *
*              *
*              *
*              *
*              *
*              *
*              *
*              *
****************

It can be compressed into 9 bytes:

.byte   2,%11111111
.byte   $DC+3,14
    .byte   $80+2,%10000000,%00000001
.byte   2,%11111111

The first line instructs the decoder to repeat the bit pattern %11111111 2 times, producing 16 black pixels – the top line of the rectangle.
The second line will cause the next 3 bytes to be repeated 14 times, once for every line of the rectangle except the first and the last one.
These next three bytes are again encoded and tell the decoder to take the next 2 bytes verbatim: %10000000 and %0000001 describe one line of the rectangle with the leftmost and the rightmost pixel set.
The last line is the same as the first one; it creates the bottom row.

After the monochrome bitmap data, an image scrap can optionally contain data on how to colorize 8×8 pixel squares. geoWrite does not support color and just ignores this part.

geoWrite Implementation

geoWrite can paste photo scraps with heights up to 144 pixels. Images wider than the page’s dimensions will effectively be cropped when rendering.

In geoWrite documents, image data is not stored inline with the text of the document. Instead, the text contains a 5 byte graphics escape sequence pointing to the image:

Offset	Type	Contents	Description
0	Byte	Escape Code	Constant $10 (`ESC_GRAPHICS`)
1	Byte	Image Width	Width of image divided by 8
2-3	Word	Image Height	Height of image
4	Byte	Record Number	Number of record containing image data

The data of each image is stored in a separate VLIR record. The format of image records is that of photo scraps, so when pasting an image, all geoWrite has to do is make a copy of the photo scrap file into a new VLIR record in the document.

Before this can be done, a few checks have to be made though:

A photo scrap file must exist.
The file version must not be too new. (There is a bug in geoWrite 2.1: It checks for a maximum version of V2.1, but the highest version of the photo scrap file format at the time geoWrite was written was V1.1 – and still is today. The version checking code was meant for geoWrite documents and text scraps, whose versioning follows the geoWrite version. It should have been special cased for photo scraps.)
The height must not be more than 144 pixels.
There must be space in the document. A geoWrite document can hold up to 64 images.

Nomenclature

One last thought about GEOS and naming things. What I called “image” throughout this article, GEOS calls:

graphics in the KERNAL API.
picture in the geoWrite UI.
photo in all references to photo scraps.

References

Michael Farr: The Official GEOS Programmer’s Reference Guide
Berkeley Softworks: The Hitchhiker’s Guide to GEOS

Inside geoWrite – 7: File Format and Pagination

Michael Steil — Wed, 16 Sep 2020 17:34:59 +0000

In the series about the internals of the geoWrite WYSIWYG text editor for the C64, this article discusses how its file format allows the app to efficiently edit documents hundreds of KB in size.

Article Series

The Overlay System
Screen Recovery
Font Management
Zero Page
Copy Protection
Localization
File Format and Pagination ← this article
Copy & Paste
Keyboard Handling

Design

Writing a word processor, especially a WYSIWYG one, poses two core challenges:

Memory Management: The whole document may not fit into memory. A small text file is easily dozens of KB, and a large one hundreds of KB, not counting inline images. The word processor needs a strategy to only keep the parts in memory that are currently needed.
Pagination: WYSIWYG mans that the user will always see on which page, and where on that page the text that is currently being edited will appear when printing. This requires the word processor to layout the whole text up to the current position.

Memory Management

The natural approach to memory management is to only read a small part of the document into memory for viewing and editing. Once the user jumps to a different part of the document or writes enough text to fill the buffer, the current part has to be written to disk, and a different part has to be loaded into memory.

This is tricky with regular (“sequential”) files: When writing a part of the document back to disk, the new version is generally a different size than the part that it replaces. This means that the remainder of the document has to be moved. In the worst case, this requires making a complete copy of the document and temporarily taking up twice the space on disk.

GEOS has the concept of a VLIR file, which is a collection of up to 127 “sub-files”, called records, numbered 0 to 126. The geoWrite application itself consists of several such records for its own code, which are paged in based on which functionality is being used. Similarly, geoWrite documents are VLIR files containing multiple records of no more than 7 KB – this is how much memory is left on a 64 KB RAM system after accounting for the operating system and the geoWrite application.

In the generic case, the word processor can then just read a single record from disk into memory, have the user edit it, and write the record back to disk. All other records remain unchanged.

A simple approach for dividing up the document would be to just cut it into 7 KB parts. If text is added to the middle of the document, and the record overflows 7 KB, it will have to be divided into two, and all subsequent records have to be moved up. If two consecutive parts are less than 7 KB together, they can be combined, and subsequent records have to be moved down. Moving records really just means renaming them and is therefore cheap.

The problem with dividing up the document at fixed limits is that the point where text continues from one record to the next may be anywhere. Therefore, a single sentence on the screen might come from two different records, and moving the mouse across this invisible line will cause slow (and surprising!) disk access. It’s even worse when performing an operation on selected text that spans two records, which may cause swapping in and out of parts multiple times.

Pagination

The other challenge is pagination. There is no information in the document on how to map a page number to a record, so if the user wants to jump to a specific page, the word processor would have to actively find out what part of the document will end up printed on that page. If the desired page is after the current one, all text from the current position on has to be paginated, i.e. put through the page layout logic until the point in the document is found that will be printed on the specified page. If the desired page is before the current one, the same logic would have to be done starting from the first page of the document.

To avoid redundant “re-pagination”, the calculated pagination information could be stored as metadata in the file. For every page, this would be the combination of record number and offset within the record to point to the first character of the page. If text is edited anywhere but at the end of the document, the remainder of the document has to be re-paginated, and the table has to be updated – this can be done lazily. Jumping within the document now only requires a table lookup.

geoWrite Strategy

geoWrite uses a combined strategy for memory management and pagination: It maps every page to exactly one record. The app reserves 7000 bytes of RAM for the currently edited page, which corresponds to just about one page fully filled with 9 pt text. Jumping to a different page is as simple as reading a different record – without requiring a separate page-to-record mapping table. And it also solves the other problem from before: Since a whole page is guaranteed to be in RAM, editing text within a page generally does not cause disk access.

Picking pages as the unit of editing does sound weird at first, because the separation into pages is such a transient property of a text document. After all, the very idea of a word processor (as opposed to a typewriter) is that the user can regard the document as just linear text without worrying about page breaks. When editing text, page boundaries change, and the whole document would have to be changed. This is true, but these re-layouts of the document are necessary when editing, no matter what strategy is used to cut the document into pieces.

Here is an overview of the properties of the two strategies, with the more desirable ones marked in bold:

	Max Size Records + Metadata	One Record per Page
Jump to page	lookup, read record	read record
Add text in the middle	re-pagination	re-pagination & data copy
Surprising disk access	yes	no

Both strategies allow navigating the document efficiently.
Adding text to the middle of the document always requires re-pagination of the following pages at some point. With the “one record per page” strategy, this also requires going through all following records and re-combining them according to the new page breaks.
Generally, no edit operation within a single page will cause disk access with the “one record per page” strategy.

So it’s basically a tradeoff between repagination speed and editing performance, and geoWrite went for the latter.

File Format

Before we discuss when and how exactly a document is re-paginated with geoWrite, we have to dive into the exact file format.

geoWrite files are VLIR files. GEOS specifies a VLIR file as consisting of a 256 byte file header and up to 127 records of arbitrary lengths.

The file header of any GEOS document contains the file’s icon, type and creator, a comment, and optionally, type-specific metadata.

geoWrite stores 9 bytes of metadata at offset $89 for document-global properties:

Offset	Size	Contents	Description
$89-$8A	Word	Start Page Number	Number of first page, usually 1
$8B	Byte	Title/NLQ Flags	$80: has title page, $40: NLQ mode
$8C-$8D	Word	Header Height	Height of header in dots
$8E-$8F	Word	Footer Height	Height of footer in dots
$90-$91	Word	Page Height	Page height in dots

The start page number can be set to numbers other than 1, to allow splitting a project into multiple document files with consistent page numbering.
If the document has a title page, no header or footer will be printed onto the first page.
In NLQ mode, the document will be printed by sending ASCII characters to the printer, using the printer’s built-in fonts. This changes the metrics calculation.
Header and footer height are calculated from the header/footer text. These are cached values to allow page layouts without having to measure the height of the header and the footer.
The page height is generally a property of the printer. The field in the file header specifies what page height was used for paginating the document. If the page height of the current printer is different, the document has to be re-paginated.
All sizes are specified in dots, which are 1/80 of an inch on paper, and the same as GEOS screen pixels. geoWrite documents are either 480 (6 inches, “regular”) or 640 (8.2 inches, “wide”) dots wide. The default height (i.e. if no printer is installed) is 752 dots (9.4 inches).

These are the contents of the VLIR records of a geoWrite document:

Records	Contents
0-60	Pages
61	Header
62	Footer
63	Reserved
64-126	Images

A document can have up to 61 pages, which are stored in records 0 through 60. Internally, page numbering is zero-based. For the UI, the start page number from the header is added.
The text for the header and footer are stored in two separate records. They have the same format as pages.
geoWrite supports up to 63 inline images, each of which is stored in its own record, which is pointed to by the page that contains the image.

In a properly closed geoWrite document, all page records are consecutive with no empty records in between, all image records are referenced by pages, pagination is consistent with the page height in the header, and the header and footer height values in the header correspond to the text in the header and footer records.

Text Format

The text is stored in ASCII format, that is, codes $20 through $7F are printable characters, and codes $00 through $1F are control codes. Of these, only the following are defined:

Code	Description
$00	No-Op
$09	Tab
$0C	Page Break
$0D	Line Break
$10	Graphics Escape
$11	Ruler Escape
$17	NewCardSet Escape

The $00 character code specifies the end of the file. The graphics, ruler and NewCardSet escape codes indicate data structures that need a detailed description.

NewCardSet Escape

The NewCardSet structure encodes a change in font and style. It can appear anywhere in the document.

Offset	Size	Contents	Description
0	Byte	Escape Code	Constant $17 (`ESC_NEWCARDSET`)
1-2	Word	Font ID	Encoded font and point size identifier
3	Byte	Style	Text style bitfield

GEOS Font IDs are 16 bit values that encode the unique font identifier (0: system font, 1: University, 2: California, 3: Roma, …) in bits 6-15, and the point size in bits 0-5.
The style bitfield is defined as follows:

Bit	Description
7	Underline
6	Bold
5	Reverse
4	Italics
3	Outline
2	Superscript
1	Subscript
0	Reserved

All bits can be combined, except subscript with superscript.
All zero bits indicate plain text.

Ruler Escape

The ruler structure encodes a paragraph’s properties. It can appear only at the beginning of a new paragraph.

Offset	Type	Contents	Description
0	Byte	Escape Code	Constant $11 (`ESC_RULER`)
1-2	Word	Left Margin	Left margin
3-4	Word	Right Margin	Right margin
5-6	Word	Tab Stop 0	Position/type of tab stop 0
7-8	Word	Tab Stop 1	Position/type of tab stop 1
9-10	Word	Tab Stop 2	Position/type of tab stop 2
11-12	Word	Tab Stop 3	Position/type of tab stop 3
13-14	Word	Tab Stop 4	Position/type of tab stop 4
15-16	Word	Tab Stop 5	Position/type of tab stop 5
17-18	Word	Tab Stop 6	Position/type of tab stop 6
19-20	Word	Tab Stop 7	Position/type of tab stop 7
21-22	Word	Paragraph Margin	Left margin of first line of paragraph
23	Byte	Spacing/Alignment	Line spacing and text alignment
24	Byte	Reserved	Reserved for text color
25	Byte	Reserved	Reserved
26	Byte	Reserved	Reserved

All sizes are in dots.
The left margin is less than the right margin, and the tab stops are in ascending order.
The most significant bit of each tab stop indicates whether it is a regular or a decimal tab stop. Decimal tab stops align the decimal separator to the tab stop.

Bit 15	Description
0	Regular tab stop
1	Decimal tab stop

Line spacing and alignment are encoded into a single byte:

Bits 0-1	Description
0	Left aligned
1	Centered
2	Right aligned
3	Justified

Bits 2-3	Description
0	1.0 line spacing
1	1.5 line spacing
2	2.0 line spacing

Graphics Escape

The graphics escape is used to embed an image into the text. It can appear anywhere in the document, and is regarded as a paragraph of its own.

Offset	Type	Contents	Description
0	Byte	Escape Code	Constant $10 (`ESC_GRAPHICS`)
1	Byte	Image Width	Width of image divided by 8
2-3	Word	Image Height	Height of image
4	Byte	Record Number	Number of record containing image data

All sizes are in dots.
The width of the image has to be divisible by 8.
The record number of the image data is in the range of 64 through 126.

Page Format

To divide the linear text into pages, it is not enough to just cut the file at the (hard or soft) page breaks. When navigating to a page, it would not be clear what the current font and paragraph style of the first character of the page should be. Therefore, every page starts with a header containing this information, repeating the font/style/ruler state from the end of the previous page:

Offset	Length	Contents	Description
0-26	27 Bytes	Ruler Data	Ruler data
27-30	4 Bytes	NewCardSet Data	Font/style data
31		ASCII Text	Text of the document

The ruler data and NewCardSet data include their respective escape codes (ESC_RULER = $11, ESC_NEWCARDSET = $17), which makes any page by itself legally formatted geoWrite text.

Memory Representation

The strategy of the editor is to basically keep a single page in RAM and editing there. This way, for most editing work, there is no need to access the disk.

The buffer in RAM is 7000 bytes in size and in the same format as a page on disk: The first bytes are the header (ruler, NewCardSet), and the remainder is the actual text data, which may include NewCardSet, ruler and graphics escapes.

When the user jumps to a page, the corresponding record is loaded into the buffer. And when a new page is added to the document, an empty page is created in the buffer.

But the buffer isn’t always exactly one page. The text in the buffer starts at a known page boundary in the document, and the start of the buffer is associated with a page number in the document on disk.

But the amount of text in the buffer may be more than fits on the current page: If the user enters some text in the middle of a page, it will be inserted at the corresponding place in the buffer. The text at the end of the buffer may technically belong to the next page, because when laying out the current page, it wouldn’t fit.

The buffer may also be less than the current page of the document: If the user deletes text from the middle of a page, then the data in the buffer may not fill the current page any more – what should show up at the very bottom of this page is actually stored in the following record.

Streaming

It is not a problem to have more text than fits the page in the buffer (as long as the data doesn’t overflow the available 7000 bytes – we’ll talk about that later). But if there is less than a page in the buffer, and the bottom of the page needs to be rendered onto the screen, the missing text needs to be loaded from the next record.

The whole next record is unlikely to fit into the remainder of the current buffer, so the memory management logic loads data from the following records at a block (256 byte) granularity.

Let’s look at the code that does this. When rendering the page for the screen or for printing, and during re-pagination, all code goes through the getByteFromBuffer function:

getByteFromBuffer:
        CmpW    pageEndPtr, r15         ; end reached?
        bcc     @end                    ; yes
        ldy     #0
        lda     (r15),y                 ; read byte
        rts

The virtual register r15 points to the next byte, and pageEndPtr points to the end of the data in the buffer. The interesting case here is reaching the end:

@end:   bit     streamingMode
        bpl     @skip

        [push r0 though r15]
        jsr     streamBlock
        [pop r0 though r15]

        bra     getByteFromBuffer       ; try again

@skip:  lda     #0
        rts

If streamingMode is false, the function just returns NULL bytes, indicating the end of the buffer. But in “streaming mode”, it calls streamBlock (not shown). On its first invocation, this function manually looks up the next record in the filesystem and loads the first block, appending it to the data in the buffer, basically extending the buffer by a single block from the next record. The getByteFromBuffer code now has more data that it can fetch.

On subsequent invocations, streamBlock will keep reading blocks from the record, and will also skip to the following records. With streamingMode enabled, getByteFromBuffer will effectively read bytes from the whole document linearly.

The ruler and NewCardSet escapes at the beginning of each record are redundant and not needed when concatenating the pages, so streamBlock skips them. All of this is completely transparent to the caller.

Let’s look at an example in practice: The document has two pages of text. The user is at the very top of the first page and deletes a few lines. Visually, a few lines from the second page should now show up at the bottom of the first page. But the editor does not care at this point, the buffer only contains the reduced data. And since the cursor is still at the top of the page (and vertically, only about one fifth of a page fits onto the screen), the text renderer for the screen won’t reach the end of the buffer when reading bytes. But once the user moves the cursor down to the end of the page, the text renderer’s calls to getByteFromBuffer will cause one or more blocks of the next record to be loaded into the buffer before the part can be shown that was previously on page 2.

Reading in blocks from subsequent pages is not just some temporary look-ahead: Even though the read-in blocks still exist on disk as part of the next record, geoWrite now regards the data as part of the buffer in memory and will disregard them when accessing the next record in the future.

Repagination

When adding text to or deleting text from the middle of a document, the document generally needs re-pagination at some point, that is, the document will be updated so that every record on disk contains exactly the text of the corresponding page. geoWrite does this lazily: As seen before, most editing within a page happens on the buffer in memory. The buffer usually only gets written back to disk when moving away from the current page, at which point the remainder if the document needs to be repaginated.

The same is true if the buffer overflows the available 7000 bytes: The document has to be repaginated from the current page on. Every record will only be filled with text for exactly one page, so when the record for the current page will be loaded again afterwards, it should be significantly below 7000 bytes.

Triggers

There are many other actions that trigger a repagination run, like:

The page height changes because of switching printers or toggling NLQ printing mode.
The page width is changed from 480 to 640 dots. (geoWrite does not allow switching back.)
The “title page” setting is toggled. Since this toggles showing the header and footer on the first page, the height of this page that is usable for text changes.
The header or footer is edited, potentially changing their heights and changing the usable height of pages.
The search/replace function changes text on arbitrary pages.
The function update in the file menu explicitly updates the document on disk into a consistent state, which includes repagination.
The same is done when closing the document or quitting the app.

In some cases, like writing back a page to disk or closing a document, only repagination of pages following the current one is necessary.

Basic Algorithm

The basic repagination algorithm looks like this:

Read the first page into the buffer and enable streamingMode for getByteFromBuffer. Using this function, the whole (remaining) document can now be read as if it was one linear file.
Keep reading from the document until the text fills a page.
Write the current buffer up to this point (including the header) to the record on disk that corresponds to the current page.
Move the remaining data in the buffer up to the beginning. This data is the start of the next page.
Copy the current font, style and ruler state into buffer’s page header.
Repeat all of this until the end of the data is reached.

Measuring Lines

The core of pagination is the function that measures a line of text. Starting with the current pointer into the buffer, it reads and interprets bytes from the document, and returns the line’s width, height, baseline and a buffer pointer that points to the first character of the next line.

This function is also used for rendering a line on screen or for the printer: Before a line can be rendered, the baseline has to be calculated, so that different fonts on the same line are printed consistently. And the width of the line is necessary to center it, for example.

The following screenshot shows an example of mixed fonts in a single line, where it is necessary to gather the lowest baseline before starting to draw the text:

First, this function calculates the available width by subtracting the left margin from the right margin. For the first line of a paragraph, the “paragraph margin” will be used instead of the left margin. It then reads and interprets the document byte by byte.

If a graphics escape is encountered, the height of the image is returned as the line height, since images are always in their own paragraphs.

Otherwise, the function keeps adding up the widths of characters based on the current font, and keeps track of the maximum baseline offset and maximum font height.

A TAB character in the text requires some additional logic: A TAB will have the cursor jump to the next tab stop. To account for this, the measure line function increases the width of the line to reach the tab stop. In the following example, the two words are separated by a TAB character.

          tab stop
             |
.............*..............
Hello        World!
     /\/\/\/\
    added width

For decimal tab stops, it calculates the widths of all following text until the next decimal separator, and increases the width up to the tab stop minus the width of this text. In the following example, “84” is measured, and enough width is added so that the decimal separator is lined up with the tab stop.

      decimal tab stop
             |
.............*..............
Total      84.25 EUR
     /\/\/\
    added width

If there is a ruler escape in the text, the ruler data gets copied into the app’s state. All further calculations will use the new margins and tab stops. The same happens for NewCardSet escapes: All further character size calculations will be based on the new font and style.

During repagination, the metrics of all fonts used in the particular part of the document need to be known to be able to add up character widths. The geoWrite font manager has a font metrics cache that can hold data for up to 8 fonts, which is more than the font data cache, which can only hold an average of 3 font images, depending on their size. The font files have to be loaded at least once in order to extract the metrics, but the images are not necessary for repagination, and it is enough to keep the metrics data.

The end of a line is reached once the text overflows the available width. The function will then reset the buffer pointer to after the last SPACE character – this is the first character of the new line.

The end of a line is also reached if there is either an explicit line break or page break in the text. In this case, the pointer will be set to the character after the break.

Measuring Pages

The function that measures a page first calculates the usable height by subtracting the header and footer heights from the page height – unless this is the title page, in which the full page height is available.

It then repeatedly calls the function to measure a line and adds up line heights until the sum overflows the usable page height. The buffer pointer is reset to the beginning of the first line that does not fit onto the page. This is the first character of the next page.

A special case is the page break character: Page measuring is stopped here, and the pointer to the next character is returned.

Conclusion

While geoWrite is extremely powerful for an app on a 1 MHz computer with 64 KB of RAM, it is also very slow. Some of the reasons are true for many GEOS applications:

The 6502 cannot efficiently handle 16 bit data, so dealing with pointers and dot size values requires large and slow code everywhere.
Because of memory scarcity, code has to repeatedly be paged in for certain functionality.
Some of the code is especially inefficient, because it had to be optimized for size rather than for speed.
Even with the GEOS “turboDisk” driver, the 1541 disk drive is still very slow, at a maximum of 4 KB/sec of linear reading.

In this context, geoWrite picked a document model that allows the user to edit a page at a time practically without any disk access, with the tradeoff of slower repagination. So in practice, repaginating a document that contains dozens of pages can take a minute or more, but on the other hand, geoWrite can usually keep up with the fastest typists when rendering even complicated text layouts in real-time.

P.S.: The image at the beginning of this article shows the error message caused by a record overflowing 7000 bytes. This happens when using a font that is 9 pt or smaller and filling a page completely with characters. geoWrite will insert a page break character and re-run the pagination code.

Inside geoWrite – 6: Localization

Michael Steil — Tue, 08 Sep 2020 22:18:51 +0000

In the series about the internals of the geoWrite WYSIWYG text editor for the C64, this article discusses what was required for the German localization.

Article Series

Overview

Localizing an app doesn’t mean just translating all text. Language is just one part of it. Here are all concepts that require changes to geoWrite:

Language
Date/time format
Number format
Character set

Let’s go through them.

Language

Translating the app is not as easy as just translating all strings.

Some strings must not be translated.
Not all text is part of the UI.
Not all text is in string form.

Do Not Translate

First, let’s look at what must not be translated:

fn_textscrap:
        .byte   "Text  Scrap",0
fn_photoscrap:
        .byte   "Photo Scrap",0

These are magic filenames that contain the current clipboard/pasteboard contents. Translating them would break interoperability between apps in different languages.

Keywords

Then, there are strings where it is up to debate whether they should be translated: geoWrite supports three keywords that get replaced with dynamic contents when used in a page header or footer:

DATE: inserts the current date
TIME: inserts the current time
PAGE: inserts the current page number

For the German version, these keywords were in fact translated: DATUM, ZEIT, SEITE.

Strings

From the strings that should be translated, let’s start with the straightforward ones: Here is a table of strings that is used in menus:

Care has to be taken that the translated version fits the available space: The translations of “edit” and “options” were abbreviated (“Edit”/“Editieren”, “Opt”/“Optionen”), because the whole menu bar would have been too wide otherwise:

For submenus, GEOS programmers have to explicitly state the location and size in pixels, so the definition of a submenu has to change as well:

The German version is wider, so the value for the right border of the menu was updated.

Additionally, all menus in the German version are moved up by one pixel. If you look closely, you can see that the English version has a double line between “file” and “close”, while the German version has a single line. The symbol MENU_HEIGHT in the previous code block is 15 for the English version and 14 for the German version. It is unknown what the purpose of this is.

In the case of dialogs, the translated text might not fit into the same number of lines and might require a re-layout:

So while the English version just consists of one line of text, the German version adds a GOTOXY control code to move the cursor to the second line:

Because of word order differences, the startup dialog needed a complete redesign…

…which required changing the locations of all text and icons in the dialog’s definition:

Images

The startup dialog contains buttons that say “Create”, “Open” and “Quit”. GEOS only provides a limited set of predefined buttons (“OK”, “Cancel”, “Open”, …), so the pixel images of “Create” and “Quit” are supplied by the app and need to be translated as well.

The translated words are longer, so the buttons have to be bigger as well.

Screen Recovery Rectangles

As discussed in part 1 of this series, GEOS uses a custom system to save and recover screen contents that get overwritten by menus and dialogs. Since the sizes and positions of the menus are different, the rectangles that need to be recovered are changed in their table as well:

Date/Time Format

Different cultures/languages use different conventions for the date and time format. The DATE and TIME keywords stamp the current date and time into a page’s header or footer. For the English version, it uses the US format for dates and times:

December 31, 1999  11:59 PM

For the German version, it uses the German format, with German month names:

31. Dezember 1999  23:59

This is the core function to create the date string:

        ; date
        LoadW   r0, dateString
.if DATE_FORMAT=DATE_FORMAT_US
        jsr     getMonthName
        jsr     getDay
.elseif DATE_FORMAT=DATE_FORMAT_DE
        jsr     getDay
        jsr     getMonthName
.endif
        jsr     getYear

The month and day are reversed in the two different formats. The “.” vs. the “,” after the day gets handled by the function getDay:

getDay:
        MoveB   day, r3L
        LoadB   r3H, 0
        jsr     byteToDecimal
        ldy     #0
.if DATE_FORMAT=DATE_FORMAT_US
        lda     #','
.elseif DATE_FORMAT=DATE_FORMAT_DE
        lda     #'.'
.endif
        sta     (r0),y
        IncW    r0
        lda     #' '
        sta     (r0),y
        IncW    r0
        rts

The function to create the time string has some extra logic to convert the hour (0-23) to the range (1-12):

        lda     hour
.if DATE_FORMAT=DATE_FORMAT_US          ; AM/PM
        cmp     #12
        bcc     :+                      ; >= 12?
        sub     #12                     ; then subtract 12
:       cmp     #0
        bne     :+                      ; == 0
        lda     #12                     ; then it's 12
:
.endif
        sta     r3L
        LoadB   r3H, 0

        jsr     byteToDecimal           ; hours
        ldy     #0
        lda     #':'
        sta     (r0),y                  ; ':'
        IncW    r0

        lda     minutes
        sta     r3L
        LoadB   r3H, 0
        lda     #1
        jsr     byteToDecimal           ; minutes

And at the end, there is extra code in the US version to add “AM” or “PM”:

        ldy     #0
        lda     #' '
        sta     (r0),y                  ; space
        IncW    r0

.if DATE_FORMAT=DATE_FORMAT_US          ; AM/PM
        lda     #'A'
        ldx     hour
        cpx     #12
        bcc     :+
        lda     #'P'
:       sta     (r0),y
        IncW    r0
        lda     #'M'
        sta     (r0),y
        IncW    r0
.endif

Number Format

The character used for the decimal separator may differ between languages – “3.14” in an English text would be “3,14” in a German text. Since geoWrite supports “decimal” tab stops that align numbers around the decimal separator, it needs to scan for this character: The English version checks for “.”, while the German version checks for “,”.

Character Set & Encoding

The German language has four extra letters, the umlauts: “ä”/“Ä”, “ö”/“Ö”, “ü”/“Ü” and “ß”.

GEOS Character Encoding

Until the advent of Unicode, operating systems used different character encodings for different languages or scripts.

The English version of GEOS uses the 7 bit ASCII encoding, which contains the 26 letters A through Z, but no umlauts. The GEOS KERNAL has no context of a character encoding, it just blindly draws glyphs that are stored at an index in a font – as long as the index is between 32 and 127, the 7-bit ASCII printable range. The only difference between the English and the German operating system in terms of character encoding are the fonts: Just like the regular fonts, the fonts that come with the German version have 96 characters, but some characters have been replaced by the extra umlauts and the ‘§’ character (important for legal documents). These are the variants of the system font “BSW/9”:

ASCII	German GEOS
@	§
[	Ä
\	Ö
]	Ü
{	ä
\|	ö
}	ü
~	ß

geoWrite doesn’t generally have to care about the encoding either: With the German font set, any version of geoWrite will display German umlauts.

There are two cases where it does have to care though: searching and printing.

Searching

The function to search text has the option of searching for whole words only. For this, geoWrite needs to know which code points are letters or numbers. In English, that’s A through Z and 0 through 9. In German, this must include the umlauts. This is the function that decides on what’s an alphanumeric character:

isAlphanumeric:
        cmp     #'0'
        bcc     @1
        cmp     #'9'+1
        bcc     @yes
@1:
.if CHAR_ENCODING=CHAR_ENCODING_ASCII
        cmp     #'A'
.elseif CHAR_ENCODING=CHAR_ENCODING_DE
        cmp     #'@'
.endif
        bcc     @2
.if CHAR_ENCODING=CHAR_ENCODING_ASCII
        cmp     #'Z'+1
.elseif CHAR_ENCODING=CHAR_ENCODING_DE
        cmp     #']'+1
.endif
        bcc     @yes
@2:     cmp     #'a'
        bcc     @3
.if CHAR_ENCODING=CHAR_ENCODING_ASCII
        cmp     #'z'+1
.elseif CHAR_ENCODING=CHAR_ENCODING_DE
        cmp     #'~'+1
.endif
        bcc     @yes
@3:     cmp     #'_'
        beq     @yes
        clc
        rts

@yes:   sec
        rts

With the German encoding, it includes the three characters after the uppercase ‘Z’ and the four characters after the lowercase ‘z’ (see image of font above). There is a bug in this code though: The German version considers “§” (“@” in the code above) an alphanumeric character, which it isn’t.

Printing

The default is for geoWrite to print pixel images of the pages of a document. But there is also ASCII mode, which sends the plain text to the printer, so the printer can use its built-in fonts. In this mode, the English GEOS sends ASCII-encoded text, that is, its internal representation without any conversion, to the printer driver. If the printer uses a different encoding, the driver has to do the conversion.

German GEOS can’t just send the codes for “§ÄÖÜäöüß” – they would print as “@[]{|}~”. It has to convert them, so that printer drivers can be universal and independent of the system’s language.

This is the code that does the conversion – it is missing from the English version:

convertToCp437:
        ldy     #8
@loop:  cmp     @from-1,y
        beq     @found
        dey
        bne     @loop
        rts
@found: lda     @to-1,y
        rts

@from:  .byte '@','[','\',']','{','|','}','~'   ; source: GEOS_de
@to:    .byte $EB,$8E,$99,$9A,$84,$94,$81,$E1   ; target: CP437
;             'δ','Ä','Ö','Ü','ä','ö','ü','ß'

The eight character codes in German GEOS that differ from ASCII (line @from) are converted to eight codes above 127 (line @to).

The destination encoding is Codepage 437, the standard (and now obsolete) encoding used by the IBM PC and MS-DOS. That is, except for ‘§’, whose CP437 equivalent would be $15, which is a non-printable character in ASCII-based encodings.

The authors of GEOS were free to choose any encoding – it’s really just a convention between applications and the printer drivers. But with CP437, drivers for PC printers of the time can just pipe the data through as is.

Discussion

Modern software usually comes as a single application binary that supports multiple languages, and with support from the operating system can use different conventions for date/time and numbers, and uses Unicode to express and work with any character in any script.

geoWrite is running on a 64 KB system and doesn’t have the luxury of spending code for any of these features – all localization differences are compile-time options. This means that in a multi-lingual environment, there are many limitations:

The English version of geoWrite on a German version of GEOS will support umlauts, but can’t correctly search for words with umlauts or print umlauts in ASCII mode. Besides, some buttons in the UI will be in German.
The German version of geoWrite on an English version of GEOS will not support umlauts, and the characters “@[]{|}~” won’t print correctly in ASCII mode.
Writing an English document in the German version of geoWrite will use the German date/time format, use German month names, and can’t use decimal tab stops with with a ‘.’ as the decimal point.
Writing a German document in the English version of geoWrite (on a German GEOS) has the equivalent problems. In addition, searching for words with umlauts won’t work correctly, and neither will printing umlauts.
Writing a French or Spanish document with any version of geoWrite works, even with accented letters, as long as only fonts are used where the extra letters are added. But the same limitations with date/time, numbers, searching and printing apply.
Good luck with CJK and RTL!
Opening a document in a different language version of geoWrite than it was saved in will break either the umlauts or the “@[]{|}~” characters, as well as data, time and page numbers in headers and footers.

Then again, it would be possible to architect a version of geoWrite with more flexibility:

The code for date, time, numbers and the encoding only differs minimally between the localizations, so the app could support all variants, based on a system setting.
The VLIR architecture of GEOS applications allows dividing code and data into an arbitrary number of records, so every VLIR record of the current geoWrite app could be split into two: one with the code, and one with the strings and UI data structures. Which variant of the UI gets loaded depends on the system language.

The latter point would of course waste space on disk (regular geoWrite is 35 KB, a 1541 disk holds 165 KB) and increase load times of VLIR records.

Inside geoWrite – 5: Copy Protection

Michael Steil — Sun, 06 Sep 2020 09:48:15 +0000

In the series about the internals of the geoWrite WYSIWYG text editor for the C64, this article discusses the geoWrite copy protection.

Article Series

GEOS Copy Protection Strategy

GEOS has one of the most notorious copy protection systems. The system disk contains bit patterns that are very hard to reproduce on a stock disk drive. These are checked on every boot in obfuscated code that is used to decrypt the core operating system code. This way, copies of the GEOS boot disk will not boot. Therefore, GEOS always has to be booted from the original disk, so these would break frequently, which is why GEOS came with a second boot disk, and there was a program to get broken boot disks replaced once both failed.

GEOS maker Berkeley Softworks also created several high-profile GEOS applications like geoPublish and geoCalc, which came with a similar copy protection. Even the bundled apps geoWrite, geoSpell and geoMerge have the same protection.

But with apps, it’s more complicated: On a C64 system with a single disk drive, the app needs to be on the same disk as the document that is being worked on, and since it’s a non-starter to make the user edit all their documents on the original application disk, it needs to be possible to have a copy of the app working on the user’s work disk – and still prevent pirated copies of the app from running.

The idea is to link the boot disk and the application through a serial number. On the very first boot, the GEOS system picks a random 16 bit serial number (excluding zero) and stores it on the boot disk as well as on the backup disk – this is called “installing” GEOS. On the first start of a copy-protected application, it verifies that it’s running from the original disk, and if yes, it takes the system’s serial number and stores it – this is called “installing” an application. On subsequent boots, it does not check for the original disk any more, but it only runs if its stored serial number matches the system’s.

As long as the GEOS boot disk cannot be copied, two users (who both bought GEOS) will have different serial numbers, and installed apps from one user won’t work on a different user’s GEOS. And a copy of a not-yet-installed app will refuse to install itself, because it doesn’t run from an original disk.

Code

Apart from this basic protection concept, geoWrite obfuscates what’s going on by encrypting parts of the code, to make it hard to crack the protection.

Let’s walk through the components of the copy protection in the order of what happens on application startup.

Encrypted Record 1

As discussed in part 1 of this series, the GEOS “VLIR” binary consists of 9 so-called records, which are basically individual code files. Record 0 is the main program, and records 1 through 8 get swapped in and out of memory based on what functionality is needed.

When the application gets started, the first thing the record 0 code of geoWrite does is load record 1, which contains initialization code as well as the copy protection.

        lda     #BANK_1
        jsr     loadCode

All of record 1 is encrypted, so after loading, it decrypts it by XORing every byte with $DE.

        lda     #$EB
        eor     #$35
        sta     @2
        LoadW   r0, MEM_OVERLAY
        LoadW   r1, -4000
        ldy     #0
@1:     lda     (r0),y
@2 = * + 1
        eor     #$00
        sta     (r0),y
        IncW    r0
        IncW    r1
        bne     @1

The decryption constant of $DE gets constructed using $ED XOR $35, for which there is no good reason other than maybe making it harder to search for the value.

After decryption, execution is handed to the record 1 code:

        jmp     MEM_OVERLAY

Installation

The first few instructions of record 1 do things in a way more complicated way than necessary, probably to deter any hackers from looking further:

.,3244  A9 32     LDA #$32
.,3246  48        PHA
.,3247  A9 54     LDA #$54
.,3249  48        PHA
.,324A  A9 3B     LDA #$3B
.,324C  85 21     STA $21
.,324E  A9 6F     LDA #$6F
.,3250  85 20     STA $20
.,3252  6C 20 00  JMP ($0020)

The source makes it clear what’s going on:

        lda     #>(@continue-1)
        pha
        lda     #<(@continue-1)
        pha
        LoadW   r15, initApp
        jmp     (r15)
@continue:

It pushes the address of the code following it as a return address (i.e. minus one) on the stack and jumps to initApp using a vector. This is just a convoluted way of calling initApp and continuing with the code below.

initApp does some initialization and calls checkSerialOrInstall. This is the first part of it:

checkSerialOrInstall:
        lda     serial
        ora     serial+1                ; does app have a serial?
        beq     @install                ; no, then install

        lda     #GetSerialNumber
        jsr     CallRoutine
        CmpW    serial, r0              ; does the app serial match the system's?
        beq     @rts                    ; yes, return

        lda     #txt_serial_mismatch
        jsr     showError               ; no, tell the user
        jsr     swap_userzp
        jmp     EnterDeskTop            ; and exit

@rts:   rts

(For the meaning of swap_userzp, check out part 4 of this series.)

serial is a 16 bit variable that is part of the record 1 code:

serial:
        .word   0                       ; not installed

If it is zero, checkSerialOrInstall jumps to the install logic. Otherwise it gets the system’s serial. (It could just call the GetSerialNumber KERNAL API directly, but instead, it calls it by loading its address into registers and calling CallRoutine, so that a reverse engineer has a harder time finding the call.)

If the system’s serial number is the same, the function returns, and the application can start. If no, it shows an error and exits the app.

Let’s look at the installer code. First, it calls executeDiskBlock, which loads a block from disk and runs it:

@install:
protExecTrack = * + 1
        lda     #0                      ; protection track (stamped in by build system)
        sta     r1L
protExecSector = * + 1
        lda     #0                      ; protection sector (stamped in by build system)
        sta     r1H
        jsr     executeDiskBlock
        beqx    @ok                     ; no error

This block contains the code to verify that this is an original disk and not a copy. (We will discuss all of this in detail in the next section.) If executeDiskBlock returns with X != 0, this signals a failure, and and geoWrite exits:

        lda     #txt_copy_protection   ; then show a non-informative message
        bra     showErrorAndExit        ; and exit to deskTop

Otherwise, the installer now knows that it’s an original disk, so it can stamp in the system’s serial into its own code. So after fetching the system’s serial again, it reads the block from disk that is supposed to contain the app’s serial. This block is part of the record 1 file.

@ok:    lda     #GetSerialNumber
        jsr     CallRoutine             ; get OS serial number to put into app

        MoveW   r0, serial
        LoadW   r4, diskBlkBuf
protSerialTrack = * + 1
        lda     #0                      ; serial track (stamped in by build system)
        sta     r1L
protSerialSector = * + 1
        lda     #0                      ; serial sector (stamped in by build system)
        sta     r1H
        jsr     _GetBlock               ; read sector that contains code with serial
        bnex    @ierror

Note that the track and sector numbers point to the location of this very code on disk. The geoWrite build system creates a disk image with the app on it and stamps track and sector numbers in.

The serial has to be put at the correct location within the block and encrypted with the same XOR $DE that is used for decrypting all of record 1:

@offset = (serial-CODE1) .mod 254 + 2
        lda     serial                  ; get serial low
        eor     #$DE                    ; "encrypt"
        sta     diskBlkBuf+@offset
        lda     serial+1                ; get serial high
        eor     #$DE                    ; "encrypt"
        sta     diskBlkBuf+@offset+1

The offset can be calculated by the assembler: It’s the offset of the serial in the current (record 1) code, modulus 254 (because blocks on disk are 254 bytes), plus 2 (number of header bytes at the start of each block).

        LoadW   r4, diskBlkBuf
        jsr     _PutBlock               ; write back block
        beqx    installOk

        cpx     #WR_PR_ON
        beq     @wperr
@ierror:
        lda     #txt_error_installing
        bra     showErrorAndExit

@wperr: lda     #txt_install_write_protected

showErrorAndExit:
        jsr     showError
        jsr     swap_userzp
        jmp     EnterDeskTop

Finally, the block is written back. If there was an error, a dialog is shown, and the app exits.

If installation was successful, the following code runs:

installOk:
        asl     serial                  ; cycle serial left to obfuscate
        rol     serial+1                ; serial = serial[14..0,15]
        lda     serial
        adc     #0
        sta     serial

        jsr     swap_userzp
        jsr     GetDirHead              ; read BAM block
        jsr     swap_userzp

        MoveW   serial, curDirHead+$BE  ; store serial after "GEOS format V1.x"

        jsr     swap_userzp
        jsr     PutDirHead              ; write BAM block
        jsr     swap_userzp

This writes a copy of the serial with the bits rotated into two unused bytes of the disk’s header block, track 18, sector 0.

The reason for this most probably had to do with ordering broken replacement disks: Once both the system and the backup disks didn’t boot any more, the user was supposed to send in both disks, and would get new disks in return, already installed with the same serial, so that existing apps would continue to function.

Side B of the system disk contains geoWrite, and side B of the backup disk contains geoMerge, both of which had to be installed – the manual explicitly instructs the user to open each app once. So after this, both boot disks contain the obfuscated but plaintext serial on track 18, sector 0, offset $BE of side B. This could then be used to create proper replacement disks. After all, sides A of both disks were broken.

Finally, it shows a success dialog and exits.

        lda     #txt_installed         ; show success
        jsr     showError
        jsr     swap_userzp
        jmp     EnterDeskTop            ; and exit

Disk Signature Check

We skipped over executeDiskBlock, which was called by the installer. First, it initializes the disk:

executeDiskBlock:
        PushW   r1
        jsr     swap_userzp
        jsr     NewDisk
        jsr     swap_userzp
        PopW    r1
        bnex    @rts                    ; I/O error -> fail

Then it reads the block whose track and sector was passed in by the caller. It’s the location of the protection check code that was stamped in by the build system.

        LoadW   r4, diskBlkBuf          ; read block
        jsr     _GetBlock
        bnex    @rts                    ; I/O error -> fail

The last byte of the block is a checksum which is the lower 8 bits of the sum of all payload bytes of the sector:

        lda     #0
        ldy     #2
@loop:  clc
        adc     diskBlkBuf,y            ; checksum bytes $02-$FE
        iny
        cpy     #$FF
        bne     @loop
        cmp     diskBlkBuf+$FF          ; checksum at offset $FF
        beq     @ok

        ldx     #$FF                    ; fail
@rts:   rts

If the checksum matches, the block is run in place in the disk block buffer (diskBlkBuf at $8000):

@ok:    jsr     swap_userzp
        jsr     diskBlkBuf+2            ; execute block
        jsr     swap_userzp
        rts

The protection check block is not part of the VLIR file and not formally referenced anywhere. The build system just writes it to a random free block when it generates the final disk image. In fact, it writes another 6 unused decoy copies of the block.

This is the layout of this block:

00:  00 ff                                             block link pointer

           4c 7a 80                                    jump at entry point

                    ad 0f 18  48 29 df 8d 0f 18 20 16
10:  07 68 8d 0f 18 60 ba 86  49 a9 ee 8d 0c 1c a9 07
20:  85 33 a9 f5 85 32 a5 22  8d f5 07 20 10 f5 a0 02
30:  84 00 20 55 07 a2 10 d0  12 a0 00 20 55 07 a0 45  drive code
40:  20 55 07 20 64 07 a0 0a  20 55 07 20 64 07 ca d0
50:  e8 e8 86 00 60 50 fe b8  ad 01 1c 88 d0 f7 60 ad
60:  01 1c b8 60 ac 00 1c 10  f6 50 f9 b8 ad 01 1c c9
70:  55 f0 f1 c9 67 f0 ed 68  68 60

                                    ad e3 c1 85 03 ad
80:  e2 c1 85 02 ad e1 c1 c9  4c f0 0c a0 00 b1 02 aa
90:  c8 b1 02 85 03 86 02 a2  0a ac 89 84 b9 86 84 29
a0:  bf c9 02 90 24 d0 57 a8  88 88 b1 02 c8 c9 20 d0  computer code
b0:  f9 b1 02 c8 c9 5c d0 f1  b1 02 c8 c9 c2 d0 e9 b1
c0:  02 c8 c9 20 d0 f9 88 d0  11 a0 ff c8 b1 02 c9 85
d0:  d0 f9 c8 b1 02 c9 8b d0  f3 c8 a2 00 b1 02 9d f5
e0:  80 c8 e8 e0 06 d0 f5 20  14 c2 20 5c c2 a9 05 85
f0:  8b a2 07 86 8c 00 00 00  00 00 00 20 5f c2 60

                                                   d7  checksum

The second half of the block is GEOS code that executes on the computer side, and the first part is code that runs on the disk drive. The drive code does the actual disk authenticity check. The job of the computer part is to get the disk drive to run the drive code, and receive the result.

Running the Drive Code

GEOS comes with driver code for the 1541 and 1571 disk drives, which contains logic to upload code to the drive, execute it, and send commands, status messages and block data back and forth. The protection code reuses the driver to execute custom code and retrieve the result.

But the disk drivers don’t export this functionality as an API. Instead of adding this to the disk drivers as a private API, the authors of the protection chose to do some hacky stuff to get to these functions instead. This has the side effect of making it much harder to understand what is going on – which is a plus for protection code.

The computer part needs to call the private functions sendExecuteWithTrkSec and getDOSError in the driver. This is some code in the 1541 driver that calls both functions in sequence:

__NewDisk:
        jsr     EnterTurbo
        bnex    NewDsk2
        jsr     ClearCache
        jsr     InitForIO
        LoadB   errCount, 0
NewDsk0:
        lda     #>Drv_NewDisk
        sta     $8C
        lda     #


The protection code scans the __NewDisk function for the STA $8B and steals the following two instructions.
It gets the pointer to the the function by looking at the GEOS KERNAL’s API jump table entry for the symbol NewDisk:
start:  lda     NewDisk+2               ; find the code that is pointed
        sta     r0H                     ; to by the NewDisk API
        lda     NewDisk+1               ; by reading the operand of the
        sta     r0L                     ; direct/indirect JMP at the
        lda     NewDisk                 ; entry point
        cmp     #$4C                    ; direct or indirect JMP?
        beq     @direct                 ; direct, then we found the code

If the KERNAL jumps directly to the driver code (opcode $4C), the two bytes after the API’s address point to the implementation. If it’s an indirect jump, it resolves the indirection:
        ldy     #0                      ; indirect jump, so
        lda     (r0),y                  ; we need to read the vector
        tax
        iny
        lda     (r0),y
        sta     r0H                     ; and we have a pointer to the code
        stx     r0L

The 1541 and 1571 drivers differ slightly, so it checks which type of driver is running:
@direct:
        ldx     #STRUCT_MISMAT          ; default error code:
        ldy     curDrive
        lda     driveType-8,y           ; what kind of drive is this?
        and     #$BF                    ; ignore the shadow bit (drive cache)
        cmp     #$02                    ; 2: 1571
        bcc     @is1541                 ; less, then 1541!
        bne     @rts                    ; not 1571, then return with error (X != 0)

Let’s look at the 1541 code. (The 1571 is be similar.)
@is1541:
        ldy     #$FF
@loop5: iny
@loop6: lda     (r0),y                  ; search for $85 $8B
        cmp     #$85                    ; (STA $8B)
        bne     @loop5                  ; in NewDisk code
        iny
        lda     (r0),y
        cmp     #$8B
        bne     @loop6

This looks for the STA $8B just before the two calls. It then appends the next two calls to the end of its own code:
        iny

@cont:  ldx     #$00
@loop7: lda     (r0),y                  ; extract 6 bytes
        sta     @code,x                 ; copy into this code
        iny
        inx
        cpx     #6
        bne     @loop7

Finally, it enables the disk driver, and calls the two functions it extracted the pointers of to have the driver execute code at checkProtection in its own RAM and retrieve the status.
        jsr     EnterTurbo
        jsr     InitForIO

        lda     #checkProtection       ; (this sector is at $0700!)
        stx     $8C
@code:  .byte   0,0,0                   ; jsr SendExecuteWithTrkSec
        .byte   0,0,0                   ; jsr GetDOSError
        jsr     DoneWithIO
@rts:   rts

checkProtection is actually $0705, and points into the drive’s buffer at $0700-$07FF, which is where the block was read. So the drive code does not have to be uploaded from the computer – it was the last block read by the driver, and it guaranteed to be located at $0700.
Checking for the Gap Sequence
Tracks on a 1541-formatted disk contain the following sequence of structures for 17 to 21 sectors, depending on the track.

A SYNC mark is followed by a sector header, and after a gap, there is another SYNC mark, followed by the sector’s data and another gap. This is repeated for the next sector.
The GEOS copy protection relies on the fact that the data in the gap, which is irrelevant for normal operation, cannot be reliably written to by stock drives. GEOS boot and application disks contain the following sequence of bytes there:
55 55 55 67 55 55 55 67

The purpose of the drive code is to test that the gap after both the header and the sector data contain only the values 0x55 and 0x67 for 16 consecutive sectors.
Before we look at the main program, let’s look at its two helpers. This is skipBytes. It just reads a certain number of bytes from disk and ignores them.
skipBytes:
        bvc     skipBytes               ; wait for byte
        clv
        lda     $1C01                   ; read it
        dey
        bne     skipBytes               ; y times
        rts

And this is checkSignature, which reads all bytes up to the next sync mark and makes sure that they are all either 0x55 or 0x67:
checkSignature:
        ldy     $1C00                   ; if we found the SYNC mark,
        bpl     @foundSync              ; the check is ok and we're done

        bvc     checkSignature          ; wait until byte ready
        clv

        lda     $1C01                   ; get byte
        cmp     #$55
        beq     checkSignature          ; has to be either signature byte $55
        cmp     #$67
        beq     checkSignature          ; or signature byte $67

        pla                             ; magic not found
        pla                             ; -> return to main code
        rts                             ; (error code remains at "2")

@foundSync:
        lda     $1C01                   ; read value
        clv
        rts

If this enounters a gap byte other than 0x55 of 0x67, it pops the return address from the stack, which returns to the drive’s main code with an error.
The main code starts out with waiting until the read head passes the header of sector 0 of the current track and reading the header. It calls a function in the DOS ROM ($F510) for this:
        lda     #>(buffer-$8000+$0700)
        sta     $33
        lda     #<(buffer-$8000+$0700)
        sta     $32                     ; set pointer to track/sector for ROM call
        lda     $22                     ; current track number
        sta     $07F5                   ; (sector is 0)
        jsr     $F510                   ; ROM call: find and read block header

There are two more bytes that are part of the header that haven’t been read yet (the “OFF” bytes), which have to be skipped:
        ldy     #2
        sty     $00                     ; set default error code 2: "READ ERROR"
        jsr     skipBytes               ; skip 2 bytes

The next bytes to be read are now the sector header’s gap bytes.
The remainder of the code now iterates over 16 sectors, always skipping all header bytes and data bytes, and checking for 0x55 and 0x67 values in the gaps:
        ldx     #16
        bne     @1                      ; check 16 headers and sectors

@loop:  ldy     #<$100                  ; skip a total of
        jsr     skipBytes               ; 325 GCR bytes
        ldy     #$45                    ; = 260 data bytes
        jsr     skipBytes               ; = marker + full block + checksum + filler
        jsr     checkSignature          ; check signature after data block
        ldy     #10
        jsr     skipBytes               ; skip full header
@1:     jsr     checkSignature          ; check signature after header
        dex
        bne     @loop                   ; repeat
        inx
        stx     $00                     ; set error code 1: "OK"
        rts

If checkSignature never failed, a status code indicating success will be set, which the computer part of the protection code will fetch.
Encrypted Code in Record 0
To make cracking the protection harder, there is one more component: Somewhere in the initialization code, record 1 checksums itself, and uses this checksum as a key to decrypt one function in record 0.  So if someone was to crack the protection by changing any of the code in record 1, the checksum would be different, the one function in record 0 would be garbled, and the app would sooner or later crash.
First, the checksum code:
decryptR00:
        LoadW   r0, MEM_OVERLAY         ; checksum code record #1
        LoadW   r1, CODE1_END-CODE1
        LoadB   r2L, 0
        ldy     #0
@loop1: lda     (r0),y
        add     r2L
        sta     r2L
        IncW    r0
        ldx     #r1
        jsr     Ddec
        bne     @loop1

It adds all bytes together and keeps the lowest 8 bits.
There are several bytes within the record 1 code that must not be part of the checksum though: The 2 bytes containing the serial number will change once the app is installed, and the value will be different on every user’s copy. So their values will be subtracted from the checksum again:
        lda     r2L                     ; remove variable bytes from checksum
        sub     serial
        sub     serial+1

Furthermore, the stamped-in track/sector pointers of the block with the signature check code and the block with the serial have to be excluded. This is because of the necessary order in the build process, which looks something like this:

assemble the source of each record as well as the drive code block
encrypt record 1 with a constant of $DE
encrypt one function in record 0 with the checksum of the record 1 plaintext
write the whole geoWrite VLIR file to a disk image
write the drive code block to a free block on the disk image
stamp the track and sector of the drive code block into record 1 on the disk image
stamp the track and sector of the block that contains the serial into record 1 on the disk image

Both track/sector pointers aren’t known until step 6 and 7, but they are part of the checksum in step 3. Therefore, they are also excluded:
        sub     protExecTrack
        sub     protExecSector
        sub     protSerialTrack
        sub     protSerialSector

Now it can decrypt the one function in record 0:
        LoadW   r0, r0_encrypted_start ; decrypt some code in record 0
        LoadW   r1, r0_encrypted_start-r0_encrypted_end
        ldy     #0
@loop2: lda     (r0),y
        eor     r2L
        sta     (r0),y
        IncW    r0
        IncW    r1
        bne     @loop2
        rts

Discussion
While the geoWrite copy protection isn’t as complicated or quite as mean as the one on the GEOS system disks, it is nevertheless effective, and requires quite some effort to be cracked.
Without disassembling through the geoWrite binary, a cracker could search the whole disk for code that looks like it’s checking for the protection. Any code running on the disk and reading bytes from the head manually is a candidate. This is easy to find by looking for LDA $1C01, which would reveal the block with the gap signature check. But it’s checksummed, and the cracker wouldn’t know the algorithm, the range or the location of the checksum unless they had disassembled record 1. Besides, they might change the wrong block by mistake because of the decoy copies of this block on the disk.
So any cracking attempt would require disassembling through the geoWrite code. It is quite straightforward to find the code to load and decrypt record 1, and record 1 can either be decrypted with a small script using the key found in the code, or by dumping the memory contents after decryption.
The first few bytes of record 1 are a simple but clever obfuscation with the chance that the cracker would miss the serial check and installation code. If they do find it, they now know the track and sector of the serial on disk, and the encryption key, so they could change the serial of an installed copy, or deinstall geoWrite on the original disk.
The holy grail would be a cracked version of geoWrite that didn’t care about the serial, so a cracker could just remove the call to checkSerialOrInstall. But this would alter the checksum of record 1, and break the decryption of the one function in record 0, so the app would sooner or later crash. So removing the call to checkSerialOrInstall would also require patching the decryption to take a fixed key instead.
References

ZAK256: GEOS-Kopierschutz (C64 Wiki)
Michael Steil: Copy Protection Traps in GEOS for C64
Michael Steil: Why Do C64 GEOS Boot Disks Break?
Michael Steil: Reconstructing the GEOS 2.0 (de) Master Images from a Pile of Broken Disks



Inside geoWrite – 4: Zero Page
Michael Steil — Fri, 04 Sep 2020 20:25:55 +0000
In the series about the internals of the geoWrite WYSIWYG text editor for the C64, this article discusses how it makes maximum use of the scarce zero page space.
Article Series

The Overlay System
Screen Recovery
Font Management
Zero Page ← this article
Copy Protection
Localization
File Format and Pagination
Copy & Paste
Keyboard Handling

GEOS Zero Page
The MOS 6502 CPU has special encodings for addresses that fit in 8 bits: Instructions that read from or write to addresses $0000 to $00FF in memory are encoded in two instead of three bytes:
a5 28      lda $28
ad 28 00   lda $0028

The two instructions have the same effect, but the first one is one byte shorter, and faster by one clock cycle.
Zero page space is scare and valuable, so it has to be used wisely. This is the GEOS zero page layout, roughly to scale:
-----------------------------------------
$0000  6510 CPU built-in I/O port
-----------------------------------------
$0002  Virtual 16 bit registers
       r0-r15

-----------------------------------------
$0022  Used by GEOS system


-----------------------------------------
$0040  Reserved for GEOS system




-----------------------------------------
$0070  GEOS app space            <<<<<<<<
-----------------------------------------
$0080  Used by GEOS disk driver
-----------------------------------------
$0090  Used by Commodore KERNAL ROM












-----------------------------------------
$00FB  GEOS app space            <<<<<<<<
-----------------------------------------

GEOS designates only a total of 21 bytes to the application. 30 bytes are used by the GEOS KERNAL itself, and 48 bytes are reserved for future versions of GEOS, so these areas are off-limits. The biggest part, 107 bytes, is used by the Commodore KERNAL ROM.
KERNAL Zero Page
The C64’s ROM consists of the 9 KB Microsoft BASIC interpreter and a 7 KB operating system: the Commodore KERNAL. When the machine is in BASIC mode, the KERNAL ROM takes care of the keyboard, the screen, RS232, tape, disks and printers.
For the most part, GEOS does not use the KERNAL ROM at all: It comes with its own keyboard, screen and mouse drivers. With disk drives and printers, it’s more complicated.
Disk drives and printers are daisy-chained on the Commodore Serial Bus. Unfortunately, byte transmission with the original protocol is painfully slow, which is why most applications and games come with their own speeder code which uploads alternative transfer code to the disk drive. GEOS also uses its own disk speeder called diskTurbo.
diskTurbo only replaces the data transmission protocol though, not the IEEE-488 TALK/LISTEN protocol, which is needed to negotiate which device is talking on the bus at which time. So whenever GEOS switches between disk drives, it calls the original code in KERNAL. And printers don’t allow uploading code to replace the bus protocol at all, so GEOS uses the KERNAL for talking to the printer as well.
To keep the original KERNAL happy, GEOS doesn’t touch any of its zero page variables – which is quite generous, since the serial code only touches a small part of the $0090-$00FA area.
In addition, GEOS reserves 16 more bytes for use by the diskTurbo driver at $0080-$008F. The Commodore 1541 driver uses two bytes in this space, for example.
So effectively, almost the whole upper half of the zero page ($0080-$00FA) is blocked because the code to access disks and printers uses parts of it.
geoWrite
Since the $0080+ area is only used by the system during disk and printer accesses, geoWrite can use it whenever it is not using the disk or the printer, as long as it restores its contents whenever it does need to use them.
It does this by swapping the 128 bytes in the zero page with a dedicated buffer. The area can now have one of two sets of contents: the geoWrite contents and the diskTurbo/KERNAL contents:
-----------------------------------------
$0000  6510 CPU built-in I/O port
-----------------------------------------
$0002  Virtual 16 bit registers
       r0-r15

-----------------------------------------
$0022  Used by GEOS system


-----------------------------------------
$0040  Reserved for GEOS system




-----------------------------------------
$0070  geoWrite variables        <<<<<<<<
-----------------------------------------    -----------------------------------------
$0080  geoWrite variables        <<<<<<<<    $0080  Used by GEOS disk driver
                                 <<<<<<<<    -----------------------------------------
                                 <<<<<<<<    $0090  Used by Commodore KERNAL ROM
                                 <<<<<<<<
                                 <<<<<<<<
                                 <<<<<<<<
                                 <<<<<<<<
                                 <<<<<<<<
                                 <<<<<<<< <-swapped->
                                 <<<<<<<<
                                 <<<<<<<<
                                 <<<<<<<<
                                 <<<<<<<<
                                 <<<<<<<<
                                 <<<<<<<<
                                 <<<<<<<<    -----------------------------------------
                                 <<<<<<<<    $00FB  GEOS app space (unused)
-----------------------------------------    -----------------------------------------

This is the code that swaps the zero page area and the buffer:
swap_userzp:
        php                             ; save all registers and flags
        pha
        txa
        pha
        tya
        pha
        ldx     #$7F                    ; $7F..$00
@loop:  ldy     userzp,x                ; load zp byte
        lda     userzp_copy,x           ; load buffer byte
        sta     userzp,x                ; store zp byte
        tya
        sta     userzp_copy,x           ; store buffer byte
        dex
        bpl     @loop
        pla                             ; restore all registers and flags
        tay
        pla
        tax
        pla
        plp
        rts

It saves all registers and flags, so it can easily be called from anywhere in the code without messing up any state. Here is an example of using it:
        LoadW   r0, otherFnBuffer       ; load argument
        jsr     swap_userzp             ; **swap**
        jsr     OpenRecordFile          ; call KERNAL disk API
        jsr     swap_userzp             ; **swap**
        lda     #2                      ; load argument
        jmp     PointRecord             ; load KERNAL API that does not access disk

The OpenRecordFile API call accesses disk, so it’s surrounded by calls to swap_userzp. The geoWrite programmers were very aware of which API calls cause a disk access: The PointRecord API call is about file management as well, but it only updates data structures and does not access disk, so there is no need to swap the zero page.
For all of this to be correct

swap_userzp has to be called as the very first thing when the application launches.
swap_userzp has to be called before exiting the app.
swap_userzp has to be called for every API that may end up calling the disk driver, as well as all printer APIs.
calls to swap_userzp always need to be balanced.
zero page variables at $80+ cannot be accessed between the two swap_userzp invocations.

Adding the two calls to every disk API call is prone to error, and bloats the code, so there are wrapper functions for the commonly used disk access APIs:
_ReadFile:
        lda     #ReadFile-GetBlock
        .byte   $2C                     ; skip next
_ReadByte:
        lda     #ReadByte-GetBlock
        .byte   $2C
_CloseRecordFile:
        lda     #CloseRecordFile-GetBlock
        .byte   $2C
_InsertRecord:
        lda     #InsertRecord-GetBlock
        .byte   $2C
_DeleteRecord:
        lda     #DeleteRecord-GetBlock
        .byte   $2C
_AppendRecord:
        lda     #AppendRecord-GetBlock
        .byte   $2C
_UpdateRecordFile:
        lda     #UpdateRecordFile-GetBlock
        .byte   $2C
_OpenDisk:
        lda     #OpenDisk-GetBlock
        .byte   $2C
_FindFile:
        lda     #FindFile-GetBlock
        .byte   $2C
_GetBlock:
        lda     #GetBlock-GetBlock
        .byte   $2C
_PutBlock:
        lda     #PutBlock-GetBlock
        add     #GetBlock
        sta     @jmp+2
        jsr     swap_userzp
@jmp:   jsr     GetBlock
        jmp     swap_userzp

Each of the wrappers loads the offset of the specific API entry point from the GetBlock entry point, and the common code adds it to the GetBlock address, and uses self-modification to call the API – of course calling swap_userzp before and after.
These wrappers are in the record 0 code, so that any overlay code can call it as well.
Discussion
Like most of geoWrite’s tricks, this is a tradeoff. It gains 123 zero page locations, and its use speeds up the code by maybe a low two-digit percentage and saves maybe 1 KB of code space. On a slow and memory-constrained system like the C64, this is significant. On the other hand, the disk access code gets a bit more complicated (which is countered by the wrappers), and every back-and-forth swap takes about 6000 cycles. But in the context of a disk access, this is negligible.



Inside geoWrite – 3: Font Management
Michael Steil — Thu, 03 Sep 2020 22:25:55 +0000
In the series about the internals of the geoWrite WYSIWYG text editor for the C64, this article discusses the font manager’s system of caches for pixel fonts.

Article Series

The Overlay System
Screen Recovery
Font Management ← this article
Zero Page
Copy Protection
Localization
File Format and Pagination
Copy & Paste
Keyboard Handling

GEOS Fonts Overview
The GEOS operating system contains a rendering library for pixel fonts of up to 63 pt, using its own font file format.
Like most GEOS files, fonts are VLIR bundles that contain one “sub-file” for every point size. This is “California”, which is available at 10, 12, 14 and 18 pt:
California         size
\--- File Header    256
\--- 10             892
\--- 12            1114
\--- 14            1322
\--- 18            2110

To use a font, an application has to explicitly load it into its own memory buffer and activate it using the LoadCharSet API:
    LoadW   r0, otherFnBuffer
    jsr     OpenRecordFile          ; open font file
    lda     #12
    jsr     PointRecord             ; select 12 pt font
    LoadW   r7, FONT_BUFFER
    LoadW   r2, FONT_BUFFER_SIZE
    jsr     ReadRecord              ; read pixel font into memory
    jsr     CloseRecordFile         ; close font file
    LoadW   r0, FONT_BUFFER
    jsr     LoadCharSet             ; activate font

The 9 pt system font (called “BSW”) is always in memory and can be activated using UseSystemFont:
jsr     UseSystemFont

As soon as a font is activated, it can be used for drawing text:

Putchar – draw a character
PutString – draw a zero-terminated string of characters

Font metrics are accessible through:

curHeight (global variable) – font height (in pixels)
baselineOffset (global variable) – baseline offset (in pixels from top)
GetRealSize – query width and height of a given character code

All this API is very basic. The GEOS KERNAL does not help the application with:

enumerating available fonts and sizes: the application has to find font files on disk and decode their metadata.
dynamically caching several fonts in memory: GEOS only knows about a single font at a time.
getting font metrics without loading the font data: the GEOS API for getting the metrics requires the font to be loaded and active.

geoWrite implements all this on the application side.
Enumerating Fonts
geoWrite’s “font” menu shows all available fonts and their point sizes:

To get this information, applications have to find font files on disk and extract it from their metadata.
A font file has a file type of FONT, and its file name is also the font’s name, so that’s what the application will show in the UI.
The API FindFTypes returns an array of file names matching a filename or type, so this is how geoWrite gets the (file) names of the fonts on disk:
    LoadW   r6, fontNames
    LoadB   r7L, FONT               ; file type
    LoadB   r7H, MAX_FONT_FILES
    LoadW   r10, 0                  ; no name filter
    jsr     FindFTypes              ; get font files
    lda     #8
    sub     r7H                     ; number of files found
    sta     numFontFiles

(All code has been edited for readability.)
geoWrite also needs to get the available point sizes for each font, as well as the start track and sector of the data on disk and its size. This way, it can later load the data for a particular point size without having to read any extra metadata again.
In C notation, the data structure that it builds looks like this:
struct {
    uint16_t font_id;
    uint16_t record_size;
    uint16_t start_ts;
} disk_fonts[16][8];

For each of the (up to) 8 font files, there are (up to) 16 point sizes, for each of which geoWrite collects the font ID, the record size and the start track and sector.
A font ID is a GEOS concept that allows applications to use numbers instead of font name strings. It is a 16 bit value that uniquely identifies the font and point size:

The upper 10 bits are a unique ID assigned by Berkeley Softworks, e.g. 3 is a synonym “California”.
The lower 6 bits are the point size (0-63).

This is the main function to extract the metadata of a font:
;---------------------------------------------------------------
; extractFontMetadata
;
; Function:  Read point sizes, track/sector pointers and data
;            sizes for a font file from its file header
;
; Pass:      a   font index (0-7)
;            r0  font filename
;---------------------------------------------------------------
extractFontMetadata:
        pha
        MoveW   r0, r6
        jsr     FindFile                ; get file
        LoadW   r9, dirEntryBuf
        jsr     GetFHdrInfo             ; read file header
        MoveW   dirEntryBuf+OFF_DE_TR_SC, r1
        jsr     ldR4DiskBlkBuf
        jsr     GetBlock                ; read index block
        pla                             ; font index (0-7)
        asl     a
        asl     a
        asl     a
        asl     a                       ; * 16
        tay
        ldx     #0
@loop:  jsr     extractFontIdTrackSector
        jsr     extractFontRecordSize
        iny
        iny
        inx
        inx
        cpx     #FONTS_PER_FONTFILE
        bne     @loop
        rts

It opens each font file and reads its file header and index block. For every point size, it calls extractFontIdTrackSector and extractFontRecordSize.
Here is extractFontIdTrackSector:
;---------------------------------------------------------------
; extractFontIdTrackSector
;
; Function:  Copy a point size ID and its track/sector pointer
;            from the font file header and the index block
;            into the app's data structures.
;
; Pass:      x   font index within font file (0-15)
;            y   fontfile * 16 + fontindex * 2
;---------------------------------------------------------------
extractFontIdTrackSector:
        lda     fileHeader+OFF_GHPOINT_SIZES,x
        sta     diskFontIds,y
        and     #FONT_SIZE_MASK
        sta     r6L                     ; point size
        lda     fileHeader+OFF_GHPOINT_SIZES+1,x
        sta     diskFontIds+1,y
        ora     diskFontIds,y
        beq     @rts                    ; skip empty records
        txa
        pha
        lda     r6L                     ; point size
        asl     a
        tax
        lda     diskBlkBuf+2,x          ; track
        sta     diskFontRecordTrackSector,y
        lda     diskBlkBuf+3,x          ; sector
        sta     diskFontRecordTrackSector+1,y
        pla
        tax
@rts:   rts

A font’s file header contains 16 words at offset OFF_GHPOINT_SIZES that contain font IDs of the different point sizes, which this code copies into its internal data structure. It takes the start track and sectors for each point size from the VLIR index sector.
And this is extractFontRecordSize:
;---------------------------------------------------------------
; extractFontRecordSize
;
; Function:  Copy a font's data size from the font file header
;            into the app's data structures.
;
; Pass:      x   font index within file (0-15)
;            y   fontfile * 16 + fontindex * 2
;---------------------------------------------------------------
extractFontRecordSize:
        lda     fileHeader+OFF_GHSET_LENGTHS,x
        sta     diskFontRecordSize,y
        sta     r2L
        lda     fileHeader+OFF_GHSET_LENGTHS+1,x
        sta     diskFontRecordSize+1,y
        sta     r2H

        CmpWI   r2, MEM_SIZE_FONTS      ; data size too big?
        bcc     @rts
        beq     @rts
        lda     #0
        sta     diskFontRecordTrackSector,y ; then pretend it doesn't exist

@rts:   rts

Similarly, it extracts the data size for each point size.
Caching Font Data
geoWrite can keep up to 8 fonts in memory at the same time and dynamically allocates space for fonts in a 7000 byte buffer.
Fonts are managed with an LRU strategy, meaning that if a new font is supposed to be loaded that wouldn’t fit, the least recently used font will be removed from memory.
The font buffer contains one font immediately after the other: If a font is removed, fonts at higher addresses are moved down to fill the hole. This way, the free space is always at the end and there is no fragmentation.
These are the data structures in C notation:
uint8_t buffer[7000];
struct {
    uint16_t font_id;
    uint16_t data_ptr;
    uint16_t data_size;
    uint16_t lru;
} loaded_fonts[8];

For every loaded font, geoWrite keeps track of its font ID, the pointer to the data in the buffer, the size in the buffer, and its LRU ID.
The main API of the font library is the call setFontFromFile, which allows the application to ask the library to activate a font given its ID. If it’s not already in memory, it will be loaded into the buffer, and if necessary, one or more previously used fonts will be removed from memory.
This is the first part of the function:
;---------------------------------------------------------------
; setFontFromFile
;
; Function:  Set font. If necessary, load from disk and cache it.
;
; Pass:      r1  font ID
;
; Return:    c   =0: success
;                =1: fail, system font was loaded instead
;---------------------------------------------------------------
setFontFromFile:
        CmpW    r1, curFont
        bne     @find
        clc
        rts

@find:  jsr     findLoadedFont          ; is it already loaded?
        bcs     @load                   ; not found
        jsr     updateloadedFontLruId   ; mark it as the latest one that was used
        lda     loadedFontPtrsHi,x
        sta     r0H
        lda     loadedFontPtrsLo,x
        sta     r0L
        jsr     LoadCharSet             ; switch to it
        MoveW   r1, curFont
        clc
        rts

If the requested font is the currently active font, the function does nothing. Otherwise, it checks whether the font is already loaded into memory, and if yes, it just activates it and returns.
This is the implementation of findLoadedFont:
;---------------------------------------------------------------
; findLoadedFont
;
; Function:  Search for font in font buffer.
;
; Pass:      r1  font ID
;
; Return:    c   =0: found
;                    x   index
;---------------------------------------------------------------
findLoadedFont:
        ldx     #0
@loop:  cpx     loadedFontsCount
        beq     @notfound
        lda     loadedFontIdsLo,x
        cmp     r1L
        bne     @1
        lda     loadedFontIdsHi,x
        cmp     r1H
        beq     @found
@1:     inx
        bra     @loop

@notfound:
        sec                             ; failure
        rts
@found:
        clc                             ; success
        rts

It scans the array of loaded font IDs. If the ID is found, the index to be used with the data structures is returned in X.
If the font ID is not currently loaded into memory, setFontFromFile will load it:
;---------------------------------------------------------------
; setFontFromFile
; (continued)
;---------------------------------------------------------------
@load:  jsr     findFontIdOnDisk        ; does the font exist on disk?
        bcs     useSystemFont           ; no, quietly use system font instead
        lda     diskFontRecordTrackSector,x ; does point size exist?
        beq     useSystemFont           ; no, quietly use system font instead
        txa
        pha
        lda     diskFontRecordSize,x    ; r3 = size of font data
        sta     r3L
        lda     diskFontRecordSize+1,x
        sta     r3H
        jsr     allocateFontBufferSpace ; kick out least recently used font(s) if needed
        jsr     updateloadedFontLruId   ; mark it as the latest one that was used
        lda     r1L
        sta     loadedFontIdsLo,x       ; save the ID in the table so the font
        lda     r1H                     ; can be found in RAM again
        sta     loadedFontIdsHi,x
        lda     loadedFontPtrsHi,x      ; r7 = allocated location in RAM
        sta     r7H
        lda     loadedFontPtrsLo,x
        sta     r7L
        pla
        tax
        PushW   r1                      ; save ID
        PushW   r7                      ; save RAM location
        lda     diskFontRecordTrackSector,x; location on disk
        sta     r1L
        lda     diskFontRecordTrackSector+1,x
        sta     r1H
        LoadW   r2, MEM_SIZE_FONTS      ; maximum file size
        jsr     setDevice
        jsr     ReadFile                ; load font data into font buffer
        PopW    r0                      ; read RAM location into r0
        PopW    curFont                 ; read ID into curFont
        cpx     #0
        bne     useSystemFont           ; read error
        jsr     LoadCharSet
        clc                             ; success
        rts

useSystemFont:
        jsr     UseSystemFont
        LoadW   curFont, SYSTEM_FONT_ID
        sec                             ; fail: it's not the font we wanted
        rts

It calls findFontIdOnDisk (not shown) to check whether the information about the available fonts and point sizes on disk contains the requested font ID.
If the ID is found, setFontFromFile calls allocateFontBufferSpace with the required data size to make space for the font in the buffer, and loads it using ReadFile and the track and sector pointer. If anything goes wrong, the system font is activated instead.
This is allocateFontBufferSpace:
;---------------------------------------------------------------
; allocateFontBufferSpace
;
; Function:  Allocate buffer space for a new font.
;
; Pass:      r3  size of font data
;
; Note:      This function cannot fail: It will remove fonts
;            using an LRU strategy until there is space.
;---------------------------------------------------------------
allocateFontBufferSpace:
        ldx     loadedFontsCount        ; no fonts loaded?
        beq     @first                  ; then load it to start of buffer

        cpx     #MAX_FONTS_LOADED
        beq     @remove                 ; too many fonts loaded, remove one

        lda     loadedFontPtrsLo-1,x    ; check for r3 bytes of spaces in font buffer
        clc                             ; (last font pointer + last font size + required size)
        adc     loadedfontDataSizeLo-1,x
        tay
        lda     loadedFontPtrsHi-1,x
        adc     loadedfontDataSizeHi-1,x
        tax
        tya
        add     r3L
        tay
        txa
        adc     r3H
        cmp     #>MEM_SCRRECV
        bne     :+
        cpy     #MEM_FONT              ; load first font to start
        ldy     #

If the new font does not fit into the empty space at the end of the buffer, this function keeps calling unloadLruFont until there is enough space. It then fills the data pointer and size fields for the new font and increments the number of currently loaded fonts.
unloadLruFont is used to make space:
;---------------------------------------------------------------
; unloadLruFont
;
; Function:  Unload the least recently used font and compress
;            the font buffer.
;---------------------------------------------------------------
unloadLruFont:
                                        ; find lowest LRU ID
        ldy     #0                      ; candidate for lowest
        ldx     #1
@loop1: cpx     loadedFontsCount
        beq     @end1                   ; done iterating
        lda     loadedFontLruIdHi,x
        cmp     loadedFontLruIdHi,y
        bne     @1
        lda     loadedFontLruIdLo,x
        cmp     loadedFontLruIdLo,y
@1:     bcs     @2
        txa                             ; current one is lower
        tay                             ; -> update candidate
@2:     inx
        bra     @loop1

@end1:  tya
        tax                             ; lowest index to X

@loop2: inx
        cpx     loadedFontsCount        ; is it the last one?
        beq     @end2                   ; then we're done

        dex
        lda     loadedfontDataSizeHi+1,x; count: size of the one after
        sta     r2H
        lda     loadedfontDataSizeLo+1,x
        sta     r2L
        lda     loadedFontPtrsHi+1,x    ; source: address of the one after
        sta     r0H
        lda     loadedFontPtrsLo+1,x
        sta     r0L
        lda     loadedFontPtrsHi,x      ; target: address of the current one
        sta     r1H
        lda     loadedFontPtrsLo,x
        sta     r1L
        txa
        pha
        jsr     MoveData                ; move the next font down
        pla
        tax
        lda     loadedFontIdsLo+1,x     ; move loadedFontIds
        sta     loadedFontIdsLo,x
        lda     loadedFontIdsHi+1,x
        sta     loadedFontIdsHi,x
        lda     loadedFontLruIdLo+1,x   ; move FontLru
        sta     loadedFontLruIdLo,x
        lda     loadedFontLruIdHi+1,x
        sta     loadedFontLruIdHi,x
        lda     loadedfontDataSizeLo+1,x
        sta     loadedfontDataSizeLo,x  ; move loadedfontDataSize
        clc
        adc     loadedFontPtrsLo,x      ; update fontPtrs
        sta     loadedFontPtrsLo+1,x
        lda     loadedfontDataSizeHi+1,x
        sta     loadedfontDataSizeHi,x
        adc     loadedFontPtrsHi,x
        sta     loadedFontPtrsHi+1,x
        inx
        bra     @loop2                  ; repeat for all fonts above removed one

@end2:  dec     loadedFontsCount
        rts

It finds the lowest LRU ID, i.e. the least recently used font, moves all fonts at higher addresses (and their pointers) down, and decrements the number of loaded fonts.
To keep track of which font is least recently used, updateLoadedFontLruId is called on load and on every activation of a font:
;---------------------------------------------------------------
; updateLoadedFontLruId
;
; Function:  Mark a given font as most recently used.
;
; Pass:      x   font index
;---------------------------------------------------------------
updateLoadedFontLruId:
        lda     fontLruCounter
        sta     loadedFontLruIdLo,x
        lda     fontLruCounter+1
        sta     loadedFontLruIdHi,x
        IncW    fontLruCounter
        bne     @rts

        ; 16 bit overflow: clear LRU ID for all fonts
        ldy     #0
        tya
@loop:  sta     loadedFontLruIdLo,y
        sta     loadedFontLruIdHi,y
        iny
        cpy     loadedFontsCount
        bne     @loop

@rts:   rts

It keeps assigning the next number of a sequence to the given font, guaranteeing that it will always be the highest number and therefore the last one to be removed.
Caching Metrics
When a word processor is dealing with fonts, it does not always need the actual image data for it. Sometimes the fonts metrics are enough, i.e. the height of the font, the baseline offset and the width of the different characters. This is true when selecting text, for example, to know where the character boundaries are, or when reflowing a document.
Caching font data is expensive; the 7000 bytes of geoWrite can hold five 12pt fonts, but only two 24pt fonts. Caching metrics is cheap: The character widths take up only 96 bytes per point size (for the printable ASCII character codes $20-$7F).
geoWrite therefore has an independent cache for font metrics that holds information about the last 8 loaded fonts.
The data structure looks like this:
struct {
    uint16_t font_id;
    uint8_t height;
    uint8_t baseline_offset;
    uint8_t widths[96];
} metrics[8];

So if the application wants to draw characters, it has to call setFontFromFile, which will make sure the pixel data is in memory and activated, but if it only needs the font for measuring, it should call lookupFontMetrics instead:
;---------------------------------------------------------------
; lookupFontMetrics
;
; Function:  Prepare cached font metrics for use.
;
; Pass:      a3  font ID
;---------------------------------------------------------------
lookupFontMetrics:
        ldx     #0                      ; find font id in metricsIds
@loop:  lda     metricsIds,x
        tay
        ora     metricsIds+8,x
        beq     @nfound
        cpy     a3L
        bne     @no
        lda     metricsIds+8,x
        cmp     a3H
        beq     @found
@no:    inx
        cpx     #MAX_FONTS_LOADED
        bne     @loop

        jsr     getMod8Index

        ; not found in metrics cache
@nfound:
        PushB   r1H
        jsr     moveA3R1                ; r1 = font id
        txa
        pha
        jsr     setFontFromFile         ; set font
        pla
        tax                             ; mod8 index
        PopB    r1H
        lda     a3L                     ; store font id in metricsIds
        sta     metricsIds,x
        lda     a3H
        sta     metricsIds+8,x
        lda     curHeight               ; store height in table
        sta     metricsHeights,x
        lda     baselineOffset
        sta     metricsBaselineOffsets,x

        jsr     getCachedFontMetrics
        jsr     calcCharWidths
        rts

@found: jmp     getCachedFontMetrics

; ----------------------------------------------------------------------------
getCachedFontMetrics:
        [...]
        sta     metricsWidths
        [...]
        sta     metricsWidths+1
        lda     metricsHeights,x
        sta     curFontHeight
        lda     metricsBaselineOffsets,x
        sta     curBaselineOffset
        rts

If the metrics for the requested font and point size are in the cache, they will be copied into  curFontHeight, curBaselineOffset and the array metricsWidths. Otherwise, the font’s pixel data is loaded and the metrics are added to the cache using calcCharWidths (not shown).
To get the width of a character after metrics have been looked up, the app can now call getCharWidth:
;---------------------------------------------------------------
; getCharWidth
;
; Function:  Get the width of a specified char of the currently
;            active metrics set (-> lookupFontMetrics).
;
; Pass:      a   character
;            x   currentMode
;
; Return:    a   width
;---------------------------------------------------------------
getCharWidth:
        sub     #$20                    ; ASCII -> table index
        pha
        MoveW   metricsWidths, r14
        pla
        tay
        lda     (r14),y                 ; width
        sta     metricsTmp
        txa                             ; mode
        and     #SET_BOLD
        beq     :+
        inc     metricsTmp              ; add one
:       txa
        and     #SET_OUTLINE
        beq     :+
        inc     metricsTmp              ; add 3
        inc     metricsTmp
:       lda     metricsTmp
        rts

This is basically a reimplementation of the GEOS KERNAL’s GetRealSize API: If the current font style is bold or outline, the width is increased by one or three pixels, respectively.
Conclusion
One goal of a modern operating system and even of many kinds of libraries is to abstract what is going on underneath it. GEOS is a very constrained operating system with only 64 KB of total RAM at its disposal, so it tries to provide as many useful functions as possible (graphics, text rendering, disk access, mouse, printer, …) that fit into 20 KB of code, but in many parts of the system, it barely abstracts the underlying hardware.
GEOS applications are seen as as the natural extension of the operating system, and many features that did not fit into the operating system were implemented in the applications, with full awareness of the details of the filesystem or the file formats of system files.
The GEOS font manager can be regarded as a low-level system library, which deals with the internals of the BAM/VLIR filesystem and the font file format.
References

Michael Farr: The Official GEOS Programmer’s Reference Guide
Berkeley Softworks: The Hitchhiker’s Guide to GEOS
Berkeley Softworks: geoProgrammer User’s Manual
Rebecca G. Bettencourt: GEOS Font Format
Glenn Holmer: GEOS Fonts
GEOS Source Code




Inside geoWrite – 2: Screen Recovery
Michael Steil — Wed, 02 Sep 2020 21:46:43 +0000
In the series about the internals of the geoWrite WYSIWYG text editor for the C64, this article discusses how the app manages to extend its usable RAM by 5 KB using a custom screen recovery solution.

Article Series

The Overlay System
Screen Recovery ← this article
Font Management
Zero Page
Copy Protection
Localization
File Format and Pagination
Copy & Paste
Keyboard Handling

Screen Recovery
Any graphical user interface that allows overlapping items has to deal with the problem of recovery: When dialogs and pull-down menus are shown, they cover parts of the screen, and when they are dismissed, the underlying UI needs to be revealed again.
There are two basic approaches to this:


When the dialog or menu is dismissed, the system calls the same text or image rendering code again that drew it in the first place.


Before the dialog or menu is drawn, the system saves the pixel data that will be overwritten, and after the item is dismissed, the saved pixels get written back onto the screen.


The latter solution requires additional memory, but is a lot faster and doesn’t create a potentially jarring redraw.
Screen Recovery in GEOS
GEOS uses the “restore the pixel data” approach, but with a twist.
Let’s look at the GEOS memory layout again:
-----------------------------------------
$0000  Zero page, stack, system variables
-----------------------------------------
$0400




       Application memory





-----------------------------------------
$6000
       Background bitmap

-----------------------------------------
$8000  System buffers and variables
-----------------------------------------
$9000  Disk driver
-----------------------------------------
$A000
       Screen bitmap

-----------------------------------------
$C000  GEOS KERNAL


-----------------------------------------

The 8 KB of 320×200 monochrome bitmap data resides at $A000-$BF40. In addition, GEOS statically reserves a whole 8 KB just for screen recovery: The “background bitmap” at $6000-$7F40 allows GEOS to recover the whole screen area.
 Imprint and Recover
This background bitmap is really just another (invisible) framebuffer with the same layout as the actual screen bitmap. When saving pixels, they get copied from the screen (“foreground”) bitmap to the same offset in the background bitmap and vice versa.
These two functions are the core of the API:

ImprintRectangle – copy rectangle from fg bitmap to bg bitmap
RecoverRectangle – copy rectangle from bg bitmap to fg bitmap

While applications are free to use the API this way, the system-suggested way is actually different.
Display Buffering
When using the core GEOS API, it’s not actually necessary to save the pixels (ImprintRectangle) before overwriting them. The background buffer already contains a copy!
That’s because all graphics code can be configured to either draw to the foreground bitmap, the background bitmap, or both.
To calculate the offset in the bitmap of a y coordinate, all internal drawing code calls GetScanLine. This function usually returns two pointers in virtual registers r5 and r6:

r5: pointer to the pixel line in the foreground screen
r6: pointer to the pixel line in the background screen

Any code in the GEOS drawing library then stores the pixel data in both the locations pointed to by r5 and r6:
    [...]
    sta (r5),y
    sta (r6),y

For as little as an extra 6 clock cycles for every byte to be stored (which is 8 pixels), the drawing code stores it into both bitmaps, so there is usually no need to copy data from the foreground to the background bitmap.
Now when the system draws a menu or a dialog, it only draws it into the foreground buffer. That’s because the GetScanLine call can actually return different kinds of pointers depending on the global system variable dispBufferOn:



 dispBufferOn        
 r5            
 r6            




 %11000000 (default) 
 fg screen ptr 
 bg screen ptr 


 %10000000           
 fg screen ptr 
 fg screen ptr 


 %01000000           
 bg screen ptr 
 bg screen ptr 



By only setting one of bits 6 and 7, all drawing code can effectively be instructed to only draw to the foreground or the background bitmap. In this case, the two sta instructions will just store the pixel data to the same location twice.
The system default is to draw everything to both bitmaps – except menus and dialogs, which are only ever drawn to the foreground bitmap. There is therefore no need to call ImprintRectangle: All that the built-in code has to do is call RecoverRectangle on the rectangle that it overwrote.
Screen Recovery in geoWrite
geoWrite is a very complex application that is very tight on memory, so it was designed to reclaim as much as possible of the 8 KB normally used for screen recovery. Text rendering is very slow, so redrawing the page as a recovery strategy is out of the question.
Instead, geoWrite stores the saved pixels more efficiently: The buffer only really needs to be as big as is required for the largest rectangle ever saved while in the app. And for geoWrite, that’s dialogs, which are 200×104 pixels. So that’s 2600 bytes instead of 8000 bytes.
The code to save a screen rectangle is called with the following arguments in virtual registers:

r1: the pointer to the recovery buffer
r2L: x ÷ 8
r2H: y
r3L: width ÷ 8
r3H: height

X coordinates have to be divisible by 8, so that the pixels are at byte boundaries.
The Code
The following is the main code, slightly edited for readability. It iterates over all lines of the rectangle and, on save, appends the bitmap bytes to the array of saved data. On recover, it does the opposite.
@loop1: ldx     r2H         ; y coord
        jsr     GetScanLine ; r5 := ptr to bitmap
        lda     r2L         ; x coord / 8
        asl     a
        asl     a
        asl     a           ; * 8
        bcc     :+
        inc     r5H
:       tay
        MoveB   r3L, r4L    ; copy byte count
@loop2: bit     r4H         ; save or recover?
        bpl     @1
        jsr     @recv
        bra     @2
@1:     jsr     @save
@2:     IncW    r1          ; advance buffer pointer
        add     #8          ; account for quirky VIC-II memory layout
        bcc     :+
        inc     r5H
:       tay
        dec     r4L         ; dec byte counter
        bne     @loop2      ; loop for horizontal bytes
        inc     r2H
        dec     r3H         ; dec line counter
        bne     @loop1      ; loop for lines
        rts

This is the subroutine for copying a byte from the screen to the buffer…
@save:  lda     (r5),y      ; read screen byte
        tax
        tya
        pha                 ; save offset
        ldy     #0
        txa
        sta     (r1),y      ; write into buffer
        pla                 ; restore offset
        rts

…and the code for copying a byte from the buffer to the screen:
@recv:  tya                 ; save offset
        pha
        ldy     #0
        lda     (r1),y      ; read from buffer
        tax
        pla
        tay                 ; restore offset
        txa
        sta     (r5),y      ; write onto screen
        tya
        rts

The Tables
geoWrite uses a table with the buffer pointer, x, y, width and height values for each use case, so only an offset within the table has to be passed:
;              r1L/r1H  r2L  r2H  r3L  r3H
;                ptr      x    y wdth hght
scrrecvtabs:
scrrecvtab_geos:
        .word   MEM_SCRREST
        .byte             0,  15,  10, 128
scrrecvtab_file:
        .word   MEM_SCRREST
        .byte             3,  15,   7, 100
[...]

The first item in the table is for saving and restoring the rectangle under the “geos” menu, and so on.
MEM_SCRREST is the address of the start of the recover buffer. The buffer pointer isn’t always MEM_SCRREST though:
scrrecvtab_font:
        .word   MEM_SCRREST
        .byte            17,  15,  10, 114
scrrecvtab_fontsize:
        .word   MEM_SCRREST + 1408
        .byte            26,  15,   9, 114

This is because the font menu has a sub-menu for the point size:

When the font menu is open, the rectangle covered by it is already saved in the buffer. The rectangle under the point size menu has to be saved in the area following the already occupied data.
Memory Layout
Conveniently, the RAM area for the background bitmap immediately follows the application’s memory. Apps are free to just not use the background bitmap at all. By setting dispBufferOn to %10000000 globally, the system will never touch the $6000-$7FFF area.
Here’s a mostly to scale overview of the geoWrite memory map compared to the GEOS default:
       GEOS Default                    geoWrite
-------------------------      -------------------------
$0400                                              $0400

                                     Main Code


       Application memory      -------------------------
                                     Overlay Code  $3244
                               -------------------------
                                                   $4310
                                     Page Data

-------------------------      -------------------------
$6000                                Font Heap     $5E68
       Background bitmap       _________________________
                                     Recovery Data $75D8
-------------------------      -------------------------

geoWrite puts the recovery buffer at the topmost 2600 bytes of the $0400-$8000 window that is available to the application if it doesn’t use the background screen.
The reason for this location is because the range $7900-$7F40 is where GEOS requires the printer driver to be loaded once the application wants to print. It overlaps the background screen on purpose: As long as the application is not printing, so no space for the driver is wasted. And once it is printing, it just can’t use the background buffer any more. The same is true in geoWrite’s model.
Triggers
Aside from setting dispBufferOn to just the foreground, using one’s own save/recover logic requires the app to make sure the save and recover code gets called at the right times.
For dialogs, this is easy: Apps have to explicitly show them by calling the doDlgBox API, so before calling it, geoWrite saves the respective rectangle, and once the API returns, it recovers it.
For menus, it’s more tricky. The application has to trigger on the open event of a sub-menu.
Usually, the data structure of a menu for the doMenu API contains pointers to sub-menu data structures, forming the complete menu tree, like this:
menu_main:
        .byte   0, 14, 0, 191 ; pos & size
        .byte   HORIZONTAL | UN_CONSTRAINED | 3 ; #items

        .word   txt_geos  ; string "geos"
        .byte   SUB_MENU  ; =below is a sub-menu ptr
        .word   menu_geos ; sub-menu data structure

        .word   txt_file
        .byte   SUB_MENU
        .word   menu_file

        .word   txt_edit
        .byte   SUB_MENU
        .word   menu_edit

This way, an application can describe a complete menu tree with just data structures and no code.
But instead of a pointer to a sub-menu (code SUB_MENU), the data structure can alternatively contain a pointer to code that returns a sub-menu data structure (code DYN_SUB_MENU):
        .word   txt_geos     ; string "geos"
        .byte   DYN_SUB_MENU ; =below is a code ptr
        .word   callbackGeos ; code, called on sub-menu open

In this example, whenever the “geos” item is clicked, the callbackGeos function is called, which saves the rectangle that will be covered by the “geos” sub-menu, and returns a pointer to the sub-menu to be presented (edited for clarity):
callbackGeos:
        ldx     #scrrecvtab_geos-scrrecvtabs
        stx     a2L
        jsr     screenSave
        LoadW   r0, menu_geos
        rts

For the recovery of the rectangle, there is actually a GEOS system variable supporting this use case: If the application sets the RecoverVector pointer to anything but 0, closing a menu will call that code instead of doing its own RecoverRectangle-based recovery.
This is where it points in geoWrite (again edited for clarity):
appRecoverVector:
        ldx     a2L
        jsr     screenRecover
        rts

The function reuses the previously set offset in the screen recovery table (in virtual register a2L) for buffer location, the origin and size of the rectangle.
Conclusion
Berkeley Softworks wrote the GEOS operating system as well as many major applications like geoWrite, geoPaint, geoPublish and geoCalc, yet they implemented two different methods for screen recovery: The system way of doing it wastes space, but is very simple to use. And the way they do it in the applications is very much tied to the specific use case, but very optimized.



Inside geoWrite – 1: The Overlay System
Michael Steil — Tue, 01 Sep 2020 20:42:53 +0000
geoWrite is a WYSIWYG rich text editor for the Commodore 64 GEOS operating system, which runs with a total of just 64 KB of RAM. In the series about the internals of geoWrite, this article discusses how it manages to fit 52 KB of code into the available 23 KB of application RAM.

Introduction
GEOS is a disk-based graphical operating system for the Commodore 64 that provides the following features:

applications, desk accessories
disk, printer and mouse drivers
loadable proportional fonts
menu bars, dialogs, file picker
multi-fork filesystem API
misc. library code (math, memory, strings, …)

But since the OS kernel (called the “GEOS KERNAL”) is only 20 KB in size, some of the APIs are very limited, or cumbersome to use, so applications had to do a lot of work one would expect from the OS these days.
The GEOS authors “Berkeley Softworks” also wrote several applications – the OS-included deskTop, geoWrite and geoPaint, as well as geoPublish, geoCalc, geoChart, geoFile and geoDex – which all share low-level functionality that is not in fact part of the operating system, but shared code between these apps, like:

code overlays
custom screen recovery
create/open/exit startup dialog
desk accessory enumeration
font management
zero page management
copy protection

In this series of articles, we will discuss some of the lower-level features that are implemented on the application side, using the example of geoWrite.

The Overlay System ← this article
Screen Recovery
Font Management
Zero Page
Copy Protection
Localization
File Format and Pagination
Copy & Paste
Keyboard Handling

GEOS Memory Map
GEOS and its apps need to fit into the 64 KB of RAM of the C64. Here is a rough overview of the memory map, mostly to scale:
-----------------------------------------
$0000  Zero page, stack, system variables
-----------------------------------------
$0400




       Application memory





-----------------------------------------
$6000
       Background bitmap

-----------------------------------------
$8000  System buffers and variables
-----------------------------------------
$9000  Disk driver
-----------------------------------------
$A000
       Screen bitmap

-----------------------------------------
$C000  GEOS KERNAL


-----------------------------------------

The screen bitmap is an 8 KB RAM area for the 320×200 monochrome screen bitmap. There is a full-size “background” copy used for recovering the main screen contents after closing a menu or a dialog without needing a slow redraw.
The application has 23 KB of memory that it can use for code and data. geoWrite is 52 KB of code, and it needs 2 KB for variables, 7 KB for the current page of text and 6 KB for bitmap fonts. That would be 67 KB…
VLIR Files
In the UNIX world, a file is a mapping of a filename to a sequence of bytes (and some metadata). Some operating systems extend this concept: On classic MacOS, files consist of two of these sequences (the “resource fork” and the “data fork”). NextSTEP and MacOS X can present folders with a tree of individual files as a single “bundle”.
GEOS extends the UNIX-like Commodore filesystem with “VLIR” files, which stands for “Variable Length Index Record”. A VLIR file maps a filename to a 256 byte “file header” (for the icon and extra metadata) and up to 127 records, numbered 0-126. A record is a variable-length sequence of bytes, much like a traditional UNIX file.
You can imagine a VLIR file as a folder with several files in it. The following is a visualization of a typical geoWrite document:
geoWrite Doc
\--- File Header
\--- 0
\--- 1
\--- 2
\--- 61
\--- 62
\--- 64

As you can see, record numbers don’t have to be contiguous. (geoWrite for example stores pages in records 0-60, the header and footer into records 61 and 62, and image data in records 64-126.)
The file header contains the icon, the file type, and some other generic as well as application-specific metadata.
VLIR Applications
GEOS applications can be VLIR files as well. When running an app, the system loads record 0 into memory and executes it. It’s entirely up to the application what to do with the other records.
This is what the geoWrite app looks like:
GEOWRITE           size
\--- File Header    256
\--- 0            10335
\--- 1             2552
\--- 2             3999
\--- 3             2328
\--- 4             1965
\--- 5             3870
\--- 6             3998
\--- 7             3897
\--- 8             1194

The geoWrite main code in record 0 is about 10 KB in size. The operating system loads it to $0400-$2C5E into application RAM.
Code Overlays
This is the memory layout of application RAM for geoWrite:
-----------------------------------------
$0400  Main code (record 0)


-----------------------------------------
$2C5F  Variables
-----------------------------------------
$3244  Overlay code (records 1-7)

-----------------------------------------
$41E4  Variables, page data, font data





-----------------------------------------

The record 0 code loaded by the OS always remains in its slot. It contains the core editing functionality, the overlay manager, the font manager and other library code.
There is a 4 KB slot for “overlay” code, meaning that the record 0 code can swap in any of the records from 1 through 7.

[0 library code, core text editing]
1 initialization, copy protection
2 core text editing
3 cut, copy, paste
4 ruler editing
5 startup/about, create, open, paste text, run desk accessory
6 navigation, search/replace, header/footer, reflow
7 printing
[8 print settings]

(Record 8 is handled differently and is discussed at the end of this article.)
Every record is linked to the same address ($3244) and starts with a jump table, like this one:
CODE5:
    jmp recover            ; 0
    jmp showStartupMenu    ; 1
    jmp renameDocument     ; 2
    jmp openDocument       ; 3
    jmp showAboutDialog    ; 4
    jmp loadDeskAcc        ; 5
    jmp exitToDesktop      ; 6
    jmp readReservedRecord ; 7
    jmp makeFullPageWide   ; 8

The jump table means that the record 0 code can be assembled independently of the overlays. While the overlays access symbols in the record 0 code, record 0 code only calls through these jump table entries.
VLIR API
The GEOS KERNAL has the following calls for working with VLIR files:

OpenRecordFile – Open an existing VLIR file given its name
UpdateRecordFile – Flush the VLIR’s metadata to disk
CloseRecordFile – Flush and close VLIR file
PointRecord – Set current record
PreviousRecord – Move to previous record
NextRecord – Move to next record
ReadRecord – Read complete record into memory
WriteRecord – Write/overwrite complete record from memory image
DeleteRecord – Delete current record

Reading overlay code should therefore be as simple as this:
    ; startup
    LoadW   r0, fnBuffer
    jsr     OpenRecordFile

    ; load overlay code
    lda     #n
loadCode:
    jsr     PointRecord
    LoadW   r7, OVERLAY_ADDRESS
    LoadW   r2, OVERLAY_SIZE
    jsr     ReadRecord

Unfortunately, GEOS can only have one VLIR file open at a time, and a geoWrite document is also a VLIR file. Opening and closing the two files would cause too much disk activity, which is why geoWrite comes with a simple read-only VLIR implementation on the side.
Loading Overlays Manually
On disk, a VLIR file’s directory entry points to it 256 bytes index table. Here is an example:
00 FF  06 13  08 10  09 14  0A 01  0A 12  0A 13  0B 00
0C 02  0D 04  00 00  00 00  00 00  00 00  00 00  00 00
[...]

The 00 FF at the beginning is the Commodore DOS sector header and not part of the data. The remaining pairs of bytes point to the track and sector of the start of each record on disk.
GEOS has an API for loading a file given a track and a sector (ReadFile), so all geoWrite needs to do is read its own index table on startup, and call ReadFile on items of this table when loading an overlay.
Here’s a shortened version of the code to get a copy of the app’s index table:
    ; find application
    LoadW   r6, fnBuffer
    lda     #APPLICATION
    sta     r7L
    lda     #1 ; find max. 1 file
    sta     r7H
    LoadW   r10, appname
    jsr     FindFTypes

    LoadW   r0, fnBuffer
    jsr     OpenRecordFile

    jsr     i_MoveData ; copy index table
    .word   fileHeader+2
    .word   appIndexTable
    .word   2 * NUM_APP_RECORDS
    LoadB   curCodeRecord, $FF
    rts

appname:
    .byte   "geoWrite    V2.1",0

OpenRecordFile reads the VLIR file’s index table info fileHeader. geoWrite then copies NUM_APP_RECORDS into its own table appIndexTable, skipping the first two bytes (00 FF).
And here is a shortened version of the code to read a record:
    ; load overlay code
    lda     #n
loadCode:
    cmp     curCodeRecord
    beq     @rts ; already loaded
    sta     curCodeRecord
    asl     a
    tay
    lda     appIndexTable,y
    sta     r1L
    lda     appIndexTable+1,y
    sta     r1H
    LoadW   r7, OVERLAY_ADDRESS
    LoadW   r2, OVERLAY_SIZE
    jsr     _ReadFile
@rts:
    rts

You can see that the code keeps track of the currently loaded record, so it does not re-load the same code if it’s already in memory.
Managing Overlays
All overlay functionality is implemented in the record 0 code, because it always needs to be accessible.
There is a set of functions for loading the different records:
loadCode1:
    lda     #1
    .byte   $2C ; skip next
loadCode2:
    lda     #2
    .byte   $2C ; skip next
loadCode3:
    lda     #3
    .byte   $2C ; skip next
loadCode4:
    lda     #4
    .byte   $2C ; skip next
loadCode5:
    lda     #5
    .byte   $2C ; skip next
loadCode6:
    lda     #6
    .byte   $2C ; skip next
loadCode7:
    lda     #7
loadCode:
    [...]

The record 0 code can then load an overlay and call a function through its jump table
    jsr     loadCode5
    jsr     J5_showStartupMenu ; OVERLAY_ADDRESS + 3 * 1

Code inside an overlay can’t call code from a different overlay this way, because the loadCode call would overwrite the caller. For this case, the record 0 code has functions like this one:
_showCantAddPages:
    ldy     #

The function showCantAddPages is implemented on overlay 3. Code in overlay 2 can call _showCantAddPages in the record 0 code, which will load overlay 3, call the function, load the original overlay 2 again, and return.
Splitting the Logic
With helper functions in the record 0 code, it is possible to arbitrarily split logic into the different records. But since loading an overlay takes about 2-3 seconds on a 1541 disk drive, this should be minimized.
Central Library Code
The overlay code that we have seen above has to live in the record 0 code, so it’s directly callable by any record.
While in theory any other library code could live in any other record, keeping the most-used functionality in the record 0 code will reduce disk accesses. Here are some examples:

generic dialogs
error dialogs
drive switching (app vs. document)
disk full testing
screen recovery
font management
zero page management
some common text strings

One-time logic
Furthermore, there is code that is only ever needed once. On startup, the following is done:

enumerate fonts and desk accessories on disk
get the page size from the printer
initialize the menu bar
draw the ruler and the page indicator
prepare a file opened for printing only
do the copy protection dance

All this code lives in record 1. It is loaded immediately after the app is started. When it returns, it never gets loaded again.
Main Mode Code
Then, there is code that is needed when the app is in its main mode, like the text renderer and the handlers for navigating on the page, typing text and deleting text.
The main mode code lives in the remainder of record 0 as well as in record 2: During normal text editing, geoWrite always keeps record 2 loaded.
Since functions for menu items, keyboard shortcuts and mouse triggers in main mode are called by the GEOS KERNAL directly, which does not know about banking, at least the entry points of these handlers also have to live in record 0 (or, with restrictions, record 2).
Grouped Functionality
The remaining records contain further functionality, grouped by topic, so they can use common code inside the same record. Here is the list again:

3 cut, copy, paste
4 ruler editing
5 startup/about, create, open, paste text, run desk accessory
6 navigation, search/replace, header/footer, reflow
7 printing

On app launch, the record 0 entry code immediately loads initialization code in record 1. After its return, the record 0 code runs the startup UI code in record 5, which creates a new file or opens an existing file and returns to the core text editor in the record 0/2 code.
Duplicate Code
Common code needed by very different functionality groups usually lives in record 0, so that any record can call it without causing swapping in and out overlays. This is true for most common code, but since space in record 0 is at a premium, a different solution was necessary especially for bigger reusable components.
The startup code for example needs to talk to the printer driver to query the page size, and the print code needs to talk to the printer for printing. They both need to look up and load the printer driver. This common code is too big to fit into record 0, and having the startup code (#1) call out into the printing record (#7) would increase startup time by at least 5 seconds, swapping in #7 and then swapping in #1 again.
The solution is to just duplicate the common code into different records, e.g. by using .include statements to reuse the same code in multiple places. As long as the individual records don’t overflow their 4 KB maximum, this is a reasonable tradeoff.
Other examples are the document file version check, the string-to-int conversion code and common “text scrap” (clipboard) code. Parts of the latter are even included in three different records.
Conclusion
The overlay system has been a feature since the very first version of GEOS and is used by all major applications. It is not without its limits though: An application’s main mode should be responsive and do most of its work without swapping overlays, so the core logic (record 0 and one overlay record) has a certain limit in complexity. The individual overlays have a very tight size limit as well.
For the printing functionality, geoWrite already has to work around the overlay code size, which shows that an app like geoWrite is truly at the limit of what a GEOS application can do on a 64 KB system.



CMDR-DOS: Commodore DOS on FAT32
Michael Steil — Sat, 22 Aug 2020 08:17:36 +0000
All disk drives connected to the Serial Bus of a Commodore 64 speak the Commodore DOS protocol, from the popular 1541 5.25″ drive to the modern sd2iec SD card interfaces. CMDR-DOS is a new and open source implementation of the Commodore DOS protocol, using SD cards with the FAT32 filesystem and supporting advances features like partitions, subdirectories and timestamps – and running on a 65c02!
Commander X16
It is the built-in DOS of the Commander X16, and runs on the main CPU, so the KERNAL API (talk, tksa, untlk, listn, secnd, unlsn, acptr, ciout) calls directly into the DOS implementation. This allows LOAD speeds of about 140 KB/sec on an 8 MHz system.
Demo:

Transcript:
DOS"$=P":REM THERE ARE TWO PARTITIONS ON THIS SD-CARD

255 "CMDR-DOS SD CARD"  MBR
1    "PART1"            FAT32
2    "PART2"            FAT32

READY.
DOS"N1:SYSTEM,1616,FAT32":REM FORMAT PARTITION 1

READY.
DOS"N2:DATA,1617,FAT32":REM FORMAT PARTITION 2

READY.
DOS"$=P":REM THE NEW NAMES OF THE TWO PARTITIONS

255 "CMDR-DOS SD CARD"  MBR
1    "SYSTEM"           FAT32
2    "DATA"             FAT32

READY.
DOS"CP1":REM SWITCH TO PARTITION 1

READY.
DOS"$":REM SHOW DIRECTORY

0 "SYSTEM          " FAT32
99 MB FREE.

READY.
OPEN1,8,2,"HELLO,P,W":PRINT#1,"HELLO WORLD!":CLOSE1:REM CREATE FILE

READY.
DOS"$"

0 "SYSTEM          " FAT32
1    "HELLO"            PRG
99 MB FREE.

READY.
DOS"C:WORLD=HELLO":REM DUPLICATE FILE

READY.
DOS"$"

0 "SYSTEM          " FAT32
1    "HELLO"            PRG
1    "WORLD"            PRG
99 MB FREE.

READY.
DOS"C:HELLO WORLD=HELLO,WORLD":REM CONCATENATE FILES

READY.
DOS"$"

0 "SYSTEM          " FAT32
1    "HELLO"            PRG
1    "WORLD"            PRG
1    "HELLO WORLD"      PRG
99 MB FREE.

READY.
DOS"MD:SECRET":REM CREATE SUBDIRECTORY

READY.
DOS"$"

0 "SYSTEM          " FAT32
1    "HELLO"            PRG
1    "WORLD"            PRG
1    "HELLO WORLD"      PRG
0    "SECRET"           DIR
99 MB FREE.

READY.
DOS"$//SECRET/:":REM SHOW SUBDIR CONTENTS

0 "SYSTEM          " FAT32
0    "."                DIR
0    ".."               DIR
99 MB FREE.

READY.
DOS"CD:SECRET":REM CHANGE TO SUBDIR

READY.
DOS"$"

0 "SYSTEM          " FAT32
0    "."                DIR
0    ".."               DIR
99 MB FREE.

READY.
DOS"C:SECRET HELLO=//:HELLO":REM COPY FILE FROM ROOT TO HERE

READY.
DOS"CD:_":REM CHANGE BACK UP

READY.
DOS"CP2":REM CHANGE TO PARTITION 2

READY.
DOS"$

0 "DATA            " FAT32
98 MB FREE.

READY.
DOS"C:DATA FILE=1//SECRET/:SECRET HELLO":REM COPY FILE FROM PARTITION 1

READY.
DOS"$

0 "DATA            " FAT32
1    "DATA FILE"        PRG
98 MB FREE.

READY.
DOS"$1:":REM SHOW DIRECTORY OF PARTITION 1

1 "SYSTEM          " FAT32
1    "HELLO"            PRG
1    "WORLD"            PRG
1    "HELLO WORLD"      PRG
0    "SECRET"           DIR
99 MB FREE.

READY.
DOS"S1:H*":REM DELETE ALL FILES THERE STARTING WITH H

READY.
DOS:REM THIS WILL SAY THAT "02" FILES WERE DELETED
01, FILES SCRATCHED,02,00

READY.
DOS"CP1":REM CHANGE BACK TO PARTITION 1

READY.
DOS"$

0 "SYSTEM          " FAT32
1    "WORLD"            PRG
0    "SECRET"           DIR
99 MB FREE.

READY.
DOS"S:*":REM DELETE ALL REMAINING FILES

READY.
DOS:REM THIS WILL SAY THAT "01" FILE WAS DELETED
01, FILES SCRATCHED,01,00

READY.
DOS"$":REM THE DIRECTORY IS STILL THERE

0 "SYSTEM          " FAT32
0    "SECRET"           DIR
99 MB FREE.

READY.
DOS"RD:SECRET":REM DELETE IT

READY.
DOS:REM "00" FILES DELETED, BECAUSE DIR WAS NOT EMPTY
01, FILES SCRATCHED,00,00

READY.
DOS"S//SECRET/:*":REM DELETE ALL FILES INSIDE

READY.
DOS:REM "01" FILE DELETED
01, FILES SCRATCHED,01,00

READY.
DOS"RD:SECRET":REM NOW TRY DELING THE DIR AGAIN

READY.
DOS:REM "01" FILES DELETED, IT WORKED THIS TIME
01, FILES SCRATCHED,01,00

READY.
DOS"$

0 "SYSTEM          " FAT32
99 MB FREE.

READY.
REM THAT'S IT. :)

READY.

Source
The implementation is part of the Commander X16 ROM and available here:
https://github.com/commanderx16/x16-rom/tree/master/dos
Future
The codebase is very versatile and could be reused for other kinds of projects:
Other New Retro Machines
CMDR-DOS could be easily ported to other Commodore-like 65c02+ systems like the MEGA65 and the C256 Foenix, providing a DOS interface to FAT32 on those platforms.
sd2iec-like Device
Functionality-wise, the CMDR-DOS codebase is also very similar to what sd2iec does – minus the Commodore Serial part. It could be ported a device like the 1581replica, with an SD card attached instead of a disk drive, and one would have a 65c02-based sdi2ec-like device.



FAT32 Filesystem for the 65c02
Michael Steil — Fri, 21 Aug 2020 07:55:25 +0000
We are presenting the (to our knowledge) first full-featured open source library for 65c02 CPUs for accessing FAT32 formatted disks.

The library supports filesystems from 32 MB to 2 TB, can read and write long filenames, subdirectories and time stamps, and can even create new filesystems.
It decodes a Master Boot Record (MBR) partitioning table and can have multiple partitions mounted at the same time.
It comes with a driver for the SD card protocol, so you can hook it to your own SD card solution; all you have to do is implement your own byte transmission code. If you want to use a VIA 65c22, you can hook up the 65c22 serial port code by the Steckschwein project.
Converting character encodings and matching names is done using callbacks – you can use the X16 implementation for a template.

The API looks like this:
    ; allocate context for filesystem #0
    ; (first MBR partition, mount if necessary)
    lda #0
    jsr fat32_alloc_context
    sta context

    ; open file
    lda #filename
    sta fat32_ptr + 1
    jsr fat32_open

loop:
    ; read and print byte
    jsr fat32_read_byte
    bcc end
    jsr print_character
    jmp loop

end:
    jsr fat32_close

    lda context
    jmp fat32_free_context

    filename:
        .byte '/path/to/file.txt', 0

The implementation uses the 65c02 extensions. With the help of 65c02.inc and some simple search-and-replace, it can be adapted for the 6502 though.
The library was written by Frank van den Hoef, with features (LFN, mkfs, …) added by Michael Steil.
It is the core of the DOS in the Commodore-like Commander X16 retro computer. The source is currently maintained as part of the X16 ROM:
https://github.com/commanderx16/x16-rom/tree/master/dos/fat32
Contributions welcome!



Building the Tynemouth Mini PET
Michael Steil — Sun, 05 Jul 2020 10:03:41 +0000

 Tynemouth Mini PET at FTW8b.com



Ultimate Commodore Charset / PETSCII / Keyboard Reference
Michael Steil — Tue, 09 Jun 2020 09:29:07 +0000
Another addition to the The Ultimate C64 Reference: We’re adding character sets, PETSCII codes and keyboard layouts – supporting eight different Commodore computers.

There are three different (related) modes: Character Sets, PETSCII and Keyboard. The controls on the left switch some global settings:


Character Set lets you select the Commodore 8×8 charset to be used in all modes. The drop-down list contains 84 ROM-extracted charsets.



Control Codes specifies which set of PETSCII control codes will be visualized in the PETSCII table.



Color Scheme matches the charset to the color scheme of a specific computer, or shows it in black-on-white.



Aspect Ratio controls the width-to-height ratio of the character set displayed, showing them either with square pixels, or matching one of the computers.



By clicking one of the radio boxes next to the computer names, the four settings above will be set to match a specific computer.



The checkboxes next to the computer names allow viewing the PETSCII control codes, keyboards and keyboard combinations of multiple computers in all other views.

Character Sets
The Character Sets tab shows all 128 characters of the currently selected charset as well as its inverted form, sorted by screen code. You can click on a character to view its screen code, PETSCII and Unicode values, as well as the keyboard combinations that produce this key on the different machines.

Below, there is a table showing all character sets at the same time. It can be filtered to only show upper case or lower case charsets.

In addition, there is a function that lets you compare two character sets. The middle line shows the XOR of the two charsets. This example shows that several lower case letters were optimized from the C64 to the TED:

PETSCII
The PETSCII tab visualizes the 256 PETSCII codes. Codes 0x00 to 0x1F (the first two rows) and codes $80 to $9F (rows 9 and 10) are control codes, the others are printable. Clicking on a code will reveal the PETSCII value, the screen code and the keyboard combinations. For control codes, it shows the functions on the different computers, and for printable characters it shows the Unicode equivalent.

The table below shows this information for all codes in one place. Here is a part of it comparing some keyboard combinations and control codes between three different computers:

Keyboard
The Keyboard tab shows the keyboard layouts of the different computers and lets you explore which PETSCII codes and characters are generated by which key combinations.

In the screenshot above, you can see the three different keyboards of the computers from the TED series: the C16, the C116 and the Plus/4.
Contributing
Like all web pages of the Ultimate C64 Reference, these view are generated from independent formatted ASCII files. The C64 keyboard file looks like this, for example:

It contains the ASCII-art of the layout, which is converted into SVG graphics by a Python script, the key caps, information about modifiers as well as the scancode-to-PETSCII tables.
The Ultimate C64 Reference is being developed as an open source project at github.com/mist64/c64ref – contributions in the form of additions, corrections etc. are welcome!



Ultimate C64 KERNAL API Reference
Michael Steil — Wed, 03 Jun 2020 21:35:41 +0000
The Ultimate C64 Reference is growing again: This time, we’re adding the KERNAL API reference – as always, in eleven different versions side-by-side.

These are the references that have been adapted for this:

Commodore 64 Programmer’s Reference Guide, ISBN 0-672-22056-3
COMPUTE!’s VIC-20 and Commodore 64 Tool Kit: Kernal by Dan Heeb, ISBN 0942386337
Machine Language Routines for the Commodore 64 and 128 by Todd D Heimarck and Patrick Parrish, ISBN 0874550858
Mapping the Commodore 64 by Sheldon Leemon, ISBN 0-942386-23-X
Commodore 128 intern by Jörg Schieb, Frank Thrun and Heinz Wrobel, ISBN 3-89011-098-3
The almost completely commented C64 ROM disassembly by Lee Davison
Cracking The Kernal by Peter Marcotty in COMPUTE! #40, September 1983, pp. 268-274
Kernal 64 / 128 by Craig Taylor in C= Hacking, Volume 1, Issue 3; July 15, 1992
Commodore 64 standard KERNAL functions by Joe Forster/STA
C64 KERNAL jump table by Frank Kontros
Das neue Commodore-64-intern-Buch by Baloui, Brückmann, Englisch, Felt, Gelfand, Gerits and Krsnik, ISBN 3890113079

You can enable and disable columns by clicking the checkboxes next to the sources, and you can expand/collapse all details with the corresponding button above the table.
Here are four different expanded explanations of the SCNKEY call ($FF9F):

As you can see, KERNAL API symbols as well as zeropage/variable symbols and addresses are cross-referenced and link to the respective description.
Like all web pages of the Ultimate C64 Reference, this table is generated from independent formatted ASCII files. In the case of the KERNAL API, these files look like this:

It consists of three columns: the address in hex, the symbol name and the description in MarkDown format.
The Ultimate C64 Reference is being developed as an open source project at github.com/mist64/c64ref – contributions in the form of additions, corrections etc. are welcome!



Ultimate C64 Memory Map
Michael Steil — Fri, 15 May 2020 19:26:41 +0000
The system software of the Commodore 64 has been extensively reverse-engineered. Next to disassemblies of the ROM, several “memory maps” have been published: tables that document system variables in the first kilobyte of RAM, and how to tweak the system software with PEEK and POKE. Now, I’m presenting the Ultimate C64 Memory Map: A C64 memory reference that shows eight sources side-by-side.

These are the references that have been adapted for this:

Reference from Mapping the Commodore 64 by Sheldon Leemon, ISBN 0-942386-23-X.
German-language reference from Memory Map mit Wandervorschlägen by Dr. H. Hauck, in 64’er Sonderheft 1986/07.
German-language reference from Das neue Commodore-64-intern-Buch by Data Becker, ISBN 3890113079.
Reference by Joe Forster/STA, with awsm’s changes applied.
Comments from the original M6502 BASIC source by Microsoft and the original C64 KERNAL source by Commodore
Reference from Commodore 64 Programmer’s Reference Guide.
Reference as found in Commodore 64 Memory Maps.txt by anonymous.
Reference by Jim Butterfield in COMPUTE! #29 (October 1982).

You can enable and disable columns by clicking the checkboxes next to the sources, and you can expand/collapse all details with the corresponding button above the table. Here are four different expanded explanations of the STATUS byte:

And here is the collapsed version of the range $2B-$48, comparing the comments in the original sources with the Programmer’s Reference Manual:

The symbols (second column) are taken from the original sources. Sometimes, a single memory location has several meanings and thus several symbols. Some descriptions have been adapted to describe the different meanings independently:

KERNAL and BASIC ROM addresses link to the respective spots in the disassembly:

And in the disassembly, zero page addresses (like $CC) and symbols (like BLNSW) link back to the memory map:

The memory map table is generated from independent formatted ASCII files that look like this:

It consists of three columns: the address range in hex, the symbol name and the description in MarkDown format.
The Ultimate C64 Reference is being developed as an open source project at github.com/mist64/c64ref – contributions in the form of additions, corrections etc. are welcome!



Typos in the C64 Programmer's Reference Guide: C3PO or C3P0?
Michael Steil — Sun, 10 May 2020 19:23:36 +0000
The Commodore 64 Programmer’s Reference Guide contains a memory map with a complete description of the zeropage and system variables used by KERNAL and BASIC, but now that we have the original source, we know there are three typos in this table.
C3P0
The first one is the symbol at address $94. Here is the full page from the Programmer’s Reference Guide:

If you look closely, the symbol at $94 is C3PO, with a capital Oh.

But it should end in a zero. Here is the original source:

This is a printout of the original assembly log from 1983:

On the kind of printer Commodore was using, there is no way to tell a 0 from an O. Here is the whole page for context:

The Commodore Serial Bus library uses C3P0 to buffer a byte that is supposed to be sent to the bus – similar to the zero page location R2D2 located at address $A3. Here is some code that uses the two:

So yes, the two symbol names are references to the Star Wars characters C-3PO and R2-D2. It is unknown why the former is spelled with a zero in the source, but the typo in the Programmer’s Reference Guide is very forgivable.
Disappointingly, the R2D2 symbol is not mentioned in the Programmer’s Reference Guide – $A3 is just part of a “Temp Data Area”:

BUFPT
The next typo is the symbol at address $A6, the tape buffer pointer:

The Reference Guide calls it BUFPNT, but it is in fact called BUFPT.

LSXP
And the final typo is the symbol at $C9:

The correct spelling is LSXP:

In fact, in the source, the two bytes have individual labels, LSXP and LSTP. LSXP is the cursor line, and LSTP the cursor column in the context of inputting text, shadowing the zero page addresses TBLX (line) and PNTR (column). The following table summarizes this:



        
 Generic 
 Input  




 Column 
 PNTR  
 LSTP 


 Line   
 TBLX  
 LSXP 



It is confusing that the symbols for the Y coordinates each contain an X. It is unclear what these symbol names are supposed to mean in the first place – if you have any idea, please share them in the comments!



Dumping MiniDisc Media
Michael Steil — Fri, 27 Mar 2020 06:41:56 +0000
Update 2022: Web MiniDisc Pro can access NetMD devices through the browser and can also download protected files!

If you have music on a collection of MiniDisc media and want to finally copy the data off onto modern media (or the cloud!), here are simple instructions for some different solutions:

MZ-RH1 and NetMDPython
MZ-RH1 and SonicStage
Analog Copy
Bonus: Recovering Deleted Tracks

MZ-RH1 and NetMDPython
The best device for digitally dumping MiniDiscs is Sony’s last (and best) MiniDisc recorder, the portable MZ-RH1. Unfortunately, this makes it quite pricy on the used market these days – but you can (and should!) always sell it after you’re done dumping your media…
Dumping
The NetMDPython set of scripts can copy the original ATRAC1-encoded bitstreams off MiniDiscs. You can run it on Windows, macOS or Linux. I will describe the Linux steps with Ubuntu – I would advise Windows and Mac users to set up a virtual machine with Ubuntu, since this is the most reliable way.


First, you need to make sure that you have Git, Python 2 (with crypto support) and libusb installed:
  sudo apt-get install git python2 python-crypto libusb-dev



Then, get the source from the linux-minidisc project:
  git clone https://github.com/glaubitz/linux-minidisc.git



The tools are in linux-minidisc/netmd
  cd linux-minidisc/netmd



Connect your MZ-RH1 to the (virtual) machine, but do not insert a MiniDisc yet. The Linux usb-storage driver would claim any inserted media, so that the tools wouldn’t be able to access it any more. Therefore, you need to remove the usb-storage driver:
  sudo modprobe -r usb-storage



Now make sure that the MZ-RH1 shows up:
  sudo ./lsusb.py

  [...]
  Bus 002 Device 006: ID 054c:0286 Sony Net MD/Hi-MD
  [...]



If your device is detected, you can list the contents of the MiniDisc like this:
  sudo ./lsmd.py

  Disk (writable media) We Are The Night
  Time used: 01:09:50+032 (87.14%)
  14 tracks
  000: 00:01:04+027 sp stereo unprotected No Path To Follow
  001: 00:06:33+039 sp stereo unprotected We Are The Night
  [...]



You can dump the whole media like this:
  sudo ./upload.py

  Storing in We Are The Night
  Uploading ./01 - No Path To Follow.aea
  Done: 10000/1a51a0 (3.80%)
  Done: 20000/1a51a0 (7.60%)
  Done: 30000/1a51a0 (11.40%)
  [...]



Converting ATRAC1 Files
The resulting files are in .aea format, which is the raw data from the MiniDisc, encoded in the ATRAC1 format.
You can convert them to any other audio format using ffmpeg:
    ffmpeg -i file.aea file.wav # decompression
    ffmpeg -i file.aea file.mp3 # lossy recompression
    ffmpeg -i file.aea file.m4a # lossy recompression

Limitations
NetMDPython can not dump tracks that are marked “protected”. This includes all tracks on pressed MiniDiscs, tracks that have been copied from a digital master, and tracks written with the PC software (SonicStage).
MZ-RH1 and SonicStage
Sony’s SonicStage software is an iTunes-like application for Windows that can, among many many other things, transfer most tracks from MiniDisc to the PC’s hard disk.
(The last version is 4.3; the screenshots are showing 3.4. I recommend Windows XP; newer versions might work as well.)

Navigate to the “Transfer” Tab. Your MZ-RH1 should show up in the pane on the right.



By default, SonicStage will recompress dumped data as ATRAC3+. To make sure you don’t lose any sound quality, press the toolbox icon, select “Advanced…” and “Import Settings”, and set the format for importing non-MDLP tracks to “PCM”:





Now you can drag and drop tracks from the right pane to the left one. Right clicking a track on the left and selecting “Properties…” will reveal the location of the file in the filesystem.

Converting OMA Files
The resulting files are in OpenMG Audio (.oma) format, which is Sony’s container format that can contain audio data encoded with one of a number of different codecs. Since we changed the import settings to PCM, they will basically be the same as umcompressed WAV/AIFF files (44.1/2/16), just in a different container.
ffmpeg can also convert .oma files into any other audio format:
    ffmpeg -i file.oma file.wav # lossless container conversion
    ffmpeg -i file.oma file.mp3 # lossy recompression
    ffmpeg -i file.oma file.m4a # lossy recompression

Limitations
SonicStage does not allow dumping the raw ATRAC1-encoded data from the MiniDisc, it always decodes it and stores it in a different format. When decoding to PCM, as shown above, there is no quality loss.
Unlike NetMDPython, SonicStage can dump protected tracks from pressed MiniDiscs, but it also cannot dump tracks that have been copied digitally from a CD, as well as tracks it has written itself – unless it was from the same computer.
Analog Copy
If all else fails, you can always make an analog copy of the audio on a MiniDisc. After all, the kinds of MiniDiscs you care about are probably not digital copies of CDs (you should rather find the original CDs then), but recordings from analog sources anyway – so the extra added noise should be negligible.
The most basic way to make an anlog copy is to connect any MiniDisc player to a computer and using https://www.audacityteam.org for recording. Since modern computers have many things going on at once, you might get skips while recording, so you need to make sure that there is as little load on the computer as possible.
If you have an MZ-RH1 and a second MiniDisc player, you could also connect the two and have the MZ-RH1 record the MiniDisc in the second player onto a 1 GB Hi-MD – uncompressed. You can then use SonicStage or the platform-independent QHiMDTransfer to copy the file(s) over.
The nicest solution is the dump_md.py script from the NetMDPython project, which can remote-control any NetMD-compliant MiniDisc player (such as the MZ-RH1) and record the analog output through the sound card. This way, the audio will be copied as individual tracks.
Bonus: Recovering Deleted Tracks
MiniDiscs allow arbitrarily deleting tracks, and recording new tracks in their place. This is made possible by the TOC: a global data structure that points to the sections on the media that make up the tracks.
When deleting a track, or the entire MiniDisc, the audio data is not touched, only the TOC is modified. By hacking the TOC, it is possible to access all raw data on the media.
Here are the steps: (This is based on the TOC Cloning trick.)

Locate your recorder’s door detection switch. On the SHARP MD-MT15, it is here:



Find a way to keep the switch constantly depressed. In the case of the MD-MT15, you can use part of a toothpick to keep it down.



Take a spare MiniDisc that is at least the same size (60/74/80) as the one you want to recover.
Erase the whole spare MiniDisc. (Don’t erase single tracks!)
Record silence until the whole disk is full.
Remove the spare MiniDisc and take take the batteries out of the recorder.
Insert the spare MiniDisc and put the batteries back in. The media should be detected correctly.
Now replace the spare MiniDisc with the one you want to recover. The recorder should not notice that the door has been opened and will not re-read the TOC.
Press play. The whole 60/74/80 minutes of raw audio data should now play.




The Ultimate Acorn Archimedes Talk [video]
Michael Steil — Mon, 30 Dec 2019 00:09:06 +0000
Matt Evans presented “The Ultimate Acorn Archimedes Talk”, the 8th talk in the “Ultimate Talk” series, at the 36th Chaos Communication Congress (36C3).

Here is the video:




Commander X16: Philosophy and Specification
Michael Steil — Fri, 18 Oct 2019 11:45:31 +0000
I recently got involved in the Commander X16 project. I would like to give an overview of the project and the vision behind it from my perspective.
Philosophy
The Commander X16 is a new 8 bit computer designed by David Murray of the well-known “The 8-Bit Guy” online video channel.
8 bit computers are great for learning about computer architecture, because they are simple enough that they can be fully understood, and they are great for learning to program, because they boot straight into a programming language. But 8 bit computers also have some annoyances like cheap, non-standard keyboards, TV-out and the reliance on floppy disks (or rather expensive SD card adapters).
Therefore, David is designing  a new 8 bit computer in the style of Commodore computers like the VIC-20 and the C64, fixing the annoyances of retro computers by having:

VGA output (in addition to composite)
support for standard PS/2 keyboards (instead of a cheap built-in keyboard with a non-standard layout)
SD card for storage (instead of an additional floppy drive)
RS232 port for efficient cross-developmment
efficient modern power supply

Another problem with retro computers nowadays is the high barrier of entry when developing (semi-)professional software: Since retro computers have been researched for almost 40 years, standards for software are very high. Any credible new game release for the C64 for example will have to come with a fastloader to overcome slow load speeds, and a sprite multiplexer to overcome the 8 sprite limit. The X16 ROM supports fast loading from SD card, and its video chip supports plenty of sprites as well as hardware scrolling of bitmaps. The memory map is kept simple and does not need (or support) complex reconfigurations.
With a rather modest set of hardware features, the X16 targets a lower price point than most comparable projects.
These are the original videos on the topic:

The 8-Bit Guy: My dream computer – Part 1
The 8-Bit Guy: My dream computer – Part 2

Hardware
These are the hardware specs:

65C02 CPU at 8 MHz
40 KB of main RAM
512 KB (or 2 MB) of banked RAM
128 KB of banked ROM
VERA video controller

16-bit class
128 KB of external video RAM
640×480 (or 320×240) pixels
256 colors out of 4096
2 layers supporting tiled and bitmap modes
128 sprites (limit per line based on memory bandwidth)


sound TBD
two 6522 VIA I/O controllers

The computer supports the following connections:

VGA (480p), or Composite/RGB (480i)
PS/2 keyboard
PS/2 mouse
two NES or SNES controllers
SD card
legacy IEC
RS-232

The X16 is not an FPGA-based solution, but uses real, socketed 65C02 and 6522 chips for the same hackability as a retro computer. The video chip is a new design and comes as an FPGA.
Software
From the software side, the Commander X16 feels like a Commodore computer. Its ROM contains the “KERNAL” operating system derived from the C64 version, as well as an enhanced version of Commodore/Microsoft BASIC based on V2.

Consequently, the X16 can be considered a sibling of the computers from the Commodore 8 bit family (PET, VIC-20, C64, CBM2, Plus/4, C128 and C65). It is not meant to be fully compatible with any of these machines, but it is as compatible as a Plus/4 is with a C64: BASIC programs without PEEK and POKE as well as machine code programs that only use the documented KERNAL API (e.g. BSOUT $FFD2) will just work, but existing code that accesses hardware would have to be ported.
The fact that the X16 breaks compatibility with the C64 is what I find particular interesting. Most retro projects try to recreate a classic computer. Users will start their favorite two games and then get bored. The X16 is a new system, with new tricks to discover – but familiar to people who know the C64.
Development
As of October 2019, only a handful of prototype machines exist. If you don’t want to wait for the release hardware, you can use the emulator:
https://github.com/commanderx16/x16-emulator/
There are binary releases for macOS, Windows and Linux on the GitHub releases page, which always include the latest build of the ROM.
The ROM is being developed as open source:
https://github.com/commanderx16/x16-rom/
The reference guide is worked on as an open source project as well:
https://github.com/commanderx16/x16-docs/
And here is a collection of demo/example code contributed to by many people:
https://github.com/commanderx16/x16-demo/
The official forum is unfortunately hosted on Facebook, but there is a lot going on: Users show off programs they have written and discuss programming questions. There is also a forum on David Murray’s website.
The official forum is at commanderx16.com, with people discussing programming questions and showing off new programs.



NES and SNES Controllers on a 6502 (like the C64)
Michael Steil — Sat, 10 Aug 2019 22:52:50 +0000
NES and SNES controllers support 8 to 12 buttons with only three data pins (plus VCC/GND). Let’s attach them to a C64 – or any 6502-based system!

NES Connector
The NES controller needs to be connected to +5V, GND and three GPIOs.
 ----------
| 5  6  7   \
| 4  3  2  1 |
 ------------




 Pin 
 Description 




 1   
 GND         


 2   
 CLK         


 3   
 LATCH       


 4   
 DATA        


 5   
 –           


 6   
 –           


 7   
 +5V         



SNES Connector
The SNES controller’s pins are just like the NES controller’s, but with a different connector.
 /---------------------
| 7  6  5 | 4  3  2  1 |
 \---------------------




 Pin 
 Description 




 1   
 +5V         


 2   
 CLK         


 3   
 LATCH       


 4   
 DATA        


 5   
 –           


 6   
 –           


 7   
 GND         



User Port
The C64 User Port exposes, among other lines, +5V, GND and 8 GPIOs (CIA#2 Port B):
 1 | 2  3  4  5  6  7  8  9  10 | 11 12
--- ---------------------------- -------
 A | B  C  D  E  F  H  J  K  L  | M  N

(viewed towards the C64 edge connector)



 Pin 
 Description 




 1   
 GND         


 2   
 +5V         


 C   
 PB0         


 D   
 PB1         


 E   
 PB2         


 F   
 PB3         


 H   
 PB4         


 J   
 PB5         


 K   
 PB6         


 L   
 PB7         



Connection
Let’s semi-arbitrarily map the signals like this:



 GPIO 
 Description                  




 PB3  
 LATCH (for both controllers) 


 PB4  
 DATA (controller 1)          


 PB5  
 CLK (for both controllers)   


 PB6  
 DATA (controller 2)          



The latch and clock outputs go to both controllers. There is a data line for each controller.
So the connection diagram for two NES controllers looks like this:



 Description 
 User Port Pin 
 NES #1 Pin 
 NES #2 Pin 
 Color  




 GND         
 1             
 1          
 1          
 black  


 +5V         
 2             
 7          
 7          
 red    


 LATCH       
 F             
 3          
 3          
 blue   


 DATA#1      
 H             
 4          
 –          
 green  


 CLK         
 J             
 2          
 2          
 white  


 DATA#2      
 K             
 –          
 4          
 yellow 



And this is the same diagram for two SNES controllers:



 Description 
 User Port Pin 
 NES #1 Pin 
 NES #2 Pin 
 Color  




 GND         
 1             
 7          
 7          
 black  


 +5V         
 2             
 1          
 1          
 red    


 LATCH       
 F             
 3          
 3          
 blue   


 DATA#1      
 H             
 4          
 –          
 green  


 CLK         
 J             
 2          
 2          
 white  


 DATA#2      
 K             
 –          
 4          
 yellow 



In fact, you can attach an NES and an SNES controller in parallel for each of the two slots, as long as you ever only connect one controller per slot.
This is the user port connector with wires attached for two controllers, using the color scheme above:

This is an NES connector attached as the first controller (green data line):

And this is an SNES connector attached as the second controller (yellow data line):

The Code
The code to read both controllers at a time is pretty simple:
; C64 CIA#2 PB
nes_data = $dd01
nes_ddr  = $dd03
;
bit_latch = $08 ; PB3 (user port pin F): LATCH (both controllers)
bit_data1 = $10 ; PB4 (user port pin H): DATA  (controller #1)
bit_clk   = $20 ; PB5 (user port pin J): CLK   (both controllers)
bit_data2 = $40 ; PB6 (user port pin K): DATA  (controller #2)

; zero page
controller1 = $e0 ; 3 bytes
controller2 = $f0 ; 3 bytes

query_controllers:
    lda #$ff-bit_data1-bit_data2
    sta nes_ddr
    lda #$00
    sta nes_data

    ; pulse latch
    lda #bit_latch
    sta nes_data
    lda #0
    sta nes_data

    ; read 3x 8 bits
    ldx #0
l2: ldy #8
l1: lda nes_data
    cmp #bit_data2
    rol controller2,x
    and #bit_data1
    cmp #bit_data1
    rol controller1,x
    lda #bit_clk
    sta nes_data
    lda #0
    sta nes_data
    dey
    bne l1
    inx
    cpx #3
    bne l2
    rts

After calling query_controllers, three bytes each at controller1 and controller2 will contain the state:
; byte 0:      | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
;         NES  | A | B |SEL|STA|UP |DN |LT |RT |
;         SNES | B | Y |SEL|STA|UP |DN |LT |RT |
;
; byte 1:      | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
;         NES  | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
;         SNES | A | X | L | R | 1 | 1 | 1 | 1 |
; byte 2:
;         $00 = controller present
;         $FF = controller not present

A 0 bit means the button is pressed, 1 means it is released.
The code pulses LATCH once, which makes the controllers sample their button states and transmit the first bit through their respective DATA lines. Pulsing CLK 15 more times will make the controllers send the remaining bits. Since SNES controllers send 16 bits of data, but NES controllers only send 8 bits, the type of controller can be detected through the lowermost nybble of byte 1. Similarly, the presense of a controller is detected by continuing to read bits: If a controller is attached, it will send 0 bits.
Repository
The driver code, together with demo code for the C64 can be found at https://github.com/mist64/nes_snes_controller_6502.

More?
For each additional controller, only a single extra GPIO is needed. This way, six controllers would be possible on a single 8 bit I/O port, with only slightly modified code.



The Apollo Guidance Computer [video + slides]
Michael Steil — Sun, 04 Aug 2019 21:28:19 +0000
I re-did an updated version of my part of the Ultimate Apollo Guidance Computer Talk at VCF West 2019.
It was followed by Frank O’Brien‘s talk on the role of the AGC in the Apollo missions.
Here is the video:

And here are my slides in PDF format:
AGC_CHM.pdf (43 MB)



Visualizing Commodore 1541 Disk Contents – Part 2: Errors
Michael Steil — Wed, 29 May 2019 07:17:55 +0000
We have previously visualized the physical layout of C64/1541 disks. In order to understand the encoding and potential read errors, it is more useful to visualize the disks sector-by-sector.

This animation shows 12 regular disks. (Click for original size.)

Every pack of 17-21 lines is a track, numbered 1-41.
Every line within a pack is one sector.
The raw sector contents are drawn from left to right.
The cyan part is the header, the green part the data.
Black is 0, cyan/green is 1.
White represents missing header or sector sections.

The tool to generate these images from .G64 files is available at https://github.com/mist64/visualize_1541.
Header
Let us look at the header first. Here is an animation of the header of tracks 23 to 25 of a few disks:



The header contains six data bytes encoded using the 4-to-5 GCR scheme:

H: Header Code: Every header starts with 0x08 so it can be distinguished from the sector data (0x07). You can see that it is the same bit pattern on all sectors on all tracks.
T: Track: This is the track number (1-35). It is the same for all sectors on the same track.
S: Sector: This is the sector number (0-16/17/18/20, depending on the track). You can see the same bit patterns on each of the three tracks shown.
ID: ID: The two-byte ID should be unique per disk and is used to detect disk changes. It is the same for all sectors on the same disk.
C: Checksum: The checksum is the XOR of C, S, T and ID, so you can see it behaves kind of randomly in the animation above.
GAP: The gap does not contain usable data, but separates the header from the sector data.

Data

The first byte of the data section is the code 0x07 to distinguish it from a header. The rest is the 256 data bytes. The Commodore DOS filesystem uses two link bytes (next track, next sector) at the beginning of every sector, which is why the next two bytes in this visualization look more regular between disks.
Good Disks
First, let’s look at some error-free disks in practice.
Empty Disk
An empty disk looks quite uniform: (Click for original size.)

Only sectors 0 and 1 of track 18 look different:

18/0 contains the block allocation map (BAM) and the name of the disk, and 18/1 contains the first 8 directory entries.
Illegal Bits
The last few bits of the end of the header are also different for sectors 0 and 1. The headers of all sectors directly after formatting the disk end in a pattern of alternating zeros and ones, but as soon as a sector has been written to, this is no longer the case for the last few bits. It gets even more interesting when visualizing these bits across several reads of the same disk:

These are unstable illegal bits on disk, bits that sometimes read back as zeros and sometimes as ones.
When writing a sector, the software in the disk drive waits for the correct sector header to pass by the read head, then waits until the exact end of the header gap area, and starts writing the new data. When switching to write mode, the magnetization written onto the media for the first ~15 µs (4-5 bits) will be analog values between logical 0 and 1. These are technically illegal values, and will be unstable when read.

These illegal bits are benign, because they do not encode anything. But it is important to note that two reads of a good disk may not be identical.
Errors
Now, let’s look at what common errors look like:
Logical Errors
Some read errors are caused by buggy software writing to disk: (Click for original size.)

This disk is missing some sector headers:

This is one of the faulty GEOS boot disks: Some sectors were written with the wrong speed zone setting, so they spilled into the next header, overwriting it. The sector contents are still there, but the missing header makes them unreadable with the original Commodore DOS. (The visualization tool will guess the sector number in case of a sector with a missing header.)
Dropouts
 
There is an example of a dropout:

There is one sector that starts reading back as all 0 bits (black) at some point, and the next sector data is completely missing (white), which is because the SYNC mark of the next sector was not readable.
This can be caused by demagnetization, or more likely, by dirt on the disk’s surface. Sometimes, the dirt will scrape off if we just read the disk often enough. The following animation shows the result of subsequent reads of the same disk:

Here, you can see how the faulty sector on the third track in the picture starts out all-black, with the next sector missing, over unstable data reads, to finally stable and correct reads. You can also see how the previous two sectors are impacted by the same speck of dirt, just not as much.
(The fact that it is different sector numbers that are impacted on the different tracks is due to the fact that on 1541 disks, there is no sector alignment between tracks, i.e. whether sector 0 on one track touches sector 0..20 on the next track is practically random.)
If the dropout persists across retries, it might still be dirt, just of the more resilient kind. Just clean the disk and try again.
Weak Bit Data Errors

This is a disk with multiple checksum errors in the data section across several tracks. Cleaning the disk did not help, so these are weak bits:

Weak bits do not result in flipped bits, but in missing or duplicate bits. As you can see in the animation, sections of the data move back and forth between retries. It is clearer when looking at a single sector. This excerpt contains four weak bits:



The read logic of Commodore disk drives measures the time between zero-to-one and one-to-zero edges and then decides how many zeros or ones were on disk. Therefore, weak bits lead to duplicate or missing bits.
A single weak bit can be recovered from by retrying: If it reads back correctly every now and then, it is as easy as retrying until the checksum is correct. But with multiple weak bits in a single sector, this gets exponentially less likely.
Conclusion
All that current tools can do with faulty disks is retry until the checksum is correct. By analyzing the weak bits, it should be possible to create tools for data recovery on a lower level. Multiple reads will reveal where the weak bits are, so a tool could try out different sequences around the weak bits and verify the checksum.



Commodore "Video Supergame 64" Bundle
Michael Steil — Tue, 28 May 2019 11:40:14 +0000
The “Video Supergame 64” is a Commodore 64 bundled with a joystick and three games on a cartridge, sold mainly in Germany in 1988/1989. Here are some pictures.





The top and bottom as well as the two sides of the box are identical. They all say:


VIDEO SUPERGAME 64

Inhalt:

Contents:

Commodore 64

Joystick

3 Super Games


There is a sticker on one side:


software

by

Epyx

CDS SOFTWARE

COMMODORE


There is absolutely no technical information on the box whatsoever.

Inside the cardboard, there is a styrofoam box with the Commodore logo, and another cardboard box.

The C64 and the power supply are inside the styrofoam.



These are pictures of the storofoam.

The C64 has a red power LED and a light keyboard. These properties varied for the “Supergame” bundle. Note that the packaging shows a brown keyboard.

The label on the bottom states that this is a C64G. Two of the screws are covered with warranty stickers.

The power supply says “FOR C-64 ONLY”. It outputs 5V DC at 1.7A and 9V AC at 1A. There is a sicker saying “7-88”.

The cardboard box contains the UHF video cable, the cartridge, the joystick and the manuals.

The C-1342 joystick was only sold as part of this bundle.

The “Super Games” cartridge contains Colossus Chess 2.0, Silicon Syborgs (a space-themed “Connect Four” style game), and International Football. Note that the packaging of the bundle says nothing about the types of games included. The art implies it was about space ships, race cars and soccer.

These are the cover pages of the C64 and “Supergames” manuals.

The German warranty sheet has a fixed end date of 1989-06-30 and guarantees repair or replacement within 10 days. It also advertizes some periperals:

Commodore Diskettenlaufwerk (Floppy) 1541 II (für 5 1/4“ Diskette)
Commodore Diskettenlaufwerk (Floppy) 1581 (für 3 1/2“ Disketten)
Commodore Datasette 1530
Commodore Monochrom-Monitor 1900 M
Commodore Farbmonitor 1802
Commodore Drucker MPS 1230
Commodore RAM-Expansion 1764
Commodore BTX-Decoder 2
Commodore Maus 1351




Dumping Commodore 64/1541 Disks with Errors
Michael Steil — Mon, 27 May 2019 17:53:53 +0000
Many old Commodore 64/1541 disks have read errors, but this doesn’t mean the data isn’t recoverable – with the right nibtools settings and some cleaning.
ZoomFloppy and nibtools
The ZoomFloppy adapter is the de-facto standard for connecting Commodore disk drives to modern computers, using a USB connection.
There are two ways to read a disk image:

d64copy (opencbm) will read the contents of a disk sector by sector and create a .d64 image file. This works with all Commodore drives through the serial cable.
nibread (nibtools) will read the raw bits of each track without any interpretation and create a .nib/.nbz file (which can be converted into a standard .g64 file). This requires a 1541 with a parallel port mod or a 1570/1571 with just a serial connection.

The main use for .g64 files is to preserve copy-protected games, which often use custom on-disk structures that would not be or sometimes cannot even be preserved in a .d64 image. In the context of reading disks that may have read errors, “nibbling” into .nbz/.g64 preserves all the information that could be read, which gives us better insight into what happened, and may allow us to recover more data.
This article covers reading the raw data using nibread, but most of the information translates to reading the decoded data using d64copy.
Error-free Dump
Let’s first look at the output of nibread for an error-free disk. The command nibread disk123 will create disk123.nbz and print this:
   1.0: (3) 7818 [CBM OK] (weakgcr:3)
   2.0: (3) 7819 [CBM OK] (weakgcr:6)
   3.0: (3) 7819 [CBM OK] (weakgcr:4)
   4.0: (3) 7819 [CBM OK] (weakgcr:9)
   5.0: (3) 7819 [CBM OK] (weakgcr:5)
   6.0: (3) 7819 [CBM OK] (weakgcr:4)
   7.0: (3) 7819 [CBM OK] (weakgcr:3)
   8.0: (3) 7819 [CBM OK] (weakgcr:6)
   9.0: (3) 7819 [CBM OK] (weakgcr:1)
  10.0: (3) 7819 [CBM OK] (weakgcr:3)
  11.0: (3) 7819 [CBM OK] (weakgcr:4)
  12.0: (3) 7819 [CBM OK] (weakgcr:5)
  13.0: (3) 7819 [CBM OK] (weakgcr:3)
  14.0: (3) 7819 [CBM OK] (weakgcr:9)
  15.0: (3) 7819 [CBM OK] (weakgcr:5)
  16.0: (3) 7819 [CBM OK] (weakgcr:3)
  17.0: (3) 7819 [CBM OK] (weakgcr:4)
  18.0: (2) 7148 [CBM OK] (weakgcr:2)
  19.0: (2) 7148 [CBM OK] (weakgcr:8)
  20.0: (2) 7148 [CBM OK] (weakgcr:4)
  21.0: (2) 7148 [CBM OK] (weakgcr:5)
  22.0: (2) 7148 [CBM OK] (weakgcr:2)
  23.0: (2) 7149 [CBM OK] (weakgcr:6)
  24.0: (2) 7148 [CBM OK] (weakgcr:1)
  25.0: (1) 6673 [CBM OK] (weakgcr:4)
  26.0: (1) 6673 [CBM OK] (weakgcr:3)
  27.0: (1) 6673 [CBM OK] (weakgcr:1)
  28.0: (1) 6673 [CBM OK] (weakgcr:6)
  29.0: (1) 6673 [CBM OK] (weakgcr:5)
  30.0: (1) 6673 [CBM OK] (weakgcr:4)
  31.0: (0) 6255 [CBM OK]
  32.0: (0) 6255 [CBM OK]
  33.0: (0) 6255 [CBM OK] (weakgcr:2)
  34.0: (0) 6255 [CBM OK] (weakgcr:4)
  35.0: (0) 6255 [CBM OK] (weakgcr:2)
  36.0: (1!=2 NOSYNC!) 0 [Unformatted Track]
  37.0: (1!=2 NOSYNC!) 0 [Unformatted Track]
  38.0: (1!=2 NOSYNC!) 0 [Unformatted Track]
  39.0: (1!=2 NOSYNC!) 0 [Unformatted Track]
  40.0: (1!=2 NOSYNC!) 0 [Unformatted Track]
  41.0: (1!=2 NOSYNC!) 0 [Unformatted Track]

Tracks 1 to 35 show “CBM OK’, which means that there are no errors. Tracks 36 to 41 are unformatted, which is the usual case for data disks. You can pass -E35 to nibread to save a few seconds on each disk if you are certain it doesn’t use the extra tracks.
“weakgcr” means that there is data on the track that does not consist of legal GCR-encoded bit combinations. This is normal for the gaps, especially the tail gap, i.e. the unused areas of a track.
You can then use nibconvert to create a standard .g64 image – or, if there are no errors and no copy protection, a .d64 image.
Error Codes
Errors are shown like this:
  10.0: (3) 7898 [E5S16]

E5S16 means there was an error 5 when decoding sector 16. Here is the full list of error codes:



 Controller Code 
 Description          
 DOS Code 




 2               
 Header not found     
 20       


 4               
 Data not found       
 22       


 5               
 Data checksum error  
 23       


 9               
 Header checkum error 
 27       



The nibtools error codes are the same as the 1541 “controller” error codes. The DOS codes are the “READ ERROR” codes returned by the 1541 status channel.

The most common error is number 5: About 90% of a track consists of the actual sector data, and any incorrectly read bit will cause an error 5.
Tracks with more serious read problems often show an error 2 or 4: This usually means that large chunks of the track were unreadable, so that the header/data markers (“SYNC”) could not be found.

For a description of the on-disk format of sectors and more information on common errors, have a look at this article.
Checksums
The on-disk format of 1541 disks uses 8-bit checksums to protect the integrity of headers and data sections. It is calculcated as an XOR of all data bytes.
nibread will flag an error if any of the checksums is incorrect, and retry reading up to the specified retry count (default 10).
An 8 bit checksum is not very strong: The probability of an undetected read error is 1:256. In practice, it is good enough that it should be trusted for most cases. But if a disk has dozens of read errors, it is not a solution to set the number of retries to 1000 and leave it running overnight!
Weak Sector
The most common case is an error that goes away after a few retries:
  10.0: (3) 7898 [E5S16]
        (3) 7898 [E5S16]
        (3) 7898 [CBM OK] (weakgcr:3)

In this example, the second retry was successful. The incorrect read was either caused by dirt that rubbed off easily, or by a weak bit.
Bad sector
But sometimes an error does not go away. Here is an error 2 that persists after 10 retries, the nibread default:
   1.0: (3) 7692 [E2S9]
        (3) 7691 [E2S9]
        (3) 7690 [E2S9]
        (3) 7691 [E2S9]
        (3) 7691 [E2S9]
        (3) 7692 [E2S9]
        (3) 7692 [E2S9]
        (3) 7691 [E2S9]
        (3) 7691 [E2S9]
        (3) 7692 [E2S9]
        (3) 7691 [E2S9] (weakgcr:34)

As always, the error could be caused by weak bits, or by dirt¹.
You could increase the number of retries, which works in some cases, but if the error does not go away after maybe 20 retries, it is best to first cleaned the disk.
Cleaning the Disk
The most common reason thirty year old disks have read errors is that some of their surface material became loose and is now a thin layer of dust that interferes with the read head.
An alcohol-based cleaning wipe is the easiest method to clean the disk. For this, you need to remove the top of the drive’s case. It’s good practice to keep it unscrewed on the device you are using for dumping disks anyway.
The read head is on the bottom, and there is a spring-loaded counter-weight on the top, which can be easily lifted, so that the cloth can be put between it and the disk’s surface:

Note that this always cleans the side of the disk that is opposite of what is expected based on the orientation of the disk on the drive. In other words, to clean side A, you have to put in the disk as if you wanted to read side B.
A 1571 has a read head on both sides, and the top one cannot be lifted very far, so you have to be careful not to break anything.
Then, instruct nibread to do one pass across all tracks, ignoring errors:
  nibread -e0 /tmp/x

You should do this several times, so that the cloth gets rubbed over every track multiple times.
These cleaning cloths dry out very quickly. Instead of using new cloths every time, you can use an alcohol-based disinfection spray to add moisture to them again.
Trying Again
Before trying again, you should make sure that the disk’s surface has dried. Otherwise, you will get something like this, possibly on a track that was fine before:
  12.0: (3) 7882 [E5S0]
        (3) 7882 [E5S5]
        (3) 7880 [E5S8]
        (3) 7881 [E5S0][E5S5][E5S8][E5S11]
        (3) 7880 [E5S2][E5S5][E5S8][E5S10][E5S12][E5S16]
        (3) 7882 [E5S1][E5S2][E2S7][E5S11]
        (3) 7882 [E5S1][E5S8][E5S9][E5S10][E5S12][E5S16]
        (3) 7882 [E5S2][E5S3][E5S5][E5S10][E5S11][E5S13][E5S15]
        (3) 7881 [E5S2][E5S10][E5S11][E5S13][E5S14][E2S18]
        (3) 7882 [E5S4][E5S5][E5S9][E5S10][E5S11][E5S12][E5S13][E5S14][E5S16]
        (3) 7881 [E5S1][E5S2][E5S4][E5S5][E5S8][E5S10][E5S11][E5S12][E5S13][E5S14][E5S16] (weakgcr:11)

On every read, there were different errors, which indicates that the errors aren’t caused by the contents of the disk. Let it dry, then try again. (It could also indicate a dirty read head – see below.)
If you are lucky, the track will read without any errors after cleaning:
   1.0: (3) 7691 [CBM OK] (weakgcr:37)

Or maybe it just reads a little better, because it was a combination of dirt and weak bits.
In any case, now is the time that you can increase the number of retries with the -e argument. Numbers up to 100 are reasonable; in practice, there is rarely a good read after 30 retries.
And here is an example of a track with multiple errors that, after cleaning, read correctly after 26 retries:
  28.0: (1) 6638 [E5S5][E2S6]
        (1) 6639 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6639 [E5S5][E2S6]
        (1) 6638 [E5S5][E9S6]
        (1) 6638 [E5S5][E9S6]
        (1) 6638 [E2S6]
        (1) 6638 [E5S5][E9S6]
        (1) 6638 [E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E2S6]
        (1) 6638 [E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6638 [E5S5][E2S6]
        (1) 6639 [E5S5][E2S6]
        (1) 6638 [CBM OK] (weakgcr:5)

Dirty Read Head
The dirt on a disk can stick to the read head, which will interfere with all reads. Here is an example of a disk dump with a dirty head:
   1.0: (3) 7801 [CBM OK] (weakgcr:21)
   2.0: (3) 7882 [CBM OK] (weakgcr:2)
   3.0: (3) 7882 [CBM OK] (weakgcr:5)
   4.0: (3) 7775 [CBM OK] (weakgcr:21)
   5.0: (3) 7694 [CBM OK] (weakgcr:29)
   6.0: (3) 7693 [CBM OK] (weakgcr:27)
   7.0: (3) 7693 [CBM OK] (weakgcr:27)
   8.0: (3) 7695 [E5S2][E9S13]
        (3) 7694 [CBM OK] (weakgcr:23)
   9.0: (3) 7696 [E5S3][E9S5][E9S7][E2S10][E5S18][E2S20]
        (3) 7696 [E9S7][E2S10]
        (3) 7696 [E9S6][E9S7][E5S8][E2S10][E2S15][E2S20]
        (3) 7695 [E5S0][E2S2][E9S4][E9S6][E9S7][E9S8][E9S9][E5S10][E9S20]
        (3) 7696 [E5S2][E5S4][E9S6][E2S7][E9S8][E9S9][E2S10][E2S11][E2S12][E9S18][E5S19][E9S20]
        (3) 7695 [E9S0][E9S1][E9S2][E9S3][E9S4][E9S5][E9S6][E9S7][E2S8][E9S9][E2S10][E2S11][E5S13][E5S14][E2S15][E2S16][E9S17][E2S18][E5S19][E9S20]
        (3) 7693 [E9S0][E9S2][E5S3][E2S4][E2S5][E5S6][E9S7][E9S8][E2S9][E2S10][E2S11][E5S12][E9S13][E2S15][E2S16][E2S17][E9S18][E5S19][E9S20]
        (3) 7694 [E5S1][E9S2][E2S3][E9S4][E2S5][E9S6][E2S7][E2S8][E2S9][E2S10][E9S11][E2S12][E9S14][E5S15][E5S16][E5S17][E5S18][E9S19][E2S20]
        (3) 7691 [E5S0][E2S1][E2S2][E9S3][E2S4][E2S5][E9S6][E2S7][E2S8][E2S9][E2S10][E2S11][E2S12][E5S13][E9S15][E2S16][E2S17][E9S18][E2S20]
        (3) 7690 [NDOS]  (weakgcr:29)
  [...]

Again, if the errors are radically different between retries, it cannot be an issue of the disk’s surface.
A clean read head looks like this:

And a dirty one like this:

You can use the same alcohol-based wipes to clean the read head.
Conclusion
In my experience, more than half of old disks with read errors can be read correctly again after cleaning them.
If the errors don’t go away, and it is a commercial disk, there is of course usually the possibility of buying another copy, and even if that one has errors as well, the data can be spliced together.
Otherwise, the tool g64conv can help: It converts .g64 files into a textual representation, which allows you to inspect what exactly went wrong, and may give you the opportunity to extract some information from sections with errors.




Errors can also be cause by incorrectly written on-disk data structures, e.g. because of a buggy driver, or an error when duplicating the disk. We will ignore these kinds of errors in this article.↩






Commodore “Magic Voice” Cartridge for C64
Michael Steil — Sun, 26 May 2019 10:02:50 +0000
“Magic Voice” is an expansion cartridge for the Commodore 64 that can speak 235 predefined words. Here are some pictures.

Magic Voice does not combine words from phonemes, but contains 235 prerecorded words in a 16 KB ROM. It was meant to be used with tape, disk or cartridge based software that made use of the extension, but few titles were released.
Without any extra software, the speech features are accessible through added BASIC commands.
The canceled Commodore V364 was supposed to ship with the same speech Toshiba T6721A chip and a similar driver ROM.

The case has a passthrough connector on top, which allows using the speech feature in cartridge-based applications. The label says “Commodore Magic Voice”.

There is no label in the designated area on the bottom. A small sticker says “MADE IN HONG HONG”. It also contains the names of the two connectors on the side: “OUT” and “IN”.

The Audio In (red) and Audio Out (black) RCA connectors are on the side.

The board contains

a MOS 6525A TPI I/O controller (also used in the IEEE-488 cartridge, the 1551 disk drive, as well as other devices)
a 16 KB EPROM: This contains the compressed speech data and the C64 code.
a gate array (address decoding, parallel-serial-conversion)
the Toshiba T6721A speech chip


Emulation
VICE supports the Magic Voice cartridge by giving it the ROM from zimmers.net:
  x64 -magicvoiceimage 251476.bin

Then you can make it speak all words with
  FOR I = 0 TO 255: SAY I: PRINT I: NEXT

References

http://www.stefan-uhlmann.de/cbm/MVM/index.html
http://www.stefan-uhlmann.de/cbm/MVM/Repair/index.html
https://binarium.de/commodore_magic_voice_speech_module_c64
http://www.floodgap.com/retrobits/ckb/secret/operiph.html
http://www.6502.org/users/sjgray/computer/magicvoice/index.html
https://www.c64-wiki.de/wiki/Magic_Voice
http://www.zimmers.net/anonftp/pub/cbm/schematics/cartridges/c64/magic-voice/index.html
http://cbm-hackers.2304266.n4.nabble.com/Magic-Voice-Schematics-td4067465.html




Falk Rehwagen, TopDesk and GEOS
Michael Steil — Sat, 25 May 2019 13:22:09 +0000
Here are some of Falk Rehwagen’s thoughts on his involvement with GEOS and TopDesk.

It’s not so easy to remember the details after more than 25 years 

GEOS

Back then, I was fascinated by what one could get out of this outdated hardware, many years after its release. As for GEOS, I found it exciting that it was possible to develop a real commercial operating system on such a small machine, with a variety of powerful applications, tools, drivers, and so on. Naturally, I bought GEOS as soon as it was possible, paid with my family’s welcome money in Berlin, 1990. 

Programming & GEOS User Club

That’s when I got very involved with GEOS (64) and its possibilities on my C128, and it quickly became clear that I wanted to develop applications myself. I a found good and fast entry with the MegaAssembler, which paved the way deep into GEOS development. As part of this learning, I created a diverse set of applications, which, compiled as a “best of” disk, we also offered for sale. In this context, in 1991/92, I got into contact with the Geos User Club, the association of all GEOS users and fans in Germany. The “GUC-Regio-Sachsen” was founded, and in 1992, I participated in the GUC Annual General Meeting for the first time. If I remember correctly, this was also the time when TopDesk was almost ready as a the modern replacement for the “deskTop” built into GEOS. It was absolutely fascinating, because the windowing technology once again showed what can be done with GEOS and the hardware and brought the system closer to the more modern competitors. TopDesk was delayed, but eventually I held it in my hands and used it as my exclusively GEOS control center.

Patch System and GeoCOM

I was keen to make GEOS better and more flexible. From this thought arose the “Patch System”, which allowed defining and distributing small improvements and extensions in a consisteny way. In order to give more users and developers the opportunity to develop new applications for GEOS, a powerful programming environment called GeoCom was created, based on ECOM from the 64’er Magazine. With an extensive manual, created in co-operation with Denis Döhler, the system was offered commercially as a alternative to programming in assemnly.

TopDesk Maintainer

Around the same time, the market around PCs and operating systems developed rapidly, many GEOS fans following the technological development towards PC/GEOS or other available alternatives. I think sometime in 1993/94, the GUC had completely switched its focus on PC/GEOS and was looking for maintainers for its GEOS-64-related technologies and products. With my spectrum of projects and experience, I was probably a good candidate for this: After all, I remained faithful to the C64/128 and GEOS 64 as a developer. I don’t remember when and where exactly, but I was given TopDesk maintainership – and the sources.  It was convenient that TopDesk had been written by the same people as MegaAssembler, so I was already well-equipped for the development of TopDesk.

Improving GEOS and TopDesk

After “Patch System” and GeoCom, I wanted to develop GEOS in a more profound way, to combine it with the extended TopDesk, and to make the whole system more open to the many hardware enhancements that were available on the market. For this purpose, I completely reverse-engineered the GEOS KERNAL (with all-GEOS tools like GEODISASSEMBLER), so it could be assembled again with MegaAssembler. The idea for GEOS 3.0 was born. Development became much faster after I had gotten a loaner Flash 8 acceleration cartridge (8 MHz) – for which I adapted GEOS. TopDesk got color support, and the GEOS KERNAL was further developed to have a flexible driver model. An early version of the project was presented in Berlin at the GUC Annual General Meeting 1994.

After GEOS 64

I think I had already decided to take the step towards the PC at that time, to intensively work my way into the PC/GEOS SDK, which had just been published. (However, various sources seem to state that PC/GEOS wasn’t released until April 1995…) At least that’s when I decided to turn my back on GEOS 64 (initially) and to take on new challenges as part of my university studies. For this reason, the current state of development of TopDesk and GEOS 3.0 was cleanly returned to the GUC – to Wolfgang Grimm, I think. This soon resulted in version 3.0 of TopDesk, which was developed further as part of the MegaPatch 3.




TopDesk 1.3 GEOS64/128 Original Source
Michael Steil — Fri, 24 May 2019 10:26:18 +0000
Thanks to Falk Rehwagen and Jürgen Heinisch, the original source of the TopDesk file manager for the GEOS operating system of C64/C128 is available.
https://github.com/mist64/TopDesk

The source has been converted from GeoWrite format to plain text.
These are the original source disks:

td64_13_en.d71
td64_13_de.d71
td128_13_de.d71
td128_13_en.d71

Most of the code of the different variants is identical.
These disks use TopDesk-style subdirectories. Here is a listing of the disk contents:
  NAME                 TYPE   SIZE
  --------------------------------
  Main/               
    DeskWindows.akt    WRI    40K
    DeskTop.main       WRI    52K
    DeskTop.sub        WRI    4.7K
    DeskTop.sub2       WRI    4.2K
    DeskTop.sub3       WRI    8.6K
    DeskTop.sub4       WRI    6.2K
    DeskTop.sub5       WRI    7.3K
    DeskTop.sub6       WRI    10K
    DeskTop.sub7       WRI    12K
    DeskTop.sub8       WRI    1.8K
    DeskTop.sub9       WRI    7.8K
    DeskTop.sub10      WRI    7.8K
    Ende.s             WRI    808B
    Dos.lnk            WRI    1.0K
  Include/            
    DeskMain2          WRI    20K
    SubDir.src         WRI    21K
    University 6       FNT    1.0K
    Symbol/           
      TopSym           WRI    8.7K
      TopMac           WRI    3.7K
      Sym128.erg       WRI    1.1K
      CiMac            WRI    1.2K
      CiSym            WRI    1.1K
  DeskInclude/        
    Validate+Undelet   WRI    9.0K
    SizeRectangle      WRI    2.3K
    CopyFile           WRI    11K
    DiskCopy           WRI    11K
    SearchDisk         WRI    4.0K
    EditText           WRI    8.7K
    DosFormat.s        WRI    3.0K
  WinInclude/         
    InvFrame           WRI    2.7K
    SpeedFrame         WRI    2.2K
    InvFrame.old       WRI    2.6K
  GetDrivers.dir/     
    GetDrivers.s       WRI    2.8K
    GetDrivers         APP    1.3K
    SaveDriver.s       WRI    1.6K
  Protect/            
    Desksub0.s         WRI    1.2K
    SetBAM             APP    962B
    ProtectDisk        APP    1.3K
    Sources/          
      SetBAM.s         WRI    2.8K
      ProtectDisk.s    WRI    2.8K
      SetProtection.s  WRI    2.4K
      RemProtection.s  WRI    2.4K
      RemProt.mod.s    WRI    2.6K
      SetProt.mod      APP    762B
      RemProt.mod      APP    762B
    TopMac(a)          WRI    4.9K
    SetProt.mod.s      WRI    2.4K
    Desksub0           APP    9.6K

There is a follow-up article with with some history by Falk Rehwagen on the “Patch System”, TopDesk, and GEOS 3.0.

`dispBufferOn`	r5	r6
`%11000000` (default)	fg screen ptr	bg screen ptr
`%10000000`	fg screen ptr	fg screen ptr
`%01000000`	bg screen ptr	bg screen ptr

GPIO	Description
PB3	LATCH (for both controllers)
PB4	DATA (controller 1)
PB5	CLK (for both controllers)
PB6	DATA (controller 2)

Description	User Port Pin	NES #1 Pin	NES #2 Pin	Color
GND	1	1	1	black
+5V	2	7	7	red
LATCH	F	3	3	blue
DATA#1	H	4	–	green
CLK	J	2	2	white
DATA#2	K	–	4	yellow

Controller Code	Description	DOS Code
2	Header not found	20
4	Data not found	22
5	Data checksum error	23
9	Header checkum error	27

pagetable.com

How the Final Cartridge III Freezer works

Freezer cartridge theory

Doing the freeze correctly

Initializing the freezer

Displaying the freezer menu

Accessing C64 memory

Backups

Game trainer

Screenshots

Final words

64 Tips & Tricks [PDF]

Klappentext

Inhaltsverzeichnis

6502 Illegal Opcodes in the Siemens PC 100 Assembly Manual (1980)

The PC 100 Assembly Manual

“Special Instructions”

Analysis

Credits

Siemens Personal Computer PC 100 Bedienungsanleitung, Ausgabe 1981/1982

Brotkastenfreunde Interview

Das Titelbild vom 64’er Sonderheft 2/85

64'er Magazin – mit 40 Jahren Verzögerung jetzt monatlich im Web

Silo S01E06: 38911 BYTES FREE

[Ankündigung] Vortrag “Apollo Guidance Computer” an der Embedded Computing Conference in Winterthur

The Easter Egg in the “Schrott-Tornado” at the Deutsches Museum

darmok.com: Memes in the Tamarian Language

PostScript Cartridge for HP LaserJet

Article Series

Cartridge

Board

ROM

PostScript Files

FONTPAGE

TEST PAGE

STARTUP PAGE

Manuals and Extras

Future Work

Scanntronik Manuals

Pagefox

Printfox

Videofox

Eddison & Eddifox

Cheese

Colourprinter

Handyscanner 64

Catalogs

The Commodore AUTOMODEM (Model 1650)

Historical Context

Photos

Box

Manual

More Box Contents & Tape

A 1960s Children's Book about Computers

Historical Context of the Books

Differences in the German Version

Who invented computers?

Does an electronic brain ever fail?

Complete Comparison

Digitizing Analog Video through a Digital Camcorder

The Problem with Interlaced Video

Digital Video and DV

Digital SD Camcorders

Setup

Installing Tools

Digitizing

Compressing

On-the-fly Compression

Deinterlacing

Limitations

Simpler Solutions

Links

Dissecting a Dummy Promo MiniDisc

References

The Commodore VICMODEM (Model 1600)

Historical Context

Photos

Box

Manual

More Box Contents