Well, I just jumped straight to claim 1, and it’s clearly invalid:
1. An apparatus, comprising:
an instruction memory circuit configured to store a plurality of instructions;
a plurality of execution circuits; and
a program sequencer circuit configured to:
fetch the plurality of instructions from the instruction memory circuit in a program order; and
issue a particular instruction to a particular execution circuit in response to a determination that there are no hazards associated with the particular instruction; and
wherein the particular execution circuit is configured to complete the particular instruction prior to a completion of a different instruction of the plurality of instructions that was issued to a different execution circuit of the plurality of execution circuits prior to when the particular instruction was issued to the particular execution circuit.
All this is saying is you issue out of order, and instruction B finishes before instruction A even though A was issued first. That’s how pretty much any OoO hardware would work (if instructions take different numbers of cycles, or there is an exception, etc.). Doesn’t say anything about retiring - just “completing” - which makes it even more silly.
The patent specification may be more interesting - often times these initial sets of claims are crazy broad like this and then through the course of prosecution, the claims become focused on what is actually supposedly novel. But I don’t have time to read the whole thing right now.
The main text specifically highlights how their approach is executing instructions out of order without a reorder buffer, I assume this is the primary innovation. They do mention that “completion” is the same as “retirement”. From what I understand, it’s a more energy-friendly way to do OOO with limited scalability - you pre-decode a bunch of instructions, check for hazards, check for data sharing and issue whatever you can. They already have some of the machinery in place for their int/fp co-issue.