I'm a total noob at assembly, but if you know how it works, couldn't you simply inline some asm and optimize it yourself?
Also, maybe the compiler will optimize it properly if you put it in parentheses?