  • What CPU hardware did you test on, and with what compiler version? Does your code call a lot of SGX functions? If so, have you tried simpler microbenchmark loops that run exactly the same instructions in both builds (you can check that with a disassembler)? For example, perhaps store-forwarding latency is higher inside SGX than outside, which would hurt most -O0 code a lot, or perhaps L1d load-use latency on a cache hit is higher if the hardware has to do extra checks. Commented Jan 19, 2024 at 15:12
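
A minimal sketch of the kind of microbenchmark loops the comment has in mind, not code from the question. The function names (`store_forward_chain`, `pointer_chase`, `ns_per_iter`) are hypothetical. The two kernels have no `main` and make no library calls, so the same source could be compiled with the same flags into both the enclave and the non-enclave build and compared in a disassembler. The timing helper uses standard C `clock()` and is meant for the untrusted side only: an enclave cannot make system calls directly, so the enclave run would likely have to be timed around the call into it.

```c
#include <stdint.h>
#include <time.h>

/* Dependent store/reload chain: every iteration loads the value the
 * previous iteration stored, so time per iteration is roughly
 * store-forwarding latency plus one add. `volatile` forces a real
 * load and store even when optimization is enabled. */
uint64_t store_forward_chain(uint64_t iters) {
    volatile uint64_t slot = 0;
    for (uint64_t i = 0; i < iters; i++)
        slot = slot + 1;
    return slot;
}

/* A single cell that holds its own address. */
static void *volatile chase_cell;

/* Pointer chase: the address of each load comes from the previous
 * load, so time per iteration approximates L1d load-use latency.
 * At -O0 the pointer `p` itself lives on the stack, so a store/reload
 * is added to the chain. Returns 1 if the chase ended on the cell. */
uint64_t pointer_chase(uint64_t iters) {
    void *volatile *p = &chase_cell;
    chase_cell = (void *)&chase_cell;
    for (uint64_t i = 0; i < iters; i++)
        p = (void *volatile *)*p;
    return p == &chase_cell;
}

/* Untrusted-side helper (assumption: clock() is available here).
 * Returns average CPU time per iteration in nanoseconds. */
double ns_per_iter(uint64_t (*kernel)(uint64_t), uint64_t iters) {
    clock_t start = clock();
    volatile uint64_t sink = kernel(iters); /* keep the result live */
    clock_t end = clock();
    (void)sink;
    return (double)(end - start) * 1e9 / CLOCKS_PER_SEC / (double)iters;
}
```

Calling `ns_per_iter(store_forward_chain, n)` and `ns_per_iter(pointer_chase, n)` with a large `n` (hundreds of millions) outside the enclave, then timing the same kernels through an enclave call, would show whether either latency differs. Dividing by the core clock period converts the result to cycles, which is only meaningful if the clock frequency was stable during the run.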