So there are multiple sites&groups that pirate video games especially on PC. I was wondering if there are places on the internet where you find source code for games especially the highly modifiable ones like Half Life 2/Portal and Skyrim. Or groups that crack into the source code of games (or even software in general), not only for PC maybe PS, XBox or mobile too, and share it. I just wanted to see some code samples of games or their engines, maybe I get hooked into video game design. Shout out to Valve for sharing a lot about the creation of Half Life 1

      • Draconic NEO@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        2
        ·
        1 year ago

        Only if they wrote why in it though, plenty of people (unfortunately myself included) fail or forget to add meaningful comments or they let their comments go stale when making changes by forgetting to update them (I do it a lot too), and some people also use horrible function names that don’t make any sense.

        So it only really applies to source code intended to be released where care was made to ensure it would be readable, it might not apply for source code never intended to be public, such as stolen, leaked, or posthumously released. In this case the only real benefit is that it can be recompiled on different architectures provided there isn’t a dependency issue preventing that.

        • lud@lemm.ee
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          Source code without comments is still way easier to read than machine code.

          And there are very likely comments anyways.

          Multiple Devs will work on the same code for years and of course they need or at least appreciate comments.

    • Draconic NEO@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      But even if you can intuitively understand what the machine code is you’ll still need to convert it by hand back into something more portable, or to the machine code of another platform you might want to run it on. There’s not really an easy automated way to do that, even when playing dirty.

      That’s the hard part, getting it into a different form, such as from x86 to ARM or from 6502 RISC to x86.

  • X3I@lemmy.x3i.tech
    link
    fedilink
    English
    arrow-up
    5
    ·
    edit-2
    1 year ago

    Well, source code is not sth that you “crack”, you can only reverse engineer it (I think it was done with Doom, also OpenRA) or steal it from the company’s servers. The use for it is also rather niche, so the risk vs gains ratio is not attractive enough to feed dedicated websites. You can also look at fully open source games like 0AD and check out what they did!

    • CaptainBasculin@lemmy.ml
      link
      fedilink
      English
      arrow-up
      4
      ·
      1 year ago

      Doom was open sourced later. An example of a game that got reverse engineered just fine is Super Mario 64

      • datavoid@lemmy.ml
        link
        fedilink
        English
        arrow-up
        2
        ·
        1 year ago

        Or the time the people who made doom reverse engineered super mario 3 for PC, then turned it into commander keen after Nintendo was unimpressed.

      • Excrubulent@slrpnk.net
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        1 year ago

        I was once at a talk by someone in that company and he straight up said that open sourcing it was a mistake. I assume because that meant they couldn’t sell us a thousand versions of it like Skyrim.

        No word of whether its ongoing popularity was at all caused by open sourcing it.

  • BotCheese@beehaw.org
    link
    fedilink
    English
    arrow-up
    0
    ·
    1 year ago

    From what I understand: During the compilation of the code it becomes unreadable to humans, needing to be reverse engineered, which is entails insane amounts of work. So, unless there is a leak or the game isn’t fully compiled like (I think) Unity games, it will be unlikely to find source code.

    • NuXCOM_90Percent@lemmy.zip
      link
      fedilink
      English
      arrow-up
      3
      ·
      1 year ago

      Yes and no.

      VERY oversimplifying but:

      The basic flow of compilation is (human readable-ish) source code becomes symbols which the compiler then applies optimizations to before converting into machine code.

      With no optimizations, it is a pretty trivial process to revert back. You lose the names of variables and functions (unless they were added as annotations by the compiler, which they often are for many reasons) but you get the logic.

      With optimizations, things get messier. An example that is pretty popular and useful to understand this is https://en.wikipedia.org/wiki/Fast_inverse_square_root The coder likely wrote invsqrt(foo), expecting this to map to a series of hardware functions in the floating point unit. Instead, they see that it became a series of integer operations and bit shifts and get very very confused. Its like that, except imagine an entire 5000 line file becomes two instructions.

      That said, compilers are stupid and most people (rightfully) use libraries to a large portion. So the above example would actually not happen because they would just see the instructions to do a function call labeled “fast_invsqrt” and not care. Similarly, basic heuristics (or, if you are trying to get funding, machine learning and neural nets) can be applied to figure out what was done in heavily optimized blocks of code. So the resulting “source code” might be something like

      int foo(int bar)
      {
        int a = 4;
        // Loop to do some fancy ass math shenanigans to a
        return a + 5;
      }
      

      Which results in more or less human readable code again. Also, a lot of compilers will actually embed basically comments in the assembly/binary unless someone remembers to actively disable that. And nobody ever does.

      Because source code really doesn’t matter all that much. If anyone ever did a deep dive of the Source Engine and Unreal Engine (of the day. I think that was 3?)… they were actually a lot more similar than not. Yeah, there was a lot of infrastructure and even some fundamental differences (been a minute, but I want to say Source is still additive because of its quake origins whereas UE3 was still subtractive geometry?). But… function influences form and they have most of the same functionality.

      The issue is more, like with DRM, those “week one sales” as it were. Data is sparse for a lot of reasons, but Securom for Mass Effect 1 PC is generally treated as a massive win because warez groups could not properly crack it for a week or so which led to a lot of pc players just buying the game because we needed hot blue lady action NOW. Same with the reason so many games still use denuvo.

      And same with engine and game features. Unity and Unreal basically were in an arms race for years (and Cryengine basically sucked on the barrel of their own gun) where one would add one feature and the other would add the same one a few months later. Having access to the source code potentially means someone can accelerate that a lot…

      Except they won’t. Because this shit is cancer. Any dev who does anything even remotely similar to how Wolverine jacks off to his friend’s wife in a game is opening themselves up to a LOT of lawsuits and investigations. Its why almost every emulator group will go scorched earth on anyone who even acknowledges looking at the nintendo leaks. And it is why someone quitting their job at Microsoft to go work at Google and offer to provide a Teams feature gets blacklisted and reported immediately. Companies don’t fuck around with that.

      Which leads to the real reason these are so detrimental to gaming and software. People… put questionable comments in their code all the time. Sometimes it is ///TODO: Implement this in Q2 2024 for release 1.5. Sometimes it is //Hey, Fuck you Fred. I finally did this. Stop your fucking bitching and stop whining to John that you couldn't do anything until I did this for you. And sometimes you get bullshit like Bungie (?) using racial slurs as codenames for a lot of the skins and the like…

      And it also makes it a lot easier to make a cracked binary if you know what the code was before the DRM was applied.

      So decompiling code is actually a lot more viable than you would expect. But also entirely pointless.

    • X3I@lemmy.x3i.tech
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      Correct. However, this looks like a direction that language processing and GPTs would excel in and to my knowledge, there already have been some ML addons made for certain RE tools, so this might become more easy in the future. If that is necessarily a good thing is a different story…