GitHut – Programming Languages and GitHub (2014)

(githut.info)

65 points | by tonyhb 10 hours ago ago

23 comments

  • steveklabnik 9 hours ago ago

    As noted, should be (2014).

    There is also GitHut 2.0: https://madnight.github.io/githut/#/pull_requests/2024/1

    This updates through 2024.

    • nightpool 9 hours ago ago

      Interesting to see the number of JS pushes go down significantly, but actually realize that it's just because many more projects are using TypeScript:

      https://i.imgur.com/AJBE9so.png

      • fuzzythinker an hour ago ago

        No usability consideration at all. Yellow on grey (top curve's) is unreadable.

      • threatofrain 8 hours ago ago

        The library space converged to TS far faster than the rest of the JS world. Also interesting to see the sharp rise of Go.

      • oceansky 6 hours ago ago

        If you sum both, it's 17.204%, which would place it at the top.

  • kodablah 10 hours ago ago

    I think correlating "pushes per repository" to certain languages is interesting. The top "pushes per repository" are C++, TeX, Rust, C, and CSS. I guess it's no surprise many would also consider those the most guess-and-check or hard-to-get-right-upfront-without-tooling languages too.

    • Etheryte 9 hours ago ago

      It's unclear if that's the takeaway here. Pushes per repository can just as well indicate a project that's just old, or active, or popular, or etc.

    • IshKebab 9 hours ago ago

      Really? I don't think Rust is like that because it has such strong compile time checking. More likely because Rust 1.0 hadn't even been released in 2014 so by definition every Rust project was extremely new and active.

      • kodablah 9 hours ago ago

        Yes, maybe the causation assumption here is inaccurate.

  • akerl_ 10 hours ago ago

    The connectors are interesting, but I wish there was a way to sort by a column and have the rows be actually linear.

    Also, worth noting that it looks like this data only covers 2012-2014?

  • jtwaleson 10 hours ago ago

    Would love to see an update to 2025

    • kleiba 10 minutes ago ago

      +1. This has historical value but 11 years are eons in IT.

    • tonyhb 9 hours ago ago

      I really, really want this updated too and saw it in my bookmarks. Figured the historic data was interesting, and that someone might want to give this another go.

  • into_ruin 8 hours ago ago

    This may be a stupid question, but if most iOS apps are written in Swift, why isn't Swift more popular? Is it just because most Swift projects aren't FOSS?

  • ivanjermakov 10 hours ago ago

    Would be fun to weight each language by average number of stars, but normalize by repository count.

    Data analysys without adjusting groups by popularity is a bit lame.

    • clircle 5 hours ago ago

      What statistic are you proposing? Number of repos / avg stars ?

  • jonny_eh 7 hours ago ago

    Wow, 1995 was a stacked year for languages: JavaScript, Java, Ruby, PHP

  • irfn 4 hours ago ago

    1995 was a busy year in new programming languages!

  • miguel_martin 10 hours ago ago

    Why are Nim, Odin, Zig, Mojo not included (and probably many others)?

    • some_guy_nobel 10 hours ago ago

      Probably because this was made in 2014 :D

  • ethmarks 9 hours ago ago

    Absolutely stunning and ingenious visualization, but disappointing data. In 2014 there were 2.2 million repos, while in 2025 there are closer to 500 million. The repo was last updated seven years ago, so I assume that this project has been abandoned.

    A cursory glance at the source code[1] reveals that it's using GitHub Archive data. Looking through the gharchive data[2], it seems like it was last updated in 2024. So there's 10 years of publicly accessible new data.

    Is there any reason we (by "we" I mean "random members of the community" as opposed to the developer of the project) can't re-build GitHut with the new data, seeing as it's open source? It's only processing the repo metadata, meaning it shouldn't even be that much data and should be well under the free 1TB limit in BigQuery (The processed data from 2014 stored in the repo[3] is only 71MB in size, though I assume the 2024 data will be larger), so cost shouldn't be a concern.

    I'm not experienced enough to know whether creating an updated version of this would take an afternoon or several weeks.

    [1]: https://github.com/littleark/githut/

    [2]: https://console.cloud.google.com/bigquery?project=githubarch...

    [3]: https://github.com/littleark/githut/blob/master/server/data/...

    • nightpool 9 hours ago ago

      Apparently someone worked on it, but (IMO) the visualization is a lot less nice compared to the original: https://madnight.github.io/githut/#/pull_requests/2024/1

    • flymasterv 6 hours ago ago

      GHArchive is updated constantly, but the tables reflect COMPLETED time periods. So there’s no yearly/2025, yet. You have to look at the monthlies.

      Source: just left GOOG after 5 years on the GitHub tooling team.