Hoarder: Self-hostable bookmark-everything app

(github.com)

385 points | by thunderbong 3 days ago ago

79 comments

  • elashri a day ago ago

    I tried hoarder and I didn't like the way listed view works. I prefer the simplicity of the view provided by Linkding. I find hoarder new auto tagging with ollama something I want to use because I am lazy.

    For references there are many options in selfhosted bookmarking apps market. These beside Hoarder are the most known software.

    Linkwarden (https://github.com/linkwarden/linkwarden)

    Shaarli (https://github.com/shaarli/Shaarli)

    LinkAce (https://www.linkace.org/)

    Linkding (https://github.com/sissbruecker/linkding)

    Wallabag (https://wallabag.org/)

    Shiori (https://github.com/go-shiori/shiori)

    • raybb a day ago ago

      When I use a tool like this one of the most important things is that it works offline so I can read something in a plane or on the go.

      I've looked into most of these (and instapaper, pocket, etc) and ultimately found Wallabag to be the best. However, their app is quite buggy and site is fairly clunky for my taste. Luckily there's a pretty recent 3rd party client that works offline super well and is on Mac/Linux/android/iOS for free (yay flutter) https://github.com/casimir/frigoligo

      Also, I'll note that it's basically a must to use the browser extension with the option to download via what the browser sees if you get content from a lot of sites. That being said the devs are super responsive to reports that sites aren't being scraped appropriately.

      My biggest wish is that they supported YouTube (at least titles) and they had a way to indicate when a article needs to be scraped client side.

      • raybb a day ago ago

        Now that I think about it, maybe I can write a little side script to fetch the YouTube titles and update it.

      • Openai2 18 hours ago ago

        Why is Wallabag better than pocket?

        • raybb 31 minutes ago ago

          Imo it's only better if you expect stuff to be saved correctly for reading offline. Pocket often fails even with major news sites and just opens the link to the browser. That's a big pain for me.

          Also, I don't love the ads and recommended content in Pocket.

      • hnmias a day ago ago

        thanks for the app recomendation, didnt know about it yet..

        regarding youtube, my youtube links saved with wallabager browser extension always show the correct title, are you using something else to save them?

        • raybb 32 minutes ago ago

          Oh good point it does work with the browser extension. I was experiencing this when trying to add them from my phone.

    • user_7832 10 hours ago ago

      Kind of a shot in the dark, but would you (or anyone else here) remember the name/website of a similar subscription app that was modern/minimal in appearance, and iirc was developed by Chinese devs? I remember seeing it I think on HN itself, likely as a comment somewhere. (I think folks were concerned about the data storage/privacy aspect, but iirc the service was very well designed/comprehensive).

      • spondyl 8 hours ago ago

        You might be thinking of https://raindrop.io which is developed by a Kazakh developer?

        • user_7832 8 hours ago ago

          Thanks a lot but I don’t think it’s that, I’ve been aware of raindrop for quite some time (even before I knew of this one I think) and this app only had a paid tier, no free/selfhosted options (which is why I didn’t bother making an account.)

          (Just thinking out loud.. if I were to find all HN threads discussing such apps (maybe searching for raindrop or pocket?) via the search I might find it more quickly.)

          • codetrotter 5 hours ago ago

            Could it be Anybox? Never used it myself but seems like a possible candidate.

            https://anybox.app/

            It does not have self-hosted, although it does have a free tier. The free tier is pretty limited, allowing just 50 bookmarks. So basically amounts to a free trial more than something you’d use for free for a long time, IMO.

            The website is available in English and Chinese, so the developers might be from China.

            • user_7832 an hour ago ago

              Thanks a lot! There's a very reasonable chance is is anybox! (The app I was thinking about wasn't self hosted, sorry about the confusion.) I'll check archive.org to see what it looked like a few months/year ago. (The "clean" and simple name along with the landing page style is definitely similar.)

    • jrm4 19 hours ago ago

      One thing that I've been looking for in these, and I seem to recall few have this:

      Public mode? I'd like people to NOT have to log in.

  • anovick 20 hours ago ago

    Seems like the tag system is flat, which is a big limitation on the organization capability.

    For example, I noticed that in the demo access app, there's a note about cooking, and it has 4 tags: - `baking` - `cupcakes` - `oven cooking` - `recipe`

    This would get out of hand quickly.

    There should be a hierarchy of tags (categories): `cupcakes` in `baking` in `oven cooking` in `recipe`

    The only tag needed in this case for the note would be `cupcakes`

    • oDot 19 hours ago ago

      Hi there! I am extremely glad to read someone else write about this necessity!

      I own and operate a "list-taking" app[0] in which every list/kanban-item can itself be a list/kanban.

      I currently use it for things I'm the creator of -- tasks, story outlines, etc, but looking to introduce 3rd party content for task management (I want to see GitHub tasks from work next to my own tasks) and, as you say, knowledge management of things like recipes or music.

      Items could be part of one or multiple hierarchies. A list of "cake" recipes could be under both "baking" and "party essentials", and music playlists could include other playlists.

      As you can tell, this can become convoluted in my mind, and so if that's something that's interesting to you (or anyone reading this), please reach out and let's discuss! hn at nestful.app

      [0] https://nestful.app

      • pm 16 hours ago ago

        I may have come across your app before in passing, but hadn't checked it out. I playtested aspects of a "productivity system" (grain of salt) with paper earlier this year.

        "Spontaneous productivity" mirrors some of my own thinking on the subject, especially the JIT and bubbling aspects and how they work together. I haven't seen how it works in the case of Nestful, but I'm keen to try it out. It may adjust the design principles guiding development.

    • mohamedbassem 18 hours ago ago

      The tagging system is indeed flat, but the lists can be nested. The idea being that tags are usually AI generated, and there's a lot of them (which is useful for search), but lists are meant for manual curation and this is where you can have whatever structure you want.

      • anovick 18 hours ago ago

        Happy to see that you have considered this

        IMO it would be interesting to try to combine the two approaches (curation + auto tagging).

        It starts out with the user scaffolding an initial hierarchy, then (after enough usage to provide meaningful data for ML predictions) the ML model predicts on subsequent entries, and asks the user for approval (which feeds a reinforcement learning model)

        • mohamedbassem 18 hours ago ago

          This is indeed the plan. We're currently working on generating embeddings for the all the bookmarks stored, and one of the usecases of this is going to be clustering. If a bookmark is similar to all other bookmarks in a list, the model can suggest adding those bookmarks to the list. Still a manual operation, but with ML assistance.

    • ninalanyon 6 hours ago ago

      > This would get out of hand quickly.

      Hierarchies get out of hand quickly too. You will soon find that different people (or the same person at different times) create different hierarchies for the same thing and that the same thing belongs in multiple places.

      • aeonik 5 hours ago ago

        Polyhierarchies combined with RAG is the one true way.

        • UltraSane 4 hours ago ago

          Just give me a Neo4j database and Cypher to query it.

    • UltraSane 4 hours ago ago

      A flat tagging system doesn't require much curation while nested tags require someone to decide what tags are members of what other tags. Taken to the logical extreme a hierarchical tagging system becomes a full blown ontology.

    • tuananh 13 hours ago ago

      i feel this feature is not quite needed in the age of AI

  • marban a day ago ago

    I've saved ~50k .webarchive files from Safari in a single folder, indexed via Spotlight — not an archivist's dream, but I'm a sucker for using anything in the OS stack.

  • mohamedbassem a day ago ago

    Hoarder's maintainer here! What a nice surprise seeing Hoarder on the homepage of hackernews!

    • jonotime 4 hours ago ago

      Question for ya since I'm working on an app with similar platforms - and dont know what I'm doing. I see you used expo for the mobile apps but nextjs for the web app. Why didnt you use expo for the web app too?

    • Fabricio20 16 hours ago ago

      I must ask, in your github readme you say "[Planned] Downloading the content for offline reading." but what does that mean in the context of a self hosted application? Isn't the data already downloaded - via the other features such as "Full page archival" and "Automatic fetching".

      I guess one small request - could the chrome/firefox extension include a way to transfer the page data from the browser, as it's being displayed to the user? (as in, transfer the entire page/html instead of the page's link). This would likely result in much better support for nasty sites like twitter and such that require credentials, etc..!

      • mohamedbassem 4 hours ago ago

        The offline reading thing is bad wording from my side. I mean offline reading on the mobile app (when you don't have internet access / access to the server).

        As for your request, we're tracking this in (https://github.com/hoarder-app/hoarder/issues/172), which aims to do exactly what you're asking for.

  • smcnally 17 hours ago ago

    Hoarder has a chrome plugin, Firefox addon, and apps for Android and iOS for which the app store says something I’ve never seen before:

    “Data Not Collected — The developer does not collect any data from this app.”

  • WillAdams 6 hours ago ago

    It's a nice idea, but really, bookmarks aren't enough.

    I can still distinctly remember reading a site which had Barry Hughart's typewritten notes for his novels (_Bridge of Birds_, _The Story of the Stone_, and _Eight Skilled Gentlemen_), and I considered saving the files for the image scans, but didn't figuring they would always be available.

    Since then, he has passed away and the pages in question vanished (no idea on what order that occurred), and I haven't been able to figure out who is in charge of his literary estate, nor where his papers are stored.

    I'd give a lot for someone to write a book examining his writings in a scholarly context.

    If I think something might be interesting enough to read later, I print it to a PDF --- as a bonus, that means I can send it to my Kindle Scribe to read at my convenience.

  • _ache_ a day ago ago

    I use it for months now.

    It's very good. Some points that can be improved:

    - Search inside the description of the bookmark, it doesn't. - Update to a new version of hoarder. Since the software isn't stable, it's a real problem. - Related to the previous point => More archive formats.

    Otherwise, it's a very good software. Easy to use, nice front-end, good UX.

    • InsideOutSanta a day ago ago

      Is there an official upgrade path available between versions, or do they currently break things without providing migration?

      • mohamedbassem a day ago ago

        Hey, Hoarder's maintainer here. I'd like to know more about the pain you're facing when upgrading hoarder. All the releases since launch has been backward compatible, and it has always been just a matter of updating the docker images.

        Also for searches, Hoarder indexes all the content of the websites it crawls. If it doesn't for you then that would be a bug!

        • SeriousStorm 19 hours ago ago

          Different user here, all of my upgrades have been completely seamless.

  • apitman a day ago ago

    This looks cool

    *bookmarks it in huge Trello list where cool bookmarks go to die*

  • leobg 2 days ago ago

    Anyone using this as an Evernote replacement?

    I use Evernote since what, 2005, 2008? Yet I hate every time I start it up. Such ugly bloatware it has become. And the silly “AI powered” features tacked on when it became fashionable… Man, replacing it would feel so good.

    • samschoice a day ago ago

      I use Joplin since I left Evernote. They have an import for evernote https://joplinapp.org/

    • twojacobtwo 2 days ago ago

      I replaced Evernote with Joplin about 5ish years ago. It was super easy in my case, so maybe it's worth checking into. I haven't used the desktop app recently, but it was always snappier than evernote at the very least.

    • bovermyer a day ago ago

      I use Obsidian for note-taking and personal knowledge management. I haven't used "save for later" bookmarklets or apps since I quit using Evernote many years ago, though.

      • mrbigbob a day ago ago

        If people are interested it was featured hear on HN a day or two ago but Obsidian released an extension called Obsidian Clipper that can save webpages in markdown format. https://github.com/obsidianmd/obsidian-clipper

        • _gtly a day ago ago

          Can this plugin work for Chrome on Android? Does anyone have a recommendation for bookmarking and/or webclips on Android?

        • smallpipe a day ago ago

          I use it when I need to get a table or something from a webpage. Works really well for small stuff.

        • kalinkochnev a day ago ago

          I didn't know this existed for obsidian! This is perfect for me. Thanks for sharing

    • guiriduro a day ago ago

      - Pocket for bookmarking. - Onenote for longer form less structured note taking on copied/linked base material or needing exposition (somewhat reluctantly). Occasionally Word. - Anything/Jetbrains : Markdown for short form or dev docsor with intuitively clear sub-structure or heirarchy. (Pseudo)Code and comments for simple codable ideas, python-like. - Scapple for mind-mapping high level concepts, collections of related ideas or things, associations rather than hierarchies

    • bleomycin 19 hours ago ago

      Not sure hoarder is a direct replacement for evernote. As a former long time evernote note user I replaced it with https://notesnook.com

    • be_erik a day ago ago

      I use it exactly that way and publish an RSS feed of my archives: https://enlace.space/~erik/rss.xml

  • sigmonsays a day ago ago

    anyone know how this compares to archivebox?

    Is it just bookmarks or does it download full pages?

    Bookmark applications are generally a failure for long term storage because links always change over time.. so i'm not sure what lense to look at this app through.

    • elashri a day ago ago

      It does allow content retrieval so it does save local copy. And allow even downloading videos using yt-dl.

  • maestrae a day ago ago

    Been looking at these tools and honestly, my big issue is backup - specifically, I want a dead simple way to continuously back up my links to something like S3. Not really interested in relying on the Internet Archive or having to set up and babysit yet another backup system. is it like completely antithetical to the average self-hoster ethos to use the cloud where it makes sense?

    I really like hoarder, but kind of surprised how many tools either totally skip over backup functionality or treat it as an afterthought (like this Hoarder issue here: https://github.com/hoarder-app/hoarder/issues/75). Feels like this should be a no-brainer feature, right?

    • nrdvana 20 hours ago ago

      If you're hosting it yourself, I would expect that you also make backups of your whole server and/or database if one is involved. A custom backup feature built into the app seems redundant.

  • jrm4 a day ago ago

    So far I love and am probably about to go all in, especially since you can bring your own "AI."

    Relatedly (and I think the authors are working on it) anyone using local AI for tags and know good ways to tweak (I'm using Ollama and would love to constrain the the tags a bit?)

    • mohamedbassem a day ago ago

      Hoarder supports customizing the tagging prompt in the user settings. You can instruct the model with whatever rules you want to constrain the tags to your likings!

  • nirav72 21 hours ago ago

    Hoarder has been great. Was finally able to dump my annual $40 Pocket subscription. Auto tagging using LLM works well. Only issue I have is that mobile app doesn’t allow offline storage and viewing.

  • garrisonj a day ago ago

    These types of projects always look really cool, but I always doubt they will be maintained so I decide not to use them.

    • bravura 18 hours ago ago

      Does it really matter if it's simple to export your data and then import to another bookmark manager?

    • _-_-__-_-_- 18 hours ago ago

      I have to agree. I've been using standard notes, and honestly, will probably end up just paying for it.

  • evolve2k 18 hours ago ago

    When I think hoarder, there’s a different app I want.

    We’ve got a shed full of boxes and bags of stuff. Want an easy way to take pics of the contents of a box and the “box number” and be able to browse for the box or specific contents later. Eg a home archive solution.

    Anyone know of tools for that?

    • jhogendorn 18 hours ago ago

      Inventree or homebox are what you want.

  • edoceo a day ago ago

    I've one I built called Cras. Store all my neat things I find on the web there. Recently turned it into a PWA so it's a share-target on my phone (Android) which is awesome.

    It's self-hosted and all packed into SQLite so, IMO, very portable.

    Recently added a trick to snapshot all the public links I save - my copy and on Archive.is - link rot is real.

    • trestacos a day ago ago

      I've been thinking about building a service that makes it easy to self host apps like this. I'm curious if you'd find it useful.

      It would host webapps like yours that use in browser sqlite to store data, then the service provide a sync their sqlite data across different devices. The user not the app would pay for the storage of the data, so they would own their data. And you can use CSP to lock down the app from sharing with other domains, meaning an app can't leak your data.

      The service would handle identity (only you can access your sqlite data - the app just ) and could provide an app store like experience with different apps of this type.

      Sort of like a firebase style backend as a service, but the user would own the data instead of the app.

      • jonotime 5 hours ago ago

        I am working on this problem from another angle. Where instead of hosting, the user only has to bring a sync solution. No services or auth needed - only storage. See more here: https://github.com/jonocodes/savr?tab=readme-ov-file#why-ano...

      • georgebcrawford 19 hours ago ago

        Is that what pikapods is? I ask out of ignorance, not "somebody is already doing that".

        • trestacos 3 minutes ago ago

          Pikapods looks to be running containers for you, preconfigured for the self hosted apps they support. That's a cool service, I assume there are other similar services around.

          The concept I'm thinking about is different - it doesn't run any app code on the server, the apps are SPAs that run in the browser only (no backend supporting code), and then the server just syncs the data from the apps. This means the apps can focus on building ux/business logic and not worry about database, how data gets to/from clients, identity, etc. Somewhat like firebase but where the users pays for the server, not the app. That should hopefully be simpler for developers, and theres a lot less likelihood of issues with server config/etc (although presumably pikapod will handle that). It should also be cheaper since you're not constantly running a container, just storing data.

          I'm not sure if it's a useful concept or not yet :)

    • paul-tharun 7 hours ago ago

      is it open source? Can you share the link for it please.

      • edoceo 4 hours ago ago

        Not yet, but I will.

  • SeriousStorm 19 hours ago ago

    Love this app. The only thing on my wishlist is a way to "discover" stuff I've bookmarked before when googling. So the extension would search Hoarder when I search Google (or wherever search engine) and give me a list of those next to my Google search or in the extension drop-down. I sometimes forget that I bookmarked a solution 3 weeks ago, and now I'm searching for that solution again.

    I think Evernote had something like this when I was using it.

    • johntash 19 hours ago ago

      I haven't tried Hoarder yet, but Linkding has a linkding-injector extension that does this. It's pretty useful.

  • technotarek a day ago ago

    FYI the demo’s login credentials are case sensitive for the email field (when they probably shouldn’t be?).

  • lormayna a day ago ago

    I am using it since couple of months and it works really fine. The UI is improved a lot since the first release. Tagging is not the best, but it depends a lot on the AI response

  • stogot a day ago ago

    I just boookmarked this into a “figure out where to put this later” folder, then realized the irony of that action.

    • taude a day ago ago

      Went to my para system "0 - Inbox".

  • BOOSTERHIDROGEN a day ago ago

    How do you manage restrictions on Reddit and X?

    • lormayna a day ago ago

      It's using a chrome instance to access JS enabled sites.

  • activeradio a day ago ago

    really cool!

  • on_the_train 10 hours ago ago

    > with a touch of AI

    It's fascinating even to myself how fast I closed the tab. The annoyance and oversaturation with "ai" is on a level I didn't think was possible