25 comments

  • sebastiennight a day ago

    Alright, I've made several updates based on feedback!

    Cost Estimation

        - Shows (very rough) character count estimate (rounded to nearest thousand)
        - Displays approximate cost at $0.12 per thousand characters
        - Updates dynamically as selections change
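
A minimal sketch of how such an estimate could be computed, assuming the $0.12 per thousand characters rate quoted above. The function name, the `CostEstimate` shape, and the `texts` input are hypothetical and not taken from the app's source:

```typescript
// Rate quoted in the post: $0.12 per thousand characters.
const COST_PER_THOUSAND_CHARS = 0.12;

interface CostEstimate {
  roundedChars: number;  // character count rounded to the nearest thousand
  approxCostUsd: number; // approximate cost in US dollars, rounded to cents
}

// `texts` is a hypothetical list holding the text of each selected comment.
function estimateCost(texts: string[]): CostEstimate {
  const totalChars = texts.reduce((sum, t) => sum + t.length, 0);
  const roundedChars = Math.round(totalChars / 1000) * 1000;
  const rawCost = (roundedChars / 1000) * COST_PER_THOUSAND_CHARS;
  // Round to cents so floating-point noise does not leak into the display.
  const approxCostUsd = Math.round(rawCost * 100) / 100;
  return { roundedChars, approxCostUsd };
}
```

Re-running a function like this whenever the selection changes would give the dynamic update described in the list.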
    
    
    Advanced Input Options

        - Added toggle between single thread URL and top 100 stories selection
        - Implemented multi-thread selection with checkboxes
        - Saves input mode preference to localStorage
    
    
    Comment Limit Improvements

        - Changed to "All" as default with option for custom limit
        - Original post no longer counts against comment limit
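
One way the limit could be applied so that the original post is always kept and never counted, as a rough sketch. The function name and the `"all"` sentinel value are assumptions for illustration:

```typescript
// Keeps the original post unconditionally and applies the limit to comments only.
// `limit` defaults to "all", mirroring the default described above.
function applyCommentLimit<T>(
  originalPost: T,
  comments: T[],
  limit: number | "all" = "all",
): T[] {
  const kept = limit === "all" ? comments : comments.slice(0, Math.max(0, limit));
  return [originalPost, ...kept];
}
```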
    
    
    Quote Formatting

        - Text with > is now properly recognized as quotes
        - Quotes are transformed with random introduction phrases
        - Adds "End of quote" with variations at the end of quoted text
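
The quote handling above could look roughly like this. It is a sketch under assumptions: the phrase lists are invented for illustration (only "End of quote" comes from the post), and the picker is injectable so the output can be made deterministic:

```typescript
// Hypothetical phrase lists; only "End of quote" is mentioned in the post.
const QUOTE_INTROS = ["Quoting:", "To quote:", "As was said:"];
const QUOTE_OUTROS = ["End of quote.", "End quote.", "Unquote."];

type Picker = (options: string[]) => string;

const randomPick: Picker = (options) =>
  options[Math.floor(Math.random() * options.length)];

// Turns lines starting with ">" into spoken quotes with an intro and an outro.
// Consecutive quoted lines are merged into a single quote.
function speakQuotes(comment: string, pick: Picker = randomPick): string {
  const out: string[] = [];
  let quoted: string[] = [];

  const flush = (): void => {
    if (quoted.length > 0) {
      out.push(`${pick(QUOTE_INTROS)} ${quoted.join(" ")} ${pick(QUOTE_OUTROS)}`);
      quoted = [];
    }
  };

  for (const line of comment.split("\n")) {
    const match = /^\s*>\s?(.*)$/.exec(line);
    if (match) {
      quoted.push(match[1].trim());
    } else {
      flush();
      out.push(line);
    }
  }
  flush();
  return out.join("\n");
}
```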
    
    
    Link Handling

        - Preserves shared links in expandable section at the bottom
        - Different random phrases for first, second, and multiple links
        - Links open in new tabs when clicked
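
A crude regex-based sketch of the link handling, assuming fixed phrases per position instead of random ones. Only "See the link I posted in the thread" appears in this thread; the other two phrases, the URL pattern, and all names are illustrative assumptions:

```typescript
// Deliberately simple URL pattern; it can capture trailing punctuation such as a final period.
const URL_PATTERN = /https?:\/\/[^\s<>"')\]]+/g;

// Hypothetical phrases for the first, second, and any later link in a comment.
function phraseForLink(index: number): string {
  if (index === 0) return "See the link I posted in the thread";
  if (index === 1) return "See the second link I posted in the thread";
  return "See another link I posted in the thread";
}

interface SpokenComment {
  spokenText: string; // text with URLs replaced by spoken phrases
  links: string[];    // original URLs, preserved for display below the audio
}

function replaceLinks(text: string): SpokenComment {
  const links: string[] = [];
  const spokenText = text.replace(URL_PATTERN, (url) => {
    links.push(url);
    return phraseForLink(links.length - 1);
  });
  return { spokenText, links };
}
```

The collected `links` array is what would feed an expandable section at the bottom of the page.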
    
    Voice Matching

        - Matches commenter usernames to ElevenLabs voices if names match
        - Falls back to deterministic assignment if no match found
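
A sketch of how name matching with a deterministic fallback might work. The `Voice` shape is a simplified stand-in rather than the actual ElevenLabs response type, and both the case-insensitive comparison and the hash function are assumptions:

```typescript
// Simplified, hypothetical voice record; the real API returns more fields.
interface Voice {
  voiceId: string;
  name: string;
}

// Small string hash kept within unsigned 32-bit range.
function hashUsername(s: string): number {
  let h = 0;
  for (let i = 0; i < s.length; i++) {
    h = (h * 31 + s.charCodeAt(i)) >>> 0;
  }
  return h;
}

// Prefers a voice whose name equals the username; otherwise picks a voice
// by hashing the username, so the same user always gets the same voice.
function pickVoice(username: string, voices: Voice[]): Voice {
  if (voices.length === 0) {
    throw new Error("No voices available");
  }
  const wanted = username.toLowerCase();
  const match = voices.find((v) => v.name.toLowerCase() === wanted);
  if (match) {
    return match;
  }
  return voices[hashUsername(username) % voices.length];
}
```

Note that with this approach the fallback assignment shifts whenever the list of voices changes, since the index depends on the list length.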
    
    Error Handling & Recovery

        - Saves progress and allows resuming after errors
        - Shows "Retry" button with partial audio when errors occur
        - Audio generated so far is available for download
    
    UI Improvements

        - Added tooltip with API key information
        - Persistent theme preferences via localStorage
        - Improved responsive design for mobile
        - Filename of the generated MP3 file now matches the thread title
  • sebastiennight 2 days ago

    Issues I've noticed when running it against more threads:

    - don't use Legacy voices, as they seem to be of much lower quality (they sound like someone calling in from an international landline)

    - when the same poster appears many times, it gets tedious to hear them restate who they are. I think after the first 3, we should recognize the voice so that's not necessary anymore

    Feature requests I'll add:

    - emphasize quotes better

    - add audio chapter marks if possible, so listeners can skip ahead

    - attach a speaker's voice to the relevant voice in the 11Labs account if there's a voice with the same name as the username

    - add sound effects if people write down sound effects in their comments (this seems tough)

    Anything I'm missing?

  • rgbrgb 2 days ago

    This is cool. Any chance you can drop an example?

  • gojomo 2 days ago

    Can I upload my own voiceprint so my comments are said in my voice, or a voice of my choosing?

    Can I navigate by voice commands, for example if listening while driving?

    • sebastiennight 2 days ago

      1. This should be possible, I think, for example if you saved your cloned voice in your account under the same name as your HN handle. I'll add this. It should then work for assigning any voice to a specific user (just use the right username as the voice's name in 11Labs).

      2. No navigation by voice commands sadly - it generates a single audio track. I might be able to insert chapter marks for each comment though, so that it'd be possible to "skip" to the next comment!

      • 01HNNWZ0MV43FF 2 days ago

        I don't have a voice print, can I put something in my profile to get a generic feminine voice? I don't suppose there's a pronouns field

        • sebastiennight 2 days ago

          I would think that once I introduce the feature above, you could just create a "01HNNWZ0MV43FF" voice with the Voice Lab[0] inside your account (not necessarily duplicating your real voice, but just using 11Labs' tool to get a feminine voice). Would that work?

          [0]: https://elevenlabs.io/app/voice-lab

  • mosquitobiten 2 days ago

    One big post can have a bigger reply counter-arguing every point 1b1. It would be nice if the arguments go back and forth, basically segmenting the post and the replies into multiple lines of dialog, rather than feeling like you are listening to a speech.

    • sebastiennight 2 days ago

      Wait... do you mean, quoting the original (or parent) poster in their own voice when there's a quote?

      That seems less natural. What I think I can do, though, is turn quotes into actual spoken quotes, e.g. turning

      > One big post can have a bigger reply counter-arguing every point 1b1

      into:

      "Look; you said 'One big post can have a bigger reply counter-arguing every point 1b1'"

      • mosquitobiten 2 days ago

        >Wait... do you mean, quoting the original (or parent) poster in their own voice when there's a quote?

        Yeah, I think what I'm getting at is this: when a big argumentative post crosses the line from chit-chat to speech, break out of the structure of the website. Let the LLM pull out the arguments, connect them to the counter-arguments, and turn it into a back and forth with shorter dialog lines, without repeating too much or having one person talk for very long.

        Also I agree, the LLM should be free to transform or add dialog how it sees fit so it feels more natural but always keeping it true to what is written.

        • sebastiennight 2 days ago

          In this app, the process runs entirely in the browser and has no LLM calls at all, so we don't have the ability to rewrite the conversation (other than performing regexes or other crude operations on the text of a comment, which is how links are turned into "See the link I posted in the thread").

          I also think it's incredibly difficult (even with an LLM) to properly render a multi-turn, multi-user conversation without sticking to the actual hierarchy of the thread. We would probably run into the "summarize the thread and lose nuance" problem again.

  • sebastiennight 2 days ago

    Note: I'm particularly interested in feedback on making the conversation feel even more "natural", so that the audio sounds as close as possible to really listening in on the watercooler chat.

  • plun9 2 days ago

    It seems that in the generated audio, the number of comments is off by one. It is missing 1 comment.

    • sebastiennight 2 days ago

      I think it counts the original post as a comment, so the total shown is (original post plus number of comments). Is it actually missing one comment in your audio? Which one? First or last?

  • wewewedxfgdf 2 days ago

    This is pretty good. I might listen to this as an alternative to a podcast.

    Maybe publish it as a podcast.

    • sebastiennight 2 days ago

      Thank you!

      I have no plans to publish as a podcast (if I was going to go through all the trouble to put a podcast together, it would be an actual podcast for my startup, not for a hobby project!) but I'd love it if someone did it!

  • devrandoom 2 days ago

    Oh nice cool water. It's a bit muddy looking? Is it safe to drink?

    • 01HNNWZ0MV43FF 2 days ago

      Continue straight for eleven thousand miles, then turn left

  • thegreatpeter 2 days ago

    rips hair out

    • sebastiennight 2 days ago

      This sounds painful! I think I'll add a feature so 11Labs generates sound effects for comments like this, so they can be enjoyed in their full glory.