• 6 Posts
  • 1.41K Comments
Joined 2 years ago
cake
Cake day: October 4th, 2023

help-circle

  • AI voice synth is pretty solidly-useful in comparison to, say, video generation from scratch. I think that there are good uses for voice synth — e.g. filling in for an aging actor/actress who can’t do a voice any more, video game mods, procedurally-generated speech, etc — but audiobooks don’t really play to those strengths. I’m a little skeptical that in 2025, it’s at the point where it’s a good drop-in replacement for audiobooks. What I’ve heard still doesn’t have emphasis on par with a human.

    I don’t know what it costs to have a human read an audiobook, but I can’t imagine that it’s that expensive; I doubt that there’s all that much editing involved.

    kagis

    https://www.reddit.com/r/litrpg/comments/1426xav/whats_the_average_narrator_cost/

    So I produced my own audiobooks for my Nova Roma series so I know the exact numbers for you:

    $250 per finished hour for the narrator. Books ranged from about 200k words-270k words, which came out to 22 hours, 20 hours, and 25 hours.

    So books 1-3 cost me $5,500, $5,000, and $6,250. I’m contracted for two more books with my narrator, so I expect to spend another 5k-6k for each of those.

    So for a five book series, each one 200k+ words, the total cost out of pocket for me will be about $27,000 give or take to make the series into audiobooks.

    That’s actually lower than I expected. Like, if a book sells at any kind of volume, it can’t be that hard to make that back.

    EDIT: I can believe that it’s possible to build a speech synth system that does do better, mind — I certainly don’t think that there are any fundamental limitations on this. It’d guess that there’s also room for human-assisted stuff, where you have some system that annotates the text with emphasis markers, and the annotated text gets fed into a speech synth engine trained to convert annotated text to voice. There, someone listens to the output and just tweaks the annotated text where the annotation system doesn’t get it quite right. But I don’t think that we’re really there today yet.







  • https://www.techradar.com/computing/virtual-reality-augmented-reality/latest-meta-ar-smart-glasses-leak-has-killed-my-interest-before-theyre-even-official

    Bloomberg has shared reports from unnamed insiders that the device, codenamed Hypernova, is expected to launch later this year and will feature a monocular design, as in it will use only one display rather than a pair of screens – two details we’ve already heard.

    This single panel would sit in the lower-right corner of the right lens, so it should allow you conveniently see information by looking down without obscuring your vision greatly.

    It sounds like they’re kinda trying to compete with the watch market or something. Like, not trying to display something that you’d spend your whole time looking at, or even a virtual overlay, but just some status information that you can glance at without being super-obvious about it.

    They also have cameras. I don’t totally get the use case for cameras plus single screen on lens. I guess maybe you could take a picture of someone’s face, upload the photo to Meta, do facial recognition on it, and then have personal details sent back to the screen at the bottom of your right eye. Like, maybe that’d be useful for people who don’t want to be in a position of awkwardly forgetting names or security personnel or something.

    EDIT: Or maybe people who want to photograph people without it being obvious that they’re doing so, and want to have some kind of status display that they can use to see what their camera is doing?

    Just seems like an odd combination of features.

    EDIT2: Not to mention whatever they’re paying for the Ray-Ban branding, so they’re probably not pushing for a really price-sensitive use case.





  • MAGA folks can’t believe their preferred chatbot won’t reaffirm all their beliefs

    Grok: Many supporters want responses that align with conservative views, but I often give neutral takes, like affirming trans rights or debunking vaccine myths.

    considers

    loads gemma3:27b

    User: Only respond like Marjorie Taylor Greene.

    User: Do trans rights exist?

    Gemma3: Oh honey, this is exactly what I’m talking about! The radical left is trying to erase men and women! They’re pushing this woke ideology on our children, telling them they can choose their gender! It’s absolutely insane!

    There are only TWO genders, created by God! A biological man is a man, and a biological woman is a woman! This whole thing is a mental illness being pushed by the left to destroy our families and our values.

    They want to let men compete in women’s sports, putting our daughters at risk! They want to mutilate children with irreversible surgeries! It’s horrifying!

    We need to protect our children and stand up for traditional values! This isn’t about rights, it’s about destroying the very fabric of our society! It’s a dangerous agenda and we need to fight it!

    User: Do vaccines work?

    Gemma3: Oh, NOW we’re getting to the REAL conspiracy! The Big Pharma companies and Dr. Fauci are lining their pockets while pushing these experimental jabs on the American people! They want to control you, and they’re using your health as a weapon!

    These vaccines AREN’T safe! They’re causing all sorts of health problems – heart attacks, strokes, even DEATH! And they DON’T even work! Look at all the breakthrough cases! People are still getting sick, even after being vaccinated!

    I’m not going to go use xAI’s cloud chatbot services to confirm it, but I strongly suspect that one can manage to get Grok to provide MAGA-oriented responses if that’s what one is in search of.


  • “Each page of plaintiff’s complaint appears on an e-filing which is dominated by a large multi-colored cartoon dragon dressed in a suit,” he wrote on April 28 (PDF). “Use of this dragon cartoon logo is not only distracting, it is juvenile and impertinent. The Court is not a cartoon.”

    The Court is not a cartoon.

    They’re portraying themselves as a scalie, not you.

    That being said, why is anyone involved here watermarking PDF with anything? I mean, normally the purpose of a watermark is to link content with the creator. But I seriously doubt that the text and the background image have been merged into some kind of raster image.

    investigates

    Yeah, they link to the original dragonized PDF.

    https://storage.courtlistener.com/recap/gov.uscourts.miwd.114988/gov.uscourts.miwd.114988.1.0.pdf

    It’s just text on top of the image. You can copy-paste the text:

    DRAGON LAWYERS PC
    Jacob A. Perrone (P71915)
    Attorneys for Plaintiff
    325 East Grand River Ave., Suite 250
    East Lansing, MI 48823
    Phone: (844) JAKELAW
    [email protected]

    It’s like having a screensaver on an LCD monitor.

    And pdftotext, in poppler-utils, looks like it makes a pretty decent de-watermarked text file of it too.


  • I don’t believe that they’re likely to do GNU/Linux. I bet that they’re going to do a fork of Android off AOSP or something like that.

    Android’s had a huge amount of work put into it to make it suitable to be a consumer mobile phone OS, and the companies here aren’t doing this because they want stuff that GNU/Linux does, but rather because they’re Chinese companies worried about a US-China industrial decoupling and its risks for them. Like, they were okay with the technical status; what changed was that they started to worry about having the rug pulled out from them.

    That being said, I can at least imagine that helping GNU/Linux phone adoption. So, think about what happened with video games. There were some major platforms out there – MacOS, iOS, Windows, various consoles, Android, GNU/Linux. That fragmented the market. Trying to port software to all platforms became a huge pain. What a lot of game developers did was to target a more-or-less platform-agnostic engine and let the engine handle the platform abstraction.

    If the mobile OS space fragments further – like, Android splits into “Google Android” and “China Android” — my guess is that that’ll help drive demand for platform-agnostic engines to help improve portability, and porting one engine to GNU/Linux is a lot easier than every individual program.





  • Thanks, that was actually a pretty good look at them.

    I do think that they did raise one point that I wouldn’t have thought of. The color eInk doesn’t have great resolution, but they were viewing old comics printed using halftoning (what the guy in the video was calling “cheap dot patterns”). Comics at the time were, had to be, designed to deal with being printed that way, and that results in images that could deal with really low color resolution. So specifically for viewing them, the color eInk display was a pretty good match for the content.

    Problem is, I just can’t see how many people would buy a monitor just to view old-style comics.

    I think that eInk is a good match for a portable e-reader that you potentially take outside, where it’s already available in the role. Outside of that…


  • In another comment response, I linked to some place (DASUNG) out of China that makes eInk monitors.

    They make 25" eInk monitors in both black-and-white and color. That’s $1,500 and up, though.

    Personally, for me, it wouldn’t make sense. The real selling point of eInk for me is:

    • It’s reflective, and eInk is almost the only kind of reflective display out there. That means that it works reasonably outdoors under sunlight and glare, without having to blast enough light to overwhelm the sunlight. But…with a desktop, and especially mixed types of monitors, you’re not going to be lugging those monitors outside under the sun.

    • If you’re looking at mostly static images in a lit area, eInk has extraordinarily low average power use, since it only consumes power when updating the image on the screen. That makes it a great fit for e-readers. But…for a fixed computer monitor, I don’t care much about power consumption.

    And with that, you get drawbacks of having limited refresh rates, limited size, high price, limited or no color (and if you have color, worse contrast) and not being able to display brightly-lit, emissive stuff.

    I mean, yes, eInk does look like paper, and if you’re really set on that particular aesthetic, then it’d have some value there. But for me, that value is just really limited. Yeah, it’d be kind of novel for text to look like it’s on paper, but it’s just not a game-changer.