I record myself on audio 24x7 and use an AI to process the information

twobitshifter · on Nov 15, 2022

This is known as life logging with adjacency to sousveillance and it’s a fascinating topic.

https://en.m.wikipedia.org/wiki/Lifelog https://en.wikipedia.org/wiki/Sousveillance

We in general don’t want to be watched by others, but a managed record of our own activities can be extremely valuable, and even more so if you find yourself wrongly accused. Further it can be used to shine a light on corrupt officials, one example of this is the nycplacards exposes on twitter.

manholio · on Nov 15, 2022

The trouble with any such footage is that it can be used against you ("as the the defendant's own records show, they were present in the murder area") but they generally won't extricate you when produced by you, since you clearly have a motivation to use it selectively. So you showing a picture of yourself reading a book during what you claim is the murder night is not an alibi, because it could have been produced at any other time, and you will have a massive uphill battle in the court to authenticate that image, and risk even sink you further if you fail ("the defendant even prepared an alibi").

The only way I would accept a commercial product performing this always-on archiving is if:

1. It's encrypted by default with a strong key that can't be subpoenaed or circumvented.

2. The encoder generates its own random key upon installation which I don't know (recoding effectively random, undecodable data), and I then have to manually change the key if I expect to ever read the recording.

Number 1 allows me to review and footage and only release it if it's in my interest, and number 2 affords me plausible deniability, if I don't release the key I can claim I did not know you need to set it manually.

Sure, as long as you are the only nerd doing this, you don't need this complex setup, and you will probably get to use it the unencrypted footage only in your favor. But when it becomes widely accepted as a social norm (say, everyone wearing Google glasses), you can expect law enforcement will become aware of it as a cheap source of self-incriminatory evidence.

twobitshifter · on Nov 15, 2022

I‘d be interested in knowing of any cases where someone who recorded their own activities used it as an alibi. Right now it's all theoretical. Dash cams are really strong evidence in traffic court, but this isn’t criminal so it has a lower bar. From the other end, body cam footage is powerful when worn by police, and cell phone evidence by bystanders are also strong evidence.

lazide · on Nov 16, 2022

I was falsely accused of some serious crimes and abuse.

It turns out, having security camera footage of the entire period of time showing that not only did you do nothing of the sort, but fundamentally could not have done anything of the sort, and was actually just playing with your kids the entire time, and the accuser had to have known so, changes the dynamic of such a case quite quickly.

Unfortunately, no one seems to take Perjury seriously in California, at least when the accuser is a woman. Despite it being a felony with penalties of 2,3, or 4 years in prison, and airtight evidence of multiple repeated occurrences.

jimmySixDOF · on Nov 16, 2022

That works now but it won't be long until they can find someone to deepfake an alternative camera showing you red handed and then the court has two conflicting stories backed up by conflicting proofs.

lazide · on Nov 16, 2022

Possibly, but chain of custody and credibility of claims matter. In my case it was all originally stored on a cloud security video storage system which I had emails going back years documenting existed, evidence the video came directly from it, and that it had been installed years before.

Also, the footage I provided was not showing anything unusual or hard to believe. I also had significant history available showing that this was normal and par for the course for me, not that anyone asked.

Their claims were quite extraordinary, and they had zero concrete evidence to back them up.

It still took awhile for the case to resolve, and they tried to make all sorts of other claims - including that the footage was from cameras that were installed without anyones knowledge and was illicitly recording, and was therefor illegal and inadmissible. so I dug up the email where the accuser had asked me for my permission if it was okay if they had it installed because it recorded audio too. That said, it was visibly, blatantly, and obviously installed in the open, and there were witnesses who could testify that had always been the case, so they had no leg to stand on anyway.

Even if I had the opportunity to fake it (which I could have, I guess, or at least edited it or something) it’s pretty hard to fake all the other circumstances which support the validity of it. They never tried to impeach the video itself.

If someone provides an unsourced security video showing the person murdering the president, and the other person shows security video from a confirmed third party using a system that had existed for years, showing they were at home watching TV at the time - and the president appears to still be alive at the time and unharmed - it’s not hard to figure out who is faking it.

I’m having a hard time imagining how they could have produced a video supporting their claims with the same level of even apparent credibility where it wouldn’t have fallen apart immediately with any investigation.

The courts already have to deal with people lying all the time and being disingenuous - it’s why the procedures exist and everything is so painful, IMO.

Knowing the rules and keeping documentation does work, generally.

I would recommend being careful to avoid situations which could be easily twisted or misinterpreted from the evidence however, especially after this situation. And never proactively provide data or show your hand to someone trying to attack you this way, as it can provide them more means to try to twist things and make life harder.

Good attorneys are key here.

Spooky23 · on Nov 15, 2022

Time is really critical. Alot of police investigative work is about stitching camera footage together in a timeline.

I served on a jury where a case was built around camera evidence immediately before and after an event. Of about a dozen relevant data sources, only one had verifiable, correct time. The defense was able to impeach that evidence, and the whole case collapsed. A dude got away with manslaughter.

Ylpertnodi · on Nov 16, 2022

"A dude got away with manslaughter." He got away with it only if he did it.

If the presented evidence failed to prove his guilt (brd), then he didn't so much 'get away with it', as sufficient proof was not presented to prove his guilt.

Sorry to put words in your mouth, but it sounds like you thought he did it? Which, isn't necessarily a bad thing, as there are balances in the system that prevent you from being the sole judge, jury and executioner regarding the situation.

Spooky23 · on Nov 17, 2022

Absolutely true in theory and I accept the outcome, although it bothered me for some time. It didn't meet the standard of proof; however the family successfully obtained a civil judgement later. But there is no doubt in my mind that the guy was guilty of hitting a fallen pedestrian, probably by accident, and driving off.

The thing that broke the case was a rhetorically talented defense attorney breaking down a (poor) expert witness who was inexperienced and unprepared. Certain traffic control devices use NTP to set their clock, and the expert wasn't able to articulate responses to the questions competently... so the video was admitted into evidence, but the metadata was not.

The loss of the time source placed events depicted on a series of cameras from different sources, all with incorrect clocks, into a 10-minute window (as opposed to a 2 minute window), which broke the case and led to a dismissal. The attorney did a good job by her client, I'd hire her in a second.

manholio · on Nov 15, 2022

The body cam footage is a good example, it's deeply hated by the police and a frequent source of incriminatory evidence against the wearer.

Since "you got nothing to hide", as the old saying goes, why not bodycam yourself and offer the authorities a great source of evidence they can use against yourself?

twobitshifter · on Nov 15, 2022

I think that’s right, it’s a double edged sword. You would have to ask if you’re more likely to be wrongly accused or to be caught doing something wrong by your own recordings.

This guy has been recording himself publicly since 2002 after ending up on a no fly list. https://en.m.wikipedia.org/wiki/Hasan_M._Elahi

I think that’s taking it too far and would rather encrypt than publish it publicly, but doing it publicly does strengthen the alibi

iudqnolq · on Nov 15, 2022

You're missing an option: Wrongly accused on the basis of your own recordings. Imagine this real life situation occured, but it was your own recordings instead of surveillance cameras

> A key piece of evidence in the case is video surveillance footage showing Williams’ car stopped on the 6300 block of South Stony Island Avenue at 11:46 p.m.—the time and location where police say they know Herring was shot.

> How did they know that’s where the shooting happened? Police said ShotSpotter, a surveillance system that uses hidden microphone sensors to detect the sound and location of gunshots, generated an alert for that time and place.

(The defense argued ShotSpotter makes up data and hides behind opaque AI they refuse to rigorously test. Instead of responding, the prosecution dropped the case.)

https://www.vice.com/en/article/qj8xbq/police-are-telling-sh...

twobitshifter · on Nov 15, 2022

If I follow, the police would need access to the recordings to make the case, which would mean at least probable cause for a warrant. If Herring had a camera or mic on him running at the time of the shooting wouldn’t that contradict shotspotter? It seems more likely any data you have would create doubt rather than bolster the police case.

In general, the shotspotter and surveillance cameras already exist, so what do you have to counteract that? Doing things like leaving no paper trail because you pay everything in cash, or no location data because you keep your phone in airplane mode, leaves little crumbs for your defense, and may create the appearance of hiding something.

iudqnolq · on Nov 15, 2022

> If I follow, the police would need access to the recordings to make the case, which would mean at least probable cause for a warrant.

PC is a very low bar.

> In general, the shotspotter and surveillance cameras already exist, so what do you have to counteract that

It's not binary. Lots of areas, including the inside of your house, probably aren't covered by surveillance cameras.

lazide · on Nov 16, 2022

Having done so, it completely changes the dynamic.

Especially when you know the criminal code, and can ask questions such as ‘officer, I’m pretty sure they are currently committing felony <blah blah> against me. I don’t want them to go to jail, but I do want them to stop committing felonies against me.’

All the sudden, it goes from ‘nothing we can do’ to action.

iudqnolq · on Nov 15, 2022

Bodycams are actually an example of a reform successfully cooped by police bureaucracy.

https://www.mercurynews.com/2021/05/16/police-pr-video-machi...

ClumsyPilot · on Nov 15, 2022

The American Jurors have convicted people based on a man's interpretation of a dog signalling that a dead body was on someone's property 5 years ago. Once people are that gulliable, they are beyond help.

https://www.science.org/content/article/should-dog-s-sniff-b...

artificialLimbs · on Nov 15, 2022

"A jury of your peers consists of 12 people who were not smart enough to get out of jury duty."

iudqnolq · on Nov 15, 2022

Keep in mind that they're in an artificial environment designed to lead them to that decision. One of the judge's jobs is to ensure experts are appropriately qualified. Another of the judge's jobs is to restrict what the jury is allowed to hear.

Jach · on Nov 15, 2022

Why not just have the device occasionally send sha256 sums of chunks to a third party service, like Twitter tweets, where it's clear you can't forge the date of the message? If you need to produce some chunks, the matching hashes provide an independent time stamp showing you didn't just produce the content at any time. This sort of trick is already commonly done to demonstrate prior knowledge of something at a later point in time without having to reveal it just yet (if ever).

ramblerman · on Nov 15, 2022

Great idea, but are courts tech savvy enough to accept this already?

Even with a tech expert to explain it, I worry the opposition would just get their own expert and make a whole mess of it, confusing both the judge and jury enough to cast doubt.

Perhaps I have a very wrong view on how both such evidence is presented and accepted though.

Jach · on Nov 15, 2022

Courts are generally more tech savvy than techies like to give them credit for. But it's worth mentioning that in recent years several US states have already passed legislation expressly forbidding courts from denying such evidence (even/especially if put on Blockchains rather than a more traditional non-decentralized network/database), and you can find similar stuff in other countries around the world (even China). And of course in lower courts (like small claims) or even mediation the standards are a lot looser, there's not even a jury.

If you want a more thorough view of the rules in the US (which states deviate from to some extent), you might like to browse https://www.law.cornell.edu/rules/fre

xyzzy123 · on Nov 16, 2022

You could queue it up to send chunks of reassuring alibi video (or checksums of same) while you are doing your murders or crimes.

Put another way, the possibility that you could have done that renders the timestamping useless.

Jach · on Nov 18, 2022

f4d4cb1c06b690f5d6ee5d9012f743c330511effdb24e55b04e602f87d38d89e

I could have produced this hash at any point in the past, but after HN's edit window expires, one can be very certain I did not originally produce it at any point after the timestamp. This is far from useless, even if it doesn't cover all possible objections. The objection in the original comment that it does cover is: you just made up this footage/other evidence post-hoc to establish an alibi now that you've been accused, as people frequently do, or you can't prove you didn't make it up post-hoc so we want to get it tossed out as non-reliable.

RobRivera · on Nov 15, 2022

I had an awful breakup with a woman with tortuous tendencies and a false sense of how to abuse personal injury law and she keeps google home devices and car devices recording practically every moment of her life. as well as apple watch broadcasting her location at all times.

I observed she would always accuse me of things i never did in front of these cameras to get me to give false confessions. thankfully i am as blissfully honest to all people i meet, sometimes to my own detriment, so i never admit to fabricated stories.

given that context, can you point me to judicial precedents where plaintiffs had their self provided footage weakend due to the idea of fabricating false narratives with devices, selectively natrowing contexts etc.

vineyardmike · on Nov 17, 2022

> can you point me to judicial precedents where plaintiffs had their self provided footage weakend due to the idea of fabricating false narratives with devices, selectively natrowing contexts etc.

I was part of an organization that had communal space we wanted to install a camera in because one of our members was leaving a mess and we wanted to know who.

I’m not a lawyer, nor a judge, nor an expert but I’ve hired a lawyer about the above cameras before installation.

First of all, there’s no “selectively narrow contexts”. If you can submit footage into evidence, the courts incl the other lawyers can request different moments in time if you have it (eg for surveillance cams).

Our lawyer advised us to never start/stop the camera, never delete footage or generally mess with it in any way without a paper trail of why. Any footage should auto delete after a period of time, and not be manually done ever. Basically any manual manipulation of the footage could be suspicious if a crime were to occur where people might suspect the footage to have captured it.

I assume if our lawyer warned us that much, there’s probably a case history of trying to plant a narrative with footage.

hirundo · on Nov 15, 2022

> Further it can be used to shine a light on corrupt officials

Little Brother surveillance. It would be nice not to be surveilled at all, but since that's not an option the answer to "quis custodiet ipsos custodes?" is us.

danuker · on Nov 16, 2022

Little Brother is one of Cory Doctorow's books: https://craphound.com/littlebrother/about/

I read it and it was as fun as it was chilling.

thingification · on Nov 15, 2022

In times past, it was obvious that it wasn't an option to avoid pervasive violence (by orders of magnitude compared to today).

It was, though.

ClumsyPilot · on Nov 15, 2022

Also it was obviois that ut wasnt an option to avoid pervasive slavery.

Or a class system where lords and kings have more right than you do, although we are kinda bringing that back.

runnerup · on Nov 16, 2022

As a kid in the 90's in a city of 150,000 people it was stupidly easy to do most of an entire day's adventures anonymously and with either very little or entirely no record of my presence and activities. Yes, there were some cameras here and there, but you knew where they were and could avoid them. Cash was still accepted everywhere, you didn't have a cell phone and definitely didn't need a cell phone to go about your daily activities[0].

I definitely was able to walk/bike/bus/carpool wherever I wanted all day long as young as 6 years old and no one gave a shit except my parents. If I was lost (which happened often!) I'd just ask an adult to help me find my parents. Adults generally interacted with me either only in a professional context (as a clerk/ice cream truck salesman/etc) or if I went up to them and explicitly let them know I wanted their help.

Not all my friends had the same level of freedom but most had enough to go to any of the parks in the nearby neighborhoods and play with friends until their individual family's established "curfew".

If kids stayed out past their curfew, one parent would call each of the other parents and the other parent would drive around the area parks looking for their kid. Another kid or parent would generally be able to point them in the right direction and clear it up within 30 minutes.

Occasionally my parents might forget to pick me up from sports practice and I'd sit outside the ice rink or school for 2-4 hours until they figured it out, usually in cases where I spent all my payphone quarters on food or arcade games.

0: https://en.wikipedia.org/wiki/The_Scoots

xerox13ster · on Nov 15, 2022

Who watches the watchmen? The watched.

rzzzt · on Nov 15, 2022

How many Watchmans would a watchman watch if watchmen could watch Watchmans?

ben_w · on Nov 15, 2022

None, because justice is blind ;)

danuker · on Nov 16, 2022

Good thing the executive and legislative branches are not blind.

They are keenly observant of the benefits coming their way.

klyrs · on Nov 16, 2022

It's the same show no matter matter how many times the watchman watches it, so one.

rzzzt · on Nov 16, 2022

I was thinking of the small television that Sony produced: https://en.wikipedia.org/wiki/Sony_Watchman

stcredzero · on Nov 15, 2022

a managed record of our own activities can be extremely valuable

I've thought of this as a hardware product: A device that records your own voice and non vocal sounds, but which does not record the words of others. (That, plus maybe location and a video stream, provided one is in a location without "a reasonable expectation of privacy.")

Perhaps it doesn't even have to be hardware at this point! Maybe this could be installed as an app on an older smartphone?

fragmede · on Nov 16, 2022

a throat mic would do the trick

stcredzero · on Nov 17, 2022

Except, if it still faintly picks up the speech of others, then that's a violation of the law, and it opens one up for lawsuits.

roberdam · on Nov 15, 2022

Since everyone is interested in the hardware:

https://www.aliexpress.us/item/3256803349510543.html

https://www.aliexpress.us/item/3256803085687061.html

the particular choice was for the battery and the other for the size, both are generic and come with the same software and bios, several vendors, if I could buy something better I would look for one that can have a lavalier microphone

nelsonenzo · on Nov 15, 2022

I wanted to do this exact project - record audio all day and then have AI process it - to identify behavior outburst of my autistic toddler.

It's critical information for early diagnosis and treatment, but it's really hard to capture the data while also dealing with the actual situation. Being able to send the sounds he makes to his therapist could also be usefull when then are trying to get him to mimic sounds and talk.

With that said, is the audio AI open sourced? The part that analyzes the audio stream?

Thanks for the links to the hardware, also a really important part!

rockemsockem · on Nov 15, 2022

I would guess that they're using OpenAI's Whisper, which is open source: https://github.com/openai/whisper

It does speech-to-text, then you can use the full force of all the text analysis tools that are out there.

matthewbarras · on Nov 16, 2022

I've thought about this a lot.

My 8 y.o is Autistic and when he was little, I was struggling to catch evidence to provide to Speech and Language Therapy. I wanted a way to always record and have an easy way to pull out the key points.

Now I would love to correlate background noise (level and context) with meltdowns. We know babies crying set him off, as that's obvious, but would love to analyse further to spot other trends.

roberdam · on Nov 16, 2022

this one will suit you well, even has a magnetic back so you can attach it to something, https://www.aliexpress.us/item/1005003535825295.html

roberdam · on Nov 15, 2022

that's a fantastic use case!, the easiest way (and the one i'm currently using) is by upload the audio manually on :

https://replicate.com/openai/whisper

narag · on Nov 15, 2022

Thank you for the links and for the article. How long can record the smaller one? Actually if it can record for a day, it'd be enough for me.

I used to record all phone calls, until EU made Xiaomi remove the feature. It was very useful because I always could take notes later if they sent me a number, contact name or appointment hour.

roberdam · on Nov 15, 2022

At 128kbps the MP3 takes about 56mb per hour, I got the 16gb, so you have a lot of time, the battery of the smaller one I read is 800 mAh , according to the docs should last around 2hrs, but I try to recharge it as soon as I can

narag · on Nov 16, 2022

Thank you, hmmm... you wrote 2hrs, I guess it's a typo. In the page it's 20 and more than enough for my use case. And even the 4GB is overkill if you make a daily dump.

roberdam · on Nov 16, 2022

my mistake, according to the description "Working Time: About 7hours on one Charge Can store up to 96 hours of audio", I haven't let the battery run out yet

philote · on Nov 15, 2022

The Ali Express link says "Continuous recording:20hours". But since they offer sizes from 4GB to 32GB it's unclear which storage size that's for. That 20 hours could also be how long the battery will last while recording. But 20 hours is still enough to last the day and then some.

gruez · on Nov 15, 2022

It's probably the battery life. 128 kb/s AAC is effectively transparent even for music, and only translates to 1.1GB. Even if it's uncompressed (1 channel 16 bit 44100hz PCM), 20 hours only translates to 6.35 GB.

cbsks · on Nov 15, 2022

Super cool project! How do you carry the microphones on your person? The big one looks like it wouldn’t clip to a shirt very easily. Does it pick up your voice from your pocket?

roberdam · on Nov 15, 2022

thanks!, I try it on my shirt pocket but now I have it hanging from my neck with a badge rope as close to the mouth as I can

uncletammy · on Nov 15, 2022

Yeah but I want the software! Will you open source it? I'd contribute!

roberdam · on Nov 15, 2022

I'm doing it simple for now, transcribe it by uploading the files to colab or replicate.com, then using regex to extract the commands, the panel is in rails but nothing fancy so far.

As I clarify in the article: This is a “proof of concept” and not yet ready for production, everything described here works but probably “glued with tape”, several of the processes are probably not automated or polished.

Void_ · on Nov 15, 2022

Not as hardcore as OP, but after Whisper came out, I quickly built an app that allows me to record from lock screen: https://whispermemos.com/

runjake · on Nov 15, 2022

This app apparently sends data to their servers. If you don't want to share this information, you can use an app like Lockflow (https://apps.apple.com/us/app/lockflow-lock-screen-shortcuts...) to put an Apple Shortcut on your home screen.

That Apple Shortcut could be the Dictate Text action hooked to create/append to an Apple Note (thereby not leaving your device) or fire off an email or send a message via your favorite bot service (Discord/Telegram/Slack/etc).

Bonus: That Shortcut will also work on your Mac.

There's also the minimal friction app Just Press Record (https://apps.apple.com/us/app/just-press-record/id1033342465), which will transcribe and has a decent Shortcut library.

Void_ · on Nov 15, 2022

Yeah I tried JPR before but missed the workflow of sending it to my email. (Maybe they have it and I didn’t notice)

Also Whisper is better for my Slovakian accent.

toss1 · on Nov 15, 2022

That looks like it's iOS only; I've been using a similar app on Android, Voiceliner, but it doesn't yet also record from the lock screen. That would definitely make it more useful!

jconley · on Nov 15, 2022

This is a cool project. One of my pet ideas that I haven't done is to build a home assistant where all data is stored and processed by a home "server". The biggest benefit I see is that it could truly be omnipresent. There in the background, answering questions, jumping into your conversations without prompt. And it's much less creepy if all that data isn't going to someone else's computer.

Also piping in and processing the data from my mobile would be cool, but I wouldn't want to invade other people's privacy if I'm in public.

WR-Iso · on Nov 16, 2022

SAID this before physical VPN and open source cloud is exactly what Im trying to make a reality VM to the TVs not to mention we all need that Hillary Clinton privilege AS WELL as when you pull in at home you car uploads. updates, charges (manuel plug for now) and tie that in to the obd reader software that alone could invaluable also having two devices is the best solution one staying the car gives me a internet connection to my car for remote access does my maps or music the other does my Hud

jeffbee · on Nov 15, 2022

It does not sound like a realistic capacity plan. The reason this works in the cloud is the inference can be run in parallel on a huge amount of hardware for a short time. To run those kind of models on your rinkydink computer would take forever.

fragmede · on Nov 16, 2022

An Nvidia 3090 GPU can run open ai's whisper at 17x realtime[0]. they're not exactly cheap (~$500?), but they're cheap enough that running the transcription end at home is quite feasible. And, it includes translation, so you don't have to do it in English.

Searching all of a downloaded copy of Wikipedia wouldn't be that computationally expensive either if the assistant has hot words it picks up to look up.

[0] https://news.ycombinator.com/item?id=32928207

fennecfoxy · on Nov 17, 2022

It's also possible to use local AI processing chips like Coral or Gyrfalcon for this.

Could just load up a pcie card full of them if necessary. A local home AI would be such a boon to people, not just the average person but the elderly as well, combined with a refined GPT etc it could conversationally respond to requests rather than most assistants' current request->response "I am a robot" scheme.

>Your son called when you were asleep to ask if you wanted to get coffee today, shall I call him back for you or put you through to him? >X, you've fallen! Please let me know you're okay or I will call emergency services for you

It's sad that we have the technology to do this already but haven't.

Firmwarrior · on Nov 15, 2022

Respectfully, I don't think that's true. "The Cloud" is just computers in a warehouse somewhere

$5/month's worth of "cloud" is going to work out to be less actual raw CPU resources than a low end raspberry pi running full time in-house

exitheone · on Nov 15, 2022

I don't actually think they're true.

One second of Google cloud TPU has roughly the same number of floating point operations then 4 hours of raspberry pi 4B time.

So 3 minutes of cloud TPU time already covers your whole month of raspberry pi usage. Pretty sure it costs them less than 5$ as well, since they have the hardware anyways.

jeffbee · on Nov 15, 2022

"The cloud" is also massively parallel software. If I run a Google search, many thousands of CPUs will be brought to bear on my query, and a gazillion DIMMs, and all the throughput of a hell of a lot of SSDs, and so on. If you just happened to have a copy of the web, and an index of it, on "a computer" no matter how big, it would be impossible to get prompt answers.

If Google (or whomever) needs to run voice models, they take your query and all the other queries that arrive in the same millisecond, smoosh them all together and shove the batch into a TPU and run it. You don't have any TPUs and you also don't have any traffic you can use to amortize the cost of your infrequent queries.

The idea that you could run these kinds of ML inference tasks is economically fanciful. You would need a huge investment in hardware and the opex would be ridiculous.

vineyardmike · on Nov 17, 2022

> The idea that you could run these kinds of ML inference tasks is economically fanciful. You would need a huge investment in hardware and the opex would be ridiculous.

Google, Apple, Amazon and even Sonos are all releasing voice assistants that work locally on their relatively low powered speakers.

Apple seems to be ahead with what is local, while Google seems to be the smartest. (Sonos doesn’t have a cloud, but it’s not ‘general purpose’ afaik).

Sure you can’t amortize them across a bunch of TPUs BUT instead they can ship custom hardware. A tpu needs to be big and support parallel streams. A home server may only need to ever serve one stream. There are arduino style devices that can perform basic tensor flow audio models in real time now. And obviously most phones can perform this locally now, so depending on opinion that may be considered affordable.

arcturus17 · on Nov 15, 2022

I don’t think a $5 instance is enough for ML/AI workloads. You need something with a GPU.

tsejerome97 · on Nov 15, 2022

this is also one of my pet ideas, but I keep procrastinating. Have your idea transformed into any kind of repos that we can contribute to?

danuker · on Nov 16, 2022

> One of my pet ideas that *I haven't done*

I suspect OP was clear enough.

But there exists https://mycroft.ai/

https://github.com/MycroftAI

roberdam · on Nov 15, 2022

MORE INFO ON THE DEVICES:

https://www.aliexpress.us/item/3256803349510543.html

https://www.aliexpress.us/item/3256803085687061.html

both recorders are using the same generic bios, you have a .txt file called FACTORY.TXT, by changing the values of the file you configure the device, this is the content of the file.

---------------

TYP:1 (0:WAV 1:MP3)

VOR:0 (0:voice-activated off 1-7:voice-activated sensitivity,higher means record less)

BIT RATE:2 (0:32Kbit 1:64Kbit 2:128Kbit 3:192Kbit 4:Translate ON 5:512Kbit 6:768Kbit 7:1024Kbit 8:1536Kbit 9:3072Kbit)

GAIN:5 (0-7 record sensitivity 8 grades)

SECTION:(30) (1-999 record time exceed this,file will auto save,uint minutes)

DATE:2022-10-15 (year-month-day)

TIME:08:36:24 (hour:minute:second)

TIMER:1 (timer record 1:on 0:off)

START:08:39:32 (timer record start time)

TIMELONG:(120) (1-720,timer record length,uint is minute)

CYCLE:(030) (1-999,how many dyas,0:everyday)

--------------------------

I got the 32gb version of the bigger one and the 16gb version of the smaller one.

I configure the device to save a file each 30m, each 30m mp3 file takes 28.125kb, so around 56mb per hour at 128kbps

codeisawesome · on Nov 15, 2022

Thanks for the post! I find that other voice assistants (eg. Siri) are not particularly able to detect the activation command when there's any background sound (like music with lyrics). How does your system perform against this?

I understand that you're doing batch processing asynchronously and so any immediate task isn't affected; but it's arguably even more of a problem where you record a task, put it out of your mind, but then the AI fails to detect the command because it got confused by the background?

[EDIT] I see you've sort of responded to this already at this comment: https://news.ycombinator.com/item?id=33612155

zestyping · on Nov 16, 2022

How do you get the files off the device? Do you have to manually take out the SD card, put it into your computer, and copy the files over, every single day? I'd never be able to keep up a habit like that consistently, so I'm wondering if you found a more convenient way to transfer the data.

roberdam · on Nov 16, 2022

both recorders work as USB drives, once a day upload all your files, but is just drag & drop

justinlloyd · on Nov 15, 2022

Interesting work, glad to see I am not the only crazy one left in the life logging scene after all these years. Have been lifelogging since 2004-ish, and built a few custom bits of software and hardware to support it. I don't record 24x7 anymore, but I used to. Now my recordings are limited mostly to my office environment, and when I am out and about using a Sensecam-like device with custom firmware. When in my office I capture video, audio and depth data from multiple view points, along with images of the desktop of whatever computer I am on, and process most of it on a Jetson.

How's the audio quality on those devices you link to in other comments? I find I pick up a lot of ambient noise when outside of the office, and always struggled to come up with a viable algorithm and model to differentiate "background chatter" from the main conversation, and it is a problem I've never really managed to solve so I am interested in your experiences on the subject.

roberdam · on Nov 15, 2022

> Have been lifelogging since 2004-ish

Hopefully new advances in AI will let you try new things with your old recordings

> How's the audio quality on those devices you link to in other comments?

Decent, quality is directly proportional to the distance between the microphone and the mouth, but can't expect too much from 30$ devices.

>and always struggled to come up with a viable algorithm and model to differentiate "background chatter" from the main conversation

Yes, that's a big problem to solve, you can try Pyannote's Diarization https://lablab.ai/t/whisper-transcription-and-speaker-identi...

that will be a next step for the experience

L0in · on Nov 15, 2022

Do you mind sharing your experience, why you started, what you want to get out of this etc? I'm interested to read your experience.

askafriend · on Nov 15, 2022

Have you seen the show "My Strange Addiction"?

L0in · on Nov 15, 2022

This one https://www.wikiwand.com/en/My_Strange_Addiction?

No i haven't.

askafriend · on Nov 16, 2022

Yep that one.

Transisto · on Nov 25, 2022

Any specific episode is relevant?

justinlloyd · on Nov 15, 2022

Not interested.

bane · on Nov 16, 2022

This is really interesting and many of the comments here go into the utility of this, however, verify that you aren't recording somebody else without their consent, in many places it is illegal to record to conversations without the other party's consent.

I only ran across this problem years ago when, due to a serious potential workplace issue I suggested somebody basically "wear a wire" and record their workday to catch some HR problems. We found out that the state this was occurring in had a two-party consent law and violating it was not a great idea.

krageon · on Nov 16, 2022

This "be careful of recording" line is harmful, because folks tend to read it and assume that it is universal - there are places where it is unilaterally allowed if you have good reason to suspect not having things recorded will result in your rights being abused. Then there are places that just don't care if you do.

prmoustache · on Nov 17, 2022

I don't know of any juridiction where it is illegal to record someone else.

What is usually illegal is broadcasting or making that record available to someone else.

bane · on Nov 22, 2022

There are many examples, for example in the state of Pennsylvania, all parties must consent. While it is a felony to record and then playback without consent (you are correct in this case), it is a punishable misdemeanor to violate privacy (broadcasting is not required) with a fine of up to $5000 in the first violation and 2 years of prison and/or up to a $5000 fine after the first violation.

hoosieree · on Nov 16, 2022

Does this mean it's illegal to have an Alexa in your cubicle?

ranguna · on Nov 17, 2022

No because the terms of services and privacy policy says that you accept being recorded if you use alexa.

voakbasda · on Nov 16, 2022

I think it does, but I believe that we are still waiting for someone to test that theory in court.

AndrewKemendo · on Nov 15, 2022

Expanding on the structure the OP created, this is how I see us getting to human level AI:

1. Record video sound etc... (trajectories) egocentrically

2. Analyze the data and assign reward labels (more/good, less/bad) to state and transitions actions

3. Use the reward feedback and trajectories to build the policy for some set of actions in certain environments

This is why I'm bullish on anything sousveillance - so AR cameras on your head, always on mics etc...

The challenge is doing this democratically, without it being intermediated by a giant for-profit mega corp that doesn't care about you and wants to mess with your head

pa7ch · on Nov 15, 2022

Honest question,how does this make the lives of humans better?

AndrewKemendo · on Nov 15, 2022

Well, for example. Lets say that I have a goal BMI I want to maintain

If I reach for the Oreos, I can choose to have a flag set with a heuritic I created myself that will tell me:

"Having 5 oreos means you need to reduce other calorie intake by n calories to maintain your BMI"

That data can also be aggregated to give me my macro/micros for everything I've eaten etc... without me having to log it like I do now

Think about it as the ultimate personal assistant and all you need to do to instrument it is attaching a camera and mic to your face. You can decide what your goals are, and this kind of instrumentation will capture the data that you need without you having to actually annotate everything.

Your personal life API

unbalancedevh · on Nov 15, 2022

> all you need to do to instrument it is attaching a camera and mic to your face

It's funny that this is a reasonable thing to say.

crtified · on Nov 15, 2022

Likewise, if we go back to 1995, and tell a tech-fearing farmer that within 20 years he and all his salt-of-the-earth colleagues would soon be voluntarily (and gladly!) carrying in their back pocket (in the form of a cell phone) a small cheap generic device connected 24/7 to global corporate networks, with built-in high def cameras, microphones, location detectors, and data gatherers, and would casually store much of their personal and financial information within them.

They would find that notion preposterous. But now, some short years later, they would give it barely a thought.

AndrewKemendo · on Nov 15, 2022

It really is.

I've been in CV since 2009 and it's face melting how many things we thought were impossible are effectively "solved."

bick_nyers · on Nov 15, 2022

I've been thinking about doing this for a while now, cameras all over the house hooked up to ML algorithms that help you audit and tweak your behavior towards some specified goal.

When I used to play video games ultra competitively, I would analyze recordings of my gameplay to try and get better, and it worked wonders.

ar_lan · on Nov 15, 2022

I've honestly thought about recording my work sessions to do the same thing. RescueTime works _somewhat_ to track when I get distracted on something, but moreso I'm interested in identifying when I do something suboptimally and playing it back to identify why I went with that path and try to course-correct the next time around.

bick_nyers · on Nov 15, 2022

Recording the desktop 24/7 and making those videos searchable could be an incredible tool as well. Text on screen as well as audio from meetings. If you didn't document how you initially configured something, you could just go back and watch what you did.

Edit: Another comment has informed me of rewind.ai, which does this on Mac, interesting!

redstonefreedom · on Nov 16, 2022

I think the oreo thing is probably doable now with things like habitaware. It’s most likely a very easily distinguishable motor pattern. Not sure they could be programmed to give you an oreo-specific-reminder, but that’s a design gap more than it is a technological one.

azeirah · on Nov 15, 2022

I'd just get really annoyed at that AI

...being annoyed increases stress, which increases appetite

AndrewKemendo · on Nov 16, 2022

Ok, then have it do something else?

The point here is, you could have it do anything you choose

...or just don't all together in which case, why comment?

_gfwu · on Nov 15, 2022

Sounds like a nightmare!

AndrewKemendo · on Nov 16, 2022

Perhaps the BMI/Food tracking example isn't one that resonates with you

Can you explain a bit more about what part of having a non-intermediated "personal API" (or whatever you'd call it) is nightmarish?

thingification · on Nov 15, 2022

Another challenge is industry-wide bad security fundamentals.

Godspeed to work/people like agoric.com and seL4.

cameronh90 · on Nov 15, 2022

All of the rooms/corridors in my house except my bathrooms are covered by cameras. My initial motivation for installing them was to keep an eye on what my pets were doing when I'm not around, but I find in recent years that if I misplace something, I end up tracing back my history on the cameras and finding where I left it.

It seems obvious that at some point, AI will be able to do that for me and I'll just be able to say "Alexa, where did I leave my glasses?", "Hey Google, where did I put my box of spare fuses?".

ocimbote · on Nov 15, 2022

I would 100% prefer to lose my keys rather than letting Amazon or Google in.

FYI: I have zero Alexa/Siri enabled device, zero automated home device, a degoogled phone, etc etc. So we might have different perspectives on the matter.

cameronh90 · on Nov 15, 2022

Each to their own. Personally the value these cloud/AI assistants give me is worth the loss of privacy. There's nothing I do that I think anyone would be especially interested in spying on, other than to try and sell me things.

Note that I don't think anyone should be forced into this sort of surveilance. It should always be a choice. I also support the open source projects to bring it back to individual control - it's just too much hassle for me, personally.

cwkoss · on Nov 15, 2022

There is no reason this technology needs to rely on consumers sacrificing privacy. The big players are trying to create that perception in the public so consumers will willingly sacrifice their privacy regardless.

The tech is there so someone could make a box with no external data transferred that could store and analyze video data. I would be a customer for sure for something that had this capability without the privacy concerns.

Google and Amazon say they want this data for quality control, but I suspect each of them have plans (if not active projects) for converting video inside people's homes into actionable marketing data.

vineyardmike · on Nov 17, 2022

> but I suspect each of them have plans (if not active projects) for converting video inside people's homes into actionable marketing data.

I suspect not. Besides the fact that it’s a whole new level of creepy and that alone is a PR mess, I doubt it’s that useful. Sure a camera in your home sounds perfectly useful for marketing but whose camera is positioned like that. Mine is aimed at the entryway door. The best you can get from that is presence. I suspect that’s true for most peoples home.

Beyond the question of actually data quality, data processing would have to be very expensive. You couldn’t run those models locally (because the object detection would be too complex and changing) so you’d need to stream to cloud. That would instantly be the largest and most expensive streaming platform ever, dwarfing YouTube or Netflix or anything. Not to mention the actual ML components of it.

I suspect smarthome companies don’t want the data and begrudgingly accept that some cloud is needed because people are notoriously bad at protecting backups (and remote monitoring is a convenient feature).

I question if the incremental increase in marketing revenue would exceed the technical costs.

gaucheries · on Nov 15, 2022

> the value these cloud/AI assistants give me is worth the loss of privacy

they've got you right where they want you.

csallen · on Nov 15, 2022

He seems to have them right where we he wants them, too. Mutual transaction. Everybody's happy.

ocimbote · on Nov 15, 2022

The benefits of the consumer measures in comfort or social status, mostly.

The benefits of the producer measures in dollars.

However you balance it, the producer wins. By many orders of magnitude.

And since were talking about privacy and personal data, the more consumers there are, the more the producers improve their margin on each and all consumers.

csallen · on Nov 17, 2022

Why is it a competition? If you give me a slice of cake, and I give you $5, and we're both happy with that, why should I care if you're somehow "winning" or I'm "losing"? That mindset seems like a self-fulling prophecy that robs me of my satisfaction.

Also, I would argue that receiving money does not mean the producer wins, since ultimately the producer is also a consumer, and who will thus be spending those dollars on the same things as every other consumer… comfort and status, as you put it.

phendrenad2 · on Nov 16, 2022

Vague snipes like this are generally not allowed on HN, FYI. (Source: I've done it myself too many times)

gaucheries · on Nov 17, 2022

you're falsely characterizing my observation as a vague snipe.

phendrenad2 · on Nov 17, 2022

Just trying to help.

wongarsu · on Nov 15, 2022

Just because the data isn't interesting to anyone right now doesn't mean that a future oppressive government won't use it against you

nano9 · on Nov 15, 2022

> There's nothing I do that I think anyone would be especially interested in spying on, other than to try and sell me things.

Do Uyghurs have something to hide and are worth spying on? How many times are we going to hear this argument? It comes only from a position of privilege. You're only uninteresting to be spied on as long as it's allowed by the security apparatus you depend upon. There's a reason we have sayings like "power corrupts"; dismissing the potential for abuse of a cloud-based unencrypted surveillance system is narrow-mindedness at best and subversion at worst.

Note: the above hardly represents me politically, it is just a counterargument against the perennially repeated "I have nothing to hide."

cameronh90 · on Nov 16, 2022

I'm aware of all those arguments and I completely agree with them in principle, but I genuinely would be SO far down the oppression list.

It's definitely a privilege to be the majority ethnicity and sexuality in a modern western liberal democracy, but it is what it is. The chances of the British government suddenly turning against white straight apolitical irreligious men are just so low it's not something I worry about.

What I worry about more are things like people breaking into my house, my dog chewing up the carpet and forgetting where I left my glasses.

I do hope that we can figure out a way to package all the privacy violating cloud-based services in a way that's simple to use, encrypted, local only, etc. though so perhaps more subversive people can enjoy these systems without worrying about oppression.

To be quite honest, the most privacy sensitive things in my life are probably my emails and documents, but those are all already in Google Drive and Gmail anyway, along with basically everyone else's. All anyone will get from my cameras is a stream of me feeding my rabbits, browsing tiktok and scratching my arse. GCHQ are welcome to tune in any day, provided they also help me pick out my clothes in the morning.

gmadsen · on Nov 15, 2022

i mean you could always run a home server for the automated home things. heating/ac and lights are nice things to automate

forgetfreeman · on Nov 15, 2022

Because spending tens of thousands of dollars in home infrastructure to avoid fiddling with the thermostat four times a year definitely makes sense.

progman32 · on Nov 15, 2022

My heat is controlled and automated with open source software for the grand total of about fifty bucks and a free surplus server.

forgetfreeman · on Nov 16, 2022

So DIY solutions exist that raise the cost to an education in computer programming and $50 in hardware...to avoid touching a thermostat four times a year. That's neat.

ocimbote · on Nov 15, 2022

I don’t know how that can be true. Can you tell us more?

horsawlarway · on Nov 15, 2022

It's incredibly easy to do (caveat - at least if you're familiar with software dev already).

Most thermostats are literally just digital thermometers that control a relay that turns the furnace/ac on and off.

A simple arduino (or much cheaper IC) can easily do the same thing if you wire it in.

And then on the software side... there's several large, open-source projects that exist in this space and provide nice api tooling for interacting with those devices. Things like:

OpenHab: https://www.openhab.org/

HomeAssistant: https://www.home-assistant.io/

HomeBridge: https://homebridge.io/

etc...

Even Alexa has basically drop-in self hosted alternatives like Mycroft: https://mycroft.ai/ or ADA/Almomd (now Genie) https://genie.stanford.edu/

It's not only true - I strongly suspect you can do it for much less than 50 bucks if you don't need the physical thermostat to have buttons/screens.

ocimbote · on Nov 15, 2022

Makes sense. My setup doesn't allow for that. Hence my ignorance. Good for you!

seba_dos1 · on Nov 15, 2022

I'm considering making an OpenTherm controller for my heating boiler, I just researched this topic a few days ago - it's absolutely true, there are ready-made Arduino libraries for that.

omvtam · on Nov 15, 2022

I inherited a 1980's model AC/Furnace and controlling the AC at least is extremely simple and cheap. A 12V relay in the compressor housing activating the 220V switch, connected to another relay controlled by a Pi zero which is controlled by yet another PI zero with a $10 DHT 22. A bash script check the temp and activates the compressor via SSh when the temp goes above 74F. The furnace control hasn't died yet so I haven't bothered replacing it. Putting the cooling system on IoT total cost = ~ $100

karaterobot · on Nov 15, 2022

What if you charged someone to build and install the same system in their house? You'd probably charge a lot more than $100, and that's what the real cost would be for most people.

bee_rider · on Nov 15, 2022

Nobody has suggested professional installation though, the original suggestion was just a nice home automatic project to play with.

ocimbote · on Nov 15, 2022

controlled heating for my flat with open source and Zigbee compatible devices would cost me ~1k. I did not calculate the ROI but break even looks like it’d take many years.

daveidol · on Nov 15, 2022

Curious: are you concerned about data leaks or you don't trust the employees to not access your user data? Or something else?

jacksnipe · on Nov 15, 2022

Not who you asked but: I’m afraid of the data being stored and available to anybody. As long as it’s out there, the government can compel others to give it to them; and companies can get acquired and structures and laws can change in such a way that the data gets in others hands perfectly legally.

Thus, I should only be okay with it if I’m okay with the “nothing to hide” argument, which I’m not.

ilyt · on Nov 15, 2022

Yes, yes, and I don't like anything home automation to be dependant on anything cloud. Enhancing function is fine but house that stops working right the moment internet link is down is a dystopia.

leobabauta · on Nov 15, 2022

"Dystopia" seems like a stronger word than applies here.

ocimbote · on Nov 15, 2022

Both and more. Having my data sold to 3rd parties is an obvious first. And if you think the terms of service are enough to cover you, see how fast they can change in everyday life and please reconsider. Plus, data can be sold pseudo-anonymously and build up a profile against which your identity is compared and metered, as in, for example, health insurance risks or crime potential.

Additionally, we, the consumers, have lost the right to own things. Or at least, if we do own things, it comes with all sorts of strings attached in the form of "features" or "connectivity". Which is just marketing lingo to say that you're feeding the cash cow.

ClumsyPilot · on Nov 15, 2022

Nah, is it much more efficient to distract you into loosing your box of fuses and manipulate you into buying new ones.

Alexa AI doesn't work for you, it's a hired gun in your house.

cameronh90 · on Nov 15, 2022

Just like 2001: A Space Odyssey - but instead of Hal trying to kill me, it just tries to get me to buy things I don't need.

"I'm afraid I can't do that, Dave, not until you watch this advert"

6stringmerc · on Nov 15, 2022

Wish they’d at least release a Christopher Walken package.

“I can…NOT find your ANswer.”

bhawks · on Nov 15, 2022

I agree that it is an obvious extension for AI to use this data at scale to help users. It also is obviously a huge temptation to abuse it for other purposes.

Wasting a few minutes in the morning to find my glasses is a small price to pay to not be watched and analyzed all the time. Let's not build our own panopticons.

ysavir · on Nov 15, 2022

When you do, can you invite me over to show me how it works? Then I'll test it for "Alexa, in which mattress does cameronh90 keep their savings?".

oh_sigh · on Nov 15, 2022

"Voice not recognized. Releasing the robotic attack dogs"

cameronh90 · on Nov 15, 2022

Like pretty much everyone in my country, I already entrust a bunch of private corporations to safeguard my wealth. Worse, there's nothing to stop my bank from suddenly deciding tomorrow that I don't have any money, and I don't really have any paper records to prove otherwise...

I figure any AI advanced enough to monitor everything I'm doing and where all my stuff is, is probably smart enough to know if it's me asking.

relaxing · on Nov 15, 2022

Just having a front door cam paid off immensely this year when I was able to prove that I had left the house with an item (that I later misplaced, and was able to recover with that knowledge.)

habibur · on Nov 15, 2022

Excellent idea. You can later search through your logs in the future for reference. As it's all in text.

Prior solutions posted on the net, had this take photo / record audio 24/7 features, but then those were stuck there. What next? What would anyone do with these data?

But this Hi Jarvis styled recording of text on the go is a very useful feature.

Another step ahead.

lijogdfljk · on Nov 15, 2022

I've wanted to do the same thing with my online activity as well. Chat logs especially. They tend to go into a void and finding an older log is weirdly difficult. I've wanted to log everything and then be able to apply better search algos (semantic search perhaps) to try and make my chat logs useful.

giobox · on Nov 15, 2022

Cellphones are placed amazingly well to provide this sort of search. Seeing the post about BeOS and its amazing metadata-driven BFS filesystem yesterday really makes you think what might have been had iOS and Android been more ambitious about filesystems instead of just re-applying the same old conventions from our desktop computers.

You should be able to just text search every phone call you have made on iOS/Android, today, similar to the automated voicemail transcription features already present etc etc.

roberdam · on Nov 15, 2022

I think the "total recall" search can be a killer feature

unsupp0rted · on Nov 15, 2022

I remember an Asimov short story in which scientists developed a machine that could see backward in time.

If I recall correctly, the upshot was the government became terrified because any machine that can see 1000 years into the past can also see 1000 milliseconds into the past and therefore functionally be used to spy on anyone in real time.

lbayes · on Nov 15, 2022

There was an article some years ago (2 or 3?), that described a drone (or drones?) that flew 24/7 over Mexico city taking high resolution video of the entire city at all times.

Whenever there was a crime, the police could zoom into that location at the time of the crime and then run backwards to see where the vehicles came from. They then knocked on that door.

I'm disappointed that I can't seem to find it using Google anymore, maybe it was from a movie or TV show?! That would be weird though, because it seems technically quite reasonable to achieve and hard to believe governments wouldn't jump on it.

garblegarble · on Nov 15, 2022

The term for this is 'WAMI' - Wide Area Motion Imagery[1]. Here's a Bloomberg article about an instance of it in Baltimore[2] (although this wasn't where I learned about it first, like you I can't find my original source either)

1: https://en.wikipedia.org/wiki/Wide-area_motion_imagery

2: https://www.bloomberg.com/features/2016-baltimore-secret-sur...

nl · on Nov 15, 2022

I remember the same article, although I can't find it now.

These two seem to reference the same demo, although neither are the article I remember: https://www.bloomberg.com/news/articles/2016-08-23/watch-thi...

There's this reference to it: https://www.pressreader.com/usa/the-washington-post/20140206...

https://www.bloomberg.com/features/2016-baltimore-secret-sur... has a lot more details

netsharc · on Nov 15, 2022

I've read about that happening in Cleveland, using tech developed to find insurgents leaving IEDs in Afghanistan. Yeah, citation needed...

lbayes · on Nov 15, 2022

So glad someone else saw this, I'm not finding anything on it and I'm starting to question my own memory, as I'm quite sure I saw the original article about the Mexico program on this site.

FWIW, I also recall the tech being originally used to find people who planted IEDs in Afghanistan.

I'm kind of shocked about how all the articles I am finding seem to emphasize real-time police chases.

Now I'm feeling super suspicious.

dorkwood · on Nov 15, 2022

I first heard about this on Radiolab. Maybe you heard it there too?

>> In 2004, when casualties in Iraq were rising due to roadside bombs, Ross McNutt and his team came up with an idea. With a small plane and a 44 mega-pixel camera, they figured out how to watch an entire city all at once, all day long. Whenever a bomb detonated, they could zoom onto that spot and then, because this eye in the sky had been there all along, they could scroll back in time and see - literally see - who planted it.

https://radiolab.org/episodes/eye-sky

lbayes · on Nov 16, 2022

I think you found it. Now I recall that episode exactly.

Apparently, my mind created some very visual memories from the narrative.

Thanks!

netsharc · on Nov 15, 2022

Well, a bit more googling ( https://www.google.com/search?q=police+drone+afghanistan+rew... ) got me just 2 relevant hits.

https://www.theatlantic.com/national/archive/2014/04/sheriff...

https://scholarship.law.uc.edu/cgi/viewcontent.cgi?article=1... (search for "rewind")

I'd rather think it's because Google sucks now, and those keywords just bring up too many similar articles, but my metaphorical tinfoil hat is my hands.

lbayes · on Nov 15, 2022

Nice job!

Your tips got me to this one, where it more clearly spells out the "rewind" capability. I think the problem was that the tech was attached to a low-flying, piloted plane, not drones.

https://www.csoonline.com/article/2226742/record-and-rewind-...

Whew! It feels better to set my tinfoil hat down on the table next to me...

filoeleven · on Nov 15, 2022

There was a website shown on HN a few years ago that used AI and plane transponder data to find circling planes which were presumably doing this kind of surveillance over American cities. It might have used further parameters to narrow it down, e.g. “over a city, circling for >3 hours” to rule out planes waiting to land. I thought it was named something simple like “plane-circles.com” but I’m not having any luck finding it again.

See also https://en.m.wikipedia.org/wiki/ARGUS-IS

Edit: found it. Should have limited the search to HN from the start. https://news.ycombinator.com/item?id=24188661

_tom_ · on Nov 15, 2022

There have been a few products that record everything you see on the web, so you don't have this problem. Obviously analogous to recording everything you hear.

https://www.searchenginejournal.com/all-about-seruku-search-...

alspaca · on Nov 16, 2022

https://www.pss-1.com/media-contacts

iwillbenice · on Nov 15, 2022

Not sure about the Mexico City drone, but a similar thing was developed by the US military: https://en.wikipedia.org/wiki/Gorgon_Stare

I know some folks who deployed during OEF/OIF and used these types of systems. Many a night raids were conducted simply by watching where attackers originated from.

SimonPStevens · on Nov 15, 2022

Different author, but sounds somewhat similar to 'The Light of Other Days' by Arthur C Clarke and Stephen Baxter.

Although iirc correctly it starts with being able to see other locations in space but at the same time, and the historical viewing is a second development.

Fantastic book, even if it's not the same one you were thinking of.

nonrandomstring · on Nov 15, 2022

Pretty sure both those authors wrote similar concepts, with the same creepy conclusions of taking the technology to a limit.

It came up in an acoustics class once. I said that sound never really dies. It just bounces around until it becomes thermal energy, thus warming the room a little as a prelude to joking about professors talking hot air.

A student asked whether, one could recover sound from reverberations that had fallen below RT60? Could you listen back in time to conversations that had happened hours ago?

Obviously entropy can't be put back in the box with the technology we have now, but it makes you wonder.

Two things have since made me revise the question. One is recovery of sound from video images. The other was an archaeological recovery of sounds from a ceramic vase spun on potters wheel many centuries ago. Sorry but the references for both escape me atm.

progman32 · on Nov 15, 2022

The pottery record thing was tested on mythbusters and hailed from an episode of csi.

nonrandomstring · on Nov 15, 2022

Fake? Got a link so I can dig in a bit. Thanks.

EDIT: found this thread

https://groups.google.com/g/sci.archaeology.moderated/c/5Jec...

Damnit, seemed so plausible.

abruzzi · on Nov 15, 2022

Clark also used it as a throwaway line in Childhood's End. IIRC, humans were given a device that would allow them to see the past--most religions didn't survive seeing the true origins of their faith.

andrewla · on Nov 15, 2022

It was The Dead Past [1]

The idea of it was that it was known that the technology existed, but the government went to great lengths to imply that it could only see into the far distant past. The reality was it could only see 20 years back or so, and the government was covering it up because of the 1000 milllisecond issue.

[1] https://en.wikipedia.org/wiki/The_Dead_Past

unsupp0rted · on Nov 15, 2022

Yes, that was it! Nice find :)

creativeembassy · on Nov 15, 2022

I wonder if this was an inspiration to the "Devs" miniseries. Won't say more about it for fear of ruining it. Amazing show.

warrenm · on Nov 15, 2022

Sounds very similar to the guy talked about in Albert-László Barabási's book (either Bursts, or Linked ... don't recall which atm) - he was photoing/videoing his whole life, but never of himself - ie, the camera was always facing outward (like a policeman's bodycam)

sixstringtheory · on Nov 15, 2022

The entire topic and many posts in this comment page also sound like things straight out of The Circle and The Every by Dave Eggers.

warrenm · on Nov 16, 2022

I had forgotten about The Circle :)

tegiddrone · on Nov 15, 2022

I did an experiment where I lived for awhile with a sony recorder/mic on me 24/7. It was nice to be able to refer back to conversations and events when I wanted them. Biggest issue was sorting through the data-- timestamps and recorder bookmarks were OK but I really needed full text search on the audio. It would have been great to tag via `Robert, mark timestamp, end Robert`. AI seems to be required, especially when dealing with wind noise and other issues (like the mic twisting around and all of a sudden one channel is my heartbeat.)

The sony voice recorder out there easily last 24 hrs on 1 AAA battery.. dumping to mp3 on a large sd card.

troydavis · on Nov 15, 2022

I did a similar experiment in about 2005 using a small iRiver iFP [1] and reached the same conclusion.

It needed a physical "Something interesting just happened" button that could be annotated later. At the time, creating custom hardware as well as the entire software/service stack was more than I was willing to bite off.

The iFP is tiny, roughly a 4" long by 1.5-2" cylinder. It easily covered a full day, the silence detection worked great, and quality was fine when used in a pocket or on a belt. Basically, the stuff that I expected to be difficult was already solved.

[1]: https://en.wikipedia.org/wiki/IRiver_iFP_series, https://www.cnet.com/reviews/iriver-ifp-790-digital-player-r...

specialist · on Nov 15, 2022

Excellent. Just terrific.

My future perfect system also logs my location and what I'm doing. And probably health metrics too, like heart and breathing rate.

Instead of initiating my exercises, I just want to say "Robert, start jog". The "modal" nature of my Apple Watch's Activities really frustrates me.

I don't want to take notes while I'm listening to a podcast. I'm generally doing something else at the time. I just want to say "Robert, bookmark". And magically a link will be made to whatever I'm listening to at the time. (Audio book, radio, stream, podcast, whatever.)

Ditto identifying songs (Shazam!).

I don't want to fart around with exchanging contact information. My hands are usually full or whatever. Just say "Robert, contact info" and then repeat out loud whatever I hear.

I also want to rewind after the fact. When trying to recall a tidbit, I'll remember the song, where I was (eg while walking the dog), who I was with, what I was eating. So if I want to remember which podcast I was listening to while at the park, I'd just start with my location log and jump over to my podcast listening log.

What could be more simple?

FWIW, I'm still waiting for my "bicycle for the mind".

PS- I've tried, half-heartedly, to use the voice recorder app, and notes with voice transcription. But then it quickly becomes a treasure hunt. And my attempts to do this stuff with Siri just leaves me more frustrated.

Thanks for listening.

Great project. Please keep us posted on updates.

roberdam · on Nov 15, 2022

thanks!, you should try to transcribe your recordings now for free with whisper and see what you can make of them: https://replicate.com/openai/whisper

dsalzman · on Nov 15, 2022

I've been experimenting with this recently as well, but with an app on my apple watch. Looking for a method/model to split different speakers into different tracks to only look at audio from myself and certain people.

dsalzman · on Nov 15, 2022

Someone is experimenting with diarization (speaker identification) + Whisper here https://github.com/openai/whisper/discussions/264

fragmede · on Nov 15, 2022

If you know how many speakers there are, https://twitter.com/dwarkesh_sp has it working here:

https://colab.research.google.com/drive/1V-Bt5Hm2kjaDb4P1RyM...

waprin · on Nov 15, 2022

Ahh I’m working on exact same project. I applied to YC with the idea and was told that “nobody wants this” during the interview.

There’s a ton of problems in the space around privacy and UX. But I’m incredibly excited about projects in this space because in modern society we’re basically surrounded by a million unhealthy things designed to tempt us. Logging forces you to “stay honest”. I’ve been shocked already by how many unhealthy habits I underestimated and how many healthy habits I overestimated.

My #1 priority is just to improve my own physical and mental health. Whether there’s a market for this stuff, who knows.

Good luck!

dsalzman · on Nov 15, 2022

My original inspiration is to better understand how I talk to others and study my own behavior

waprin · on Nov 15, 2022

A noble goal. One of my bad habits I've been tracking and trying to reduce is rude behavior to people, online or in-person.

TOMDM · on Nov 15, 2022

Check out this model, I've had limited success with it. Best I've done so far is to just add the labels it gives to the overlapping segments whisper spits out, which means some sentences have multiple speakers, but that's mostly the case because of cross-talk. I'd say it gets it right ~80% of the time with the 5 speakers I've done it on across ~16 hours of audio.

https://huggingface.co/pyannote/speaker-diarization

dsalzman · on Nov 15, 2022

I will!

roberdam · on Nov 15, 2022

Speaker identification is the next step, you might want to read about Pyannote's Diarization:

https://lablab.ai/t/whisper-transcription-and-speaker-identi...

jordanlwalker · on Nov 15, 2022

we're experimenting building out a version of this too, but on desktop with www.usebacktrack.com - should have splitting speakers/inputs early next year and seeing what that's like

hipjiveguy · on Nov 18, 2022

what app are you using on the apple watch?

miguelrochefort · on Nov 15, 2022

Here's a 24/7 background audio recorder app I made for Android. The impact on battery and storage is surprisingly reasonable.

https://github.com/miguelrochefort/eardrum

gajus · on Nov 15, 2022

I like this. It vibes with a language learning app concept idea I recently shared out loud.

https://twitter.com/kuizinas/status/1591867392220594183

dotancohen · on Nov 15, 2022

I've been doing this with Anki.

When I have a conversation with someone in a language that I'm learning (was Russian and Greek, now Arabic) I record the conversation. I then get both native-speaker audio to add to Anki for the things they said, plus I get a list of words that I either needed to use or that the other person used, to add to Anki.

A secondary benefit is that this system encourages me to go out and seek interactions with people, a clear benefit for a natural introvert.

apienx · on Nov 15, 2022

Well done!

Got a similar PoC that uses Tasker to record sound on my phone, Whisper to convert it to text, and neatly organizes everything into Obsidian.md. The continuous recording kills the battery life on my phone so it's only usable if you don't mind going around with a powerbank. Would be great if a manufacturer would put in a separate low-energy chip with a good ADC.

P.S. "Active functions" with custom home automation is easy as pie with joaoapps's suite. I use BusyBox to SSH into a Pi with a Tellstick Duo. And some RFID tags for the system to know where I am (e.g. bedtime routine gets triggered when I place my phone on the bedside table). But yeah...traffic goes thru Google.

roberdam · on Nov 15, 2022

you should write about it!

sorwin · on Nov 15, 2022

How would this work with other voices, like a coffee shop, would it hear those simultaneously, and interupt a command?

Also, how do you handle using OpenAi whisper, seems like they do 30 second intervals - would that be an issue if your command is cut off mid word?

roberdam · on Nov 15, 2022

For now I try to give the commands when there is not much noise, but you can lower the gain of the microphone so that it only record my voice.

The 30 second limit is not a Whisper model limit, but a limit some of the free online "try whisper" put.

rolisz · on Nov 15, 2022

I think he means that even whisper segments the audio into 30 second bits and does transcribing on them and then stiches everything together.

frontman1988 · on Nov 15, 2022

The future will definitely have devices which record visually/verbally all your life. VR headsets are already able to record all your facial expressions. A google glasses like gear which records all your life is pretty much possible in the near future. The future influencers won't have to carry a phone/camera to create vlogs, they would just see wherever they want and the glasses will record not only the thing they are seeing but also their expressions. Privacy will probably not be such a big thing as now given most people with each generation are increasingly becoming more and more comfortable sharing their whole lives online.

yannyu · on Nov 15, 2022

Ted Chiang explores this idea in a short story called "The Truth of Fact, The Truth of Feeling" (https://devonzuegel.com/post/the-truth-of-fact-the-truth-of-...), which takes place in a world where commercial, individual, always-on recording exists. Ted Chiang also wrote the short story that the movie Arrival was based on.