Malicious Subtitles Threaten Kodi, VLC and Popcorn Time Users

ConfucianNardin · on May 24, 2017

Was annoying to find the details.

Looks like PopcornTime was rendering subtitle text as HTML, inside their app (html/js-based), creating an XSS vector (looking at https://github.com/popcorn-official/popcorn-desktop/commit/a..., https://github.com/butterproject/butter-desktop/pull/602). Likely the javascript runtime they're using allows file access and execution of arbitrary executables, enabling the metasploit shell shown in the demo.

For VLC there are a bunch of out of bound reads and heap buffer overflows.

    f2b1f9e subtitle: Fix potential heap buffer overflow
    611398f subtitle: Fix potential heap buffer overflow
    ecd3173 subsdec: Fix potential out of bound read
    62be394 subsdec: Fix potential out of bound read
    775de71 subtitle: Fix invalid double increment.

The article implies that VLC and the others are affected by the same issue (leading to code execution), but according to available information it seems to be completely different issues.

The Kodi issue was a zip archive path traversal (i.e. no protection against zip files extracting files to parent directories).

AdmiralAsshat · on May 24, 2017

Thanks for that. I read the article and was really confused at first. I don't do a whole lot of video editing, but I've opened up a .srt file a handful of times and noticed that it was nothing more than timestamps and text. The fact that the article made it seem like this was some kind of universal vulnerability made me wonder, "A simple subtitle file should be opened in read-only mode. Are these programs just reading whatever is in the .srt file and EXECUTING it!?!" That would be beyond horrible.

The fact that it's multiple, independent vulnerabilities makes me feel a little better. I've used Kodi and OpenSubtitles before while watching a movie to search and download subs for the movie without ever leaving Kodi. When it works, it's nothing short of magical.

jbk · on May 24, 2017

> The article implies that VLC and the others are affected by the same issue (leading to code execution), but according to available information it seems to be completely different issues.

Yes, those are very different issues.

From what I understood, one is an XSS (popcorn-time), one is a heap-based buffer overflow (VLC), and one is a zip-transveral (Kodi).

And tbh, I don't see how you can exploit the bug for VLC (with ASLR and HEASLR).

pjmlp · on May 24, 2017

Easy, you cannot count with an executable being always compiled and executed in an OS with ASLR and HEASLR enabled.

So it becomes a game of luck getting some users exploited.

driverdan · on May 24, 2017

Thank you! I was frustrated when I saw this last night and it didn't contain any details. I assumed buffer overflow but different attacks for each is more interesting.

leni536 · on May 24, 2017

Yeah, the article was a bit poor on details. I expected some libass or other common library/codec vulnerability.

the8472 · on May 24, 2017

Are those things vlc-specific or is there a common vulnerability shared with the underlying libs (libavcodec, libass?)

easuter · on May 24, 2017

If only VLC had been re-written in rust this would never have happened. For shame.

dang · on May 24, 2017

We ban accounts for trolling, so please don't do that here.

Also, you've posted many uncivil and/or unsubstantive comments. We ban accounts for that too, so please don't do that either.

easuter · on May 24, 2017

[flagged]

jackhack · on May 24, 2017

No, that's not the point and now you're antagonizing the moderating team. You were asked, and warned, "Do not do that" so, don't do it under your usual name, or a throwaway. There is no shortage of places on the internet to cause trouble. This isn't one of them.

O5vYtytb · on May 24, 2017

Yes but it would also not have nearly as many features as it does right now.

remotehack · on May 24, 2017

Java would work much better for VLC.

easuter · on May 24, 2017

No. Java doesn't have fearless concurrency, zero-cost abstractions or move semantics.

thaumasiotes · on May 24, 2017

"Fearless concurrency" and "zero-cost abstractions" sound a lot like meaningless marketing terms.

rdiddly · on May 24, 2017

Dunno about "fearless concurrency," but "zero-cost abstractions" and "move semantics" are straight off the front page of rust-lang.org. So they're kind of marketing-ish in trying to make you go "hmm sounds intriguing" and click to find out more.

jarman · on May 25, 2017

"fearless concurrency" is meaningless marketing, "zero-cost abstractions" is valid term

nradov · on May 24, 2017

Feel free to rewrite VLC in Rust. No one is stopping you.

usefulcat · on May 24, 2017

Pretty sure that was sarcasm.

lawl · on May 24, 2017

Oh look, it's the Rust Evangelism Strike Force at work again.

dang · on May 24, 2017

Please don't react to provocation by making the thread worse (a.k.a. please don't feed the trolls).

lawl · on May 24, 2017

How are you so sure it's trolling? Someone from the Rust Team has defended such behavior as good marketing [0].

[0] https://lobste.rs/s/wq6eov/changes_i_would_make_go#c_ofxtj1

dang · on May 24, 2017

One is never 100% sure, but note that the first part ("Please don't react to provocation by making the thread worse") holds regardless.

OneLessThing · on May 24, 2017

I did security research on VLC on Windows a year or two ago. I may be remembering incorrectly, but last I recall every module was protected by ASLR. Which means that remote code execution is not likely because there is no scripting or network comms to dynamically create a valid ROP chain.

I also didn't check for executable heaps at the time but given that all heaps are non executable (which they really shouldn't be executable in VLC) again I don't see how RCE is possible. Maybe there is some way to validate and therefore brute force addresses? I don't know. But there was no VLC POC and I'm sure they would have made one if they could have.

Use VLC it's the most secure media player I've seen.

yegle · on May 24, 2017

ROP: return oriented programming ASLR: Address space layer randomization

Having ASLR is not bullet proof to remote code execution, e.g. iOS has ASLR for a long time and can still be jailbroken (which usually involves a code injection etc). The key is info leak, e.g. if you somehow can reliably find the memory location of open() syscall, the memory location of the whole libc can be inferred, and libc is usually large enough to construct a ROP chain. (I haven't work in security area for a long time so correct me if I'm wrong).

The researcher unable to provide a POC for vlc could simply mean it's hard due to ASLR, but it's not impossible.

Also: I believe ASLR is a compiler option (with a supported OS), it should be relatively easy for Kodi and Popcorn Time to start using ASLR.

luch · on May 24, 2017

Most of moderns RCE POCs lift off a scripting engine (VBS for Office, Javascript for browsers, ActiveX for Flash, etc..) in order to facilitate exploitation. The only ones which does not use a script engine are POC exploiting a "network" vuln (like SMB).

Scriptless 0day RCE is still possible in a ROP+ALSR world, but exploitation is a real bitch. Ex : https://scarybeastsecurity.blogspot.fr/2016/11/0day-exploit-...

OneLessThing · on May 24, 2017

1) ASLR: address space layout randomization 2) Yeah libc is commonly ropped against (though you'd need to check with a linux guy) 3) Yes ASLR is a compiler option (/DYANMICBASE for windows). For windows a flag exists in the PE header, probably something similar in ELFs. When loaded the modules are fixed up so pointers and such are correct.

jbk · on May 24, 2017

That's also why we're perplexed at the supposedly code execution.

Also, the security researcher did not provide a demo for the VLC exploit. Their demo is only on Kodi and popcorntime.

But anyway, security issues means releases.

Animats · on May 24, 2017

every module was protected by ASLR.

Address space randomization is not "protection". It's a form of security by obscurity. The odds of an exploit working are reduced, at the expense of more crashes due to exploit failure.

It helps developers ignore bugs, since they can no longer reproduce them.

throwaway91111 · on May 24, 2017

In my experience, bugs are almost always easier to reproduce with address randomization. It's easier to see a process leave readable/writable memory than it is to see it overrun a buffer and only trash app code.

"Only" security by obscurity is the best we can get in the c/++ world without compiling for a virtual machine.

alasdair_ · on May 25, 2017

>Address space randomization is not "protection". It's a form of security by obscurity.

This is somewhat akin to saying "Randomly generated passwords are not 'protection'. They are a form of security by obscurity."

If things are random enough that an attacker is significantly hampered in most cases, that's one measure of security, no?

saurik · on May 25, 2017

It is going to vary quite a bit depending on the entropy of the ASLR implementation. Many have only had 8-12 bits of entropy to start with, and you sometimes don't need the full address. It is also important to note that services that crash typically restart, allowing retries (sometimes as many as you want). In this case, one might imagine trying to attack thousands of people: some of them will randomly work (and a lot of users are going to see VLC crash and will retry playing the file a number of times, increasing your probability).

OneLessThing · on May 24, 2017

Total facepalm to this comment.

Does modern ASLR increase costs (time, difficulty, money, skill, etc.) necessary for exploitation and decrease benefits (privs, chances of success, etc.)? If yes, then it's a protection. Any security engineer will tell you unequivocally ASLR is a protection. And one of the most successful ones to date.

legulere · on May 24, 2017

Keeping the location secret is just like keeping the key secret in encryption. You also wouldn't call that security by obscurity.

Still you're perfectly right that ASLR does not provide perfect safety, but merely makes exploitation way harder.

jbk · on May 24, 2017

> It helps developers ignore bugs, since they can no longer reproduce them.

Well, it would crash, so they can reproduce it, no?

foobarrio · on May 24, 2017

Off topic: I love VLC but can't get it to use hardware acceleration on my late 2015 mac. 4k 60fps @ 40mbps consume all CPU if I try to play a lower compression 150mbps video it studders and all my fans turn on. mpv and quicktime play the same videos with 15-20% CPU. The poor performance of VLC on my macOS makes it a no go for me.

jbk · on May 24, 2017

Try 3.0, this is fixed.

resoluti0n · on May 24, 2017

Kodi 17.2 with the fix for this flaw has now been released:

https://kodi.tv/article/kodi-v172-minor-bug-fix-and-security...

kutkloon7 · on May 24, 2017

The thing that most amazes my about Popcorn Time is how they find the subtitles. It seems to succeed even when I can't find subtitles myself.

More related to the article, you would think that subtitles are literally the easiest file format in existence to safely handle. It's incredibly well-defined in terms of textual data and times.

FranOntanaya · on May 24, 2017

> literally the easiest file format in existence to safely handle.

Well, which one of them. There's nearly a hundred different subtitle formats, and each one has a whole set of variants. Just Timed Text alone (XML) can have more layouts than one could count, specially since it's meant to be able to replicate technically all previous industry formats.

kutkloon7 · on May 29, 2017

Let me phrase it this way: one would expect that the class of file formats for subtitles are easiest to handle (as opposed to say, the class of file formats for images or videos).

On the other hand, images and videos are likely to be handled using some library, which might be better at safely handling the files.

amptorn · on May 24, 2017

> it's meant to be able to replicate technically all previous industry formats

Even the DVD subtitle format, which is just a mostly transparent image overlaid on the picture? In XML?

FranOntanaya · on May 26, 2017

Yes, in the TTML2 spec https://www.w3.org/TR/ttml2/#embedded-content-vocabulary-chu...

etix · on May 24, 2017

They use a hash function to match subtitles.

http://trac.opensubtitles.org/projects/opensubtitles/wiki/Ha...

tydok · on May 24, 2017

The problem isn't only about matching subtitles to movies but also where to look for subtitles, e.g. opensubtitles.org, subscene.com, etc.

heinrichf · on May 24, 2017

A great script: https://github.com/Diaoul/subliminal

tydok · on May 24, 2017

Very interesting. It contains a few subtitle providers I've never heard before. Thanks for posting it.

bingojess · on May 24, 2017

I believe it just scrapes them all. I can't remember the last time opensubtitles didn't have a sub I was looking for

tydok · on May 24, 2017

Perhaps the most difficult problem is to find a subtitle in multiple languages.

bingojess · on May 24, 2017

True I have only needed English subs

stordoff · on May 24, 2017

> More related to the article, you would think that subtitles are literally the easiest file format in existence to safely handle. It's incredibly well-defined in terms of textual data and times.

Depends on the format. SSA for instance can have embedded font and image files, which presumably have much more complex decoders.

phkahler · on May 24, 2017

It seems subtitles aren't important enough to have reduced the number of formats. From reading the comments, it seems like the world would benefit from a single format with most capabilities and have everyone convert all files to that. Until then, we need players that understand everything.

_jomo · on May 24, 2017

These are the VLC commits adressing the issue:

https://github.com/videolan/vlc/search?utf8=%E2%9C%93&q=subt...

pjmlp · on May 24, 2017

As usual, the common set of friends we already know since the 80's:

> Fix potential heap buffer overflow

> Fix potential out of bound read

> Fix invalid double increment.

airza · on May 24, 2017

i've never seen a double increment exploited before- it's undefined behavior, but what is the typical route against that?

mikeash · on May 24, 2017

The double increment itself isn't undefined behavior. Note that the two increments were separated with a semicolon, making them separate statements. It's equivalent to pzs_text += 2;.

The exploit would presumably involve structuring your data so that the excess increment skips over a terminator of some sort. If it's scanning until it hits a zero byte, and you get it to skip over the zero byte, then you have a buffer overflow.

airza · on May 24, 2017

Ahh, that does make sense, i didn't see that.

pjmlp · on May 24, 2017

Use a memory safe language that doesn't require direct pointer manipulation to access string and memory buffers, with an optimizer able to elide bounds checking if proven safe to do so.

aruggirello · on May 24, 2017

I don't seem to find a way to update VLC to 2.2.5 on Ubuntu (or Debian, or Mint for the matter). I understand Canonical does not provide updates in the repos - but the VideoLAN website's download URL for Ubuntu is just "apt://vlc" - it would be nice to be able to download one or more .deb's too.

Do we have to build it from source?

agnivade · on May 26, 2017

Damn, building from source is nigh impossible. Given the sheer amount of plugin libraries that need to be installed. - https://wiki.videolan.org/Contrib_Status/

thresh · on May 24, 2017

You can use snaps, but they are currently broken due to build issues.

pawadu · on May 24, 2017

Holy crap, that code doesn't look good. I predict we will see more exploits for this project.

Maybe we should stop random people from contributing to complex C projects?

unwind · on May 24, 2017

Wouldn't go that far from reading a single commit, but to anyone looking to pick up tips from a well-known respected C codebase: don't ever write

    (*(psz_text + 1 ) ) == '~'

when you can instead write

    psz_text[1] == '~'

Fewer tokens means less overhead for the human reader, and that asterisk-and-add pattern is exactly what the bracket array indexing operator does, so why not use it? This is one of my many C pet peeves, heh.

Also on a more personal note, if you're going to be putting things inside parentheses with whitespace, make it symmetrical.

viraptor · on May 24, 2017

"random people"? You mean there's some select group we know of that doesn't ever write bugs? (DJB doesn't make a group)

pawadu · on May 24, 2017

The main VLC developer is an amazing programmer. But if he uses his time to shave cycles off some SIMD decoding algorithm then boring things like file processing is done by random jr. developer.

The problem is that boring stuff can also be very security sensitive.

aclsid · on May 24, 2017

You are more than welcome to contribute and since you have a very strong opinion it seems you know your stuff, so go for it, nobody is charging a dime to work there in any case.

pawadu · on May 24, 2017

> you have a very strong opinion it

Yes I do, this is internet after all!

> seems you know your stuff,

Now you lost me :)

pjmlp · on May 24, 2017

That was my hope when C was just gaining market share outside UNIX, and here we stand now.

jbk · on May 24, 2017

Look at FFmpeg and all the multimedia libraries and you will be horrified.

pawadu · on May 24, 2017

I thought they cleaned up after the last round of exploits?

jbk · on May 24, 2017

hahah :)

I wish :)

eth_hero_12 · on May 24, 2017

vlc has a bug and yet you talk shit about well developed and fuzzed by google projects. thats why vlc will never be better than mpv.

jbk · on May 24, 2017

FFmpeg, VLC, MPlayer, libdvd*, libxvid, x264, libflac, libvorbis and all the other have multimedia library codebases started in the late 90s/early 2000. Noone cared much about security at that times.

All those projects are under-funded, done by volunteers, on countless platforms, doing very low-level stuff, and supporting many formats.

This has nothing to do with one project or another.

eth_hero_12 · on May 24, 2017

thats sad to hear, I didn't know volunteers did so much

smcleod · on May 24, 2017

Interestingly running VLC 2.2.4 on MacOS 10.12 and checking for updates returns 'VLC 2.2.4 is currently the newest version available.', obviously I downloaded 2.2.5.1 from videolan.org but still odd.

jbk · on May 24, 2017

The update will be deployed today or tomorrow in the updaters.

zuck9 · on May 24, 2017

Is that a default behavior or something you chose to do?

What if there's a bigger security fix you need to push to people asap?

jbk · on May 24, 2017

It is something that we chose to do.

We usually let between 24hours and a few days before doing an upgrade, seeing the possible regressions.

From tag to release to updates can take only 4hours, if we want enough mirrors.

muterad_murilax · on June 4, 2017

Well, 10 days later and 2.2.4 is still shown as the latest version when trying to upgrade... :/

anon1385 · on May 24, 2017

Same here. It appears to check http://update.videolan.org/vlc/sparkle/vlc-intel64.xml for updates and the newest version listed there is 2.2.4

zippoxer · on May 24, 2017

Can confirm the same on Windows. I downloaded the newest version manually as well.

jbk · on May 24, 2017

2.2.6 is deployed.

greggman · on May 24, 2017

AFAICT every plugin to Kodi has full machine access. Subtitles of course you don't expect to install malware but I wish plugins ran in a sandbox

pawadu · on May 24, 2017

Slightly related to this: where can I find data sanitizers for common file formats (PDF, MP3 and so on)?

rsync · on May 24, 2017

I strip all mp3 metadata using the 'id3mtag' tool[1].

  id3 -d *.mp3 ; id3 -2 -d *.mp3

That deleted all tags - v1 and v2 id3 tags.

I don't do this for security - I just don't like mp3 metadata competing with metadata in the filename and most mp3 metadata is laughably bad anyway[2] so I just wipe it.

[1] /usr/ports/audio/id3mtag on FreeBSD

[2] Misspellings, First Last instead of Last, First, ALL CAPS ALL THE TIME and using special characters/unicode that always breaks car stereo implementations.

chii · on May 24, 2017

what counts as sanitizing? How do you know a file is malicious?

Piskvorrr · on May 24, 2017

Especially with PDFs, my "sanitization" can be your "stripped away all the fonts and functionality - might as well have given me a plain .TXT", and vice versa.

rsync · on May 24, 2017

"might as well have given me a plain .TXT""

Yes, please - that sounds fantastic.

Piskvorrr · on May 24, 2017

I agree - but it's 1.surprisingly complicated for a general solution (positioning and such), and 2.not really a solution for the usual end user (who might appreciate a JPEG instead)

Piskvorrr · on May 25, 2017

(btw there's `pdftotext`, which is pretty good in most cases)

pawadu · on May 24, 2017

Read data according to spec, drop stuff that is incorrect and write it back.

For example if MP3 genre field is 999 bytes long cut it down to 32 bytes.

runeks · on May 24, 2017

Can anyone recommend a video player written in a memory-safe language for OSX that handles MKV files? Or is the simple truth that the problem lies in the parsers, which are shipped as a library written in C, because no sane developer wants to rewrite parsers for 25 different subtitle formats when writing a video player?

jbk · on May 24, 2017

There are none. You can use VLC inside VLC sandbox, but you won't get something perfect.

peruvian · on May 24, 2017

What about mpv? That's my preferred video player.

rossy · on May 24, 2017

mpv is not affected, at least by these four vulnerabilities. They all seem to be specific to each video player, rather than affecting shared code or code in open source multimedia libraries.

m1el · on May 24, 2017

While I too prefer mpv, I suspect that there are plenty of vulns in that player.

Filligree · on May 24, 2017

It's written in C, so I imagine that's almost guaranteed. In this case obscurity helps to protect you, however.

sparaker · on May 24, 2017

It would be interesting to see which subtitles are using these vulnerabilities and what they are achieving with them. We could estimate how long this has been around.

mplewis · on May 24, 2017

This is another reason you should use a tool like a parser generator when you have to parse untrusted data, rather than writing your own parser by hand.

janpio · on May 24, 2017

Does anyone know if the subtitle hosting services added checks for this as well?

soylentcola · on May 25, 2017

This is interesting to me for reasons outside of anything to do with exploits or malware. A while back I had a bit of a brain fart while playing with my Hue bulbs: would there be a way to use the subtitle track for a video to encode time-controlled data that can be sent to/read by another application that sends these values to a set of Hue bulbs or similar devices for synchronized ambient lighting?

I figured that subtitles were an obvious place to start because you can download them in small files, play them back alongside a video, and they are designed to be "timed out" to synchronize with a video already.

I looked into it for a bit but never really found a way (within my abilities at least) to do anything like this from within a .srt file or similar. I'd be interested in hearing if anyone else has more info on how you might do more with that "framework" than displaying text on screen.

Filligree · on May 24, 2017

Speaking of Popcorn Time, last I heard there were a couple of forks and doubts about the safety of each and every one.

Is there any more clarity around the situation now?

captainmuon · on May 24, 2017

Wow, that is bad. I'm always amazed by such vectors in supposedly passive formats, like fonts, images, and so on.

There is no excuse that these kind of applications are not completely sandboxed. All you need is some kind of DLL, raw data in, raw pixels out. In case of hardware accelerated codecs, raw pixels in, surface pointer in, nothing out. There is no need to be able to access the filesystem, etc.. To render subtitles on top of the video it's the same.

I wish a fraction of the energy we put into DRM would go into sandboxing instead.

jbk · on May 24, 2017

Ha, the famous sandboxing remark. I wish it was that simple!

So, let me share some light on the sandboxing for multimedia (I work on VLC).

If you sandbox an application like VLC, in the current way of doing sandboxing, which we've done for macOS, WinRT/UWP, and snaps, you still need a lot of permissions.

Namely:

- you need to be able to open files without user interactions (no file picker), in order to open playlist, MXF or MKV files;

- you need the same if ever you have a database of files (media center oriented);

- you need raw access to /dev/* to play DVD, CD and other optical disk (and the equivalent on Windows);

- you need ioctl on such devices, to pass the MMC for DVD/Bluray;

- you need raw access to /dev/v4l* for your webcams and be able to control them;

- you need access to the GPU stack, which is running in kernel-mode, btw, to output video and get hw acceleration;

- you need access to the audio stack, also in low-level mode;

- you need access to the DSP acceleration (not always the GPU);

- on linux, you have access to x11 for the 3 above features, which is almost root;

- you need access to /etc/ (registry) for proxy informations, fonts configuration and accessibility;

- many OpenGL client libraries need access to the /etc too;

- you need access to the network, as input and output (think remote control);

- you need access to the system settings to disable screensavers, and adjust brightness;

- you need access to mounts to be able to see the insertion of DVD/Bluray/USB/SD cards and such;

- you need to expose an IPC (think MPRIS on Linux);

- you need to unzip, untar, decrypt, decipher and so on;

- you need access to the fonts and the fonts configuration (see fontconfig).

and I probably forgot one or another case.

The point is, all those features have good reasons to exist and very good use cases; but the issue is that for a media player, it will request almost all permissions except GPS and address book.

And quite a few of them are very close to kernel mode.

So, what is the solution?

Probably do a multi-process media player, like Chrome is doing, with parsers and demuxers in a different process, and different ones for decoders and renderers. Knowing that you probably need to IPC several Gb/s between them.

I've been working on such a prototype, but it's a lot of work... I accept donations :)

phkahler · on May 24, 2017

Thanks for that. This type of thing comes up all the time. I used to wonder how web sites could be so dangerous, but it becomes clear when you think about all the extra access developers wanted for good reasons - imagine a web browser that didn't have access to the file system, and so on. I still don't like this state of affairs, but I don't have an alternative solution. Wayland should be more secure than X, but they're starting to poke holes in there for various reasons (color picker, warp pointer for compat, etc...).

viraptor · on May 24, 2017

Not even multi-process. Threads on Linux can have their own seccomp profiles. You don't need to sandbox absolutely everything at the same time either. In this case opening the file in the main, unrestricted app and spawning a new thread that will read from the existing FD and only send you simple, time sorted messages over a shared IPC/pipe is not that crazy.

Other points may be more tricky, and it's a good list of potential issues, but we can start chipping away some stuff right now. There's a lot we can fix without fixing everything at the same time.

jbk · on May 24, 2017

> Threads on Linux can have their own seccomp profiles.

Not on Windwows or on macOS.

> new thread that will read from the existing FD and only send you simple, time sorted messages over a shared IPC/pipe is not that crazy.

Of course that does not solve anything, because your demuxer|decoders|output needs access to the FS, have access to kernel-mode and those are the dangerous parts.

zeveb · on May 24, 2017

> > Threads on Linux can have their own seccomp profiles.

> Not on Windwows or on macOS.

It's a shame, then, that Windows & macOS are holding back security improvements for software running on Linux. I understand (& even agree with!) your desire to have a sandboxing mechanism which runs acceptably on all supported systems; it's just sad that this security mechanism in the Linux kernel can't be taken advantage of in vlc.

jbk · on May 25, 2017

Well, no. Because you can do it per-process. I don't see the reason of doing it per threads here.

viraptor · on May 24, 2017

I'm not sure what you're trying to say. Yes, I meant Linux. Yes, it can solve the issue of separate subtitle files, which this article is about. Read access to an existing FD is not the same as full FS access, and there's no demux involved here.

jbk · on May 24, 2017

> Yes, I meant Linux

The demo is on Windows. The goal is to do a sandbox that works on most OSes.

And, it will not solve the decoder issue, since it is on the decoding side, which still has access to the GPU/Aout and the kernel.

> Read access to an existing FD is not the same as full FS access, and there's no demux involved here.

You're totally missing the point here. The issue is demuxers/decoders/output, not really the access.

Reading from an FD or not would not solve the buffer overflow exploitation (if it was actually exploitable).

dom0 · on May 24, 2017

> Not even multi-process. Threads on Linux can have their own seccomp profiles.

Feels kinda pointless, since all threads in a process share the same memory protection.

viraptor · on May 24, 2017

They don't have to. Clone can do a lot of magic without full processes.

jbk · on May 24, 2017

But then you need to copy the memory from the decoder to the video output or you get back to the same problem to work-on.

Filligree · on May 24, 2017

No, you can use a shared memory segment for a buffer just for that.

It's more coding, certainly, but it's possible. Security is an option if we wanted it.

jbk · on May 24, 2017

That's exactly the point above. See my above comment.

Filligree · on May 24, 2017

I'm not sure, it sounds like you're saying we'd need to copy memory.

The shared memory segment can be a GPU image buffer, so I don't think that's true.

jbk · on May 24, 2017

See comment above with "the solution".

Either you need to have multi-process and correct IPC, or you need to copy.

johncolanduoni · on May 24, 2017

> Probably do a multi-process media player, like Chrome is doing, with parsers and demuxers in a different process, and different ones for decoders and renderers. Knowing that you probably need to IPC several Gb/s between them.

That's not actually how Chrome's renderer sandboxing works. Both Windows and OS X allow you to share a GPU-resident texture between processes (DXGI shared surfaces and IOSurface respectively), so there's no need to copy any video data.

jbk · on May 24, 2017

But you need to pass data from the access to the stream_filter, from the stream_filter to the demuxer, from the demuxer to several decoders, from the decoders to potentially a few video-filters and chroma-converters, and then finally to the output. Each of them need different access policies, and several of them require FS access.

The last part is just one of the issues, very far from all of them.

Seriously, stop thinking that noone has given a thought to the question...

rossy · on May 24, 2017

These shouldn't require IPC at GB/s speed either. Modern sandboxes, like the one in Chrome, have a broker process which can open filesystem objects, device objects and sockets (file descriptors or handles) and pass them to a sandboxed decoder/renderer process, so there would be no need to stream filesystem data to the sandbox when the sandbox could do the file I/O itself. Even for Matroska ordered chapters, where the demuxer would have to tell the broker which files to open, the broker could enforce certain rules, such as enforcing that local mkv files only reference other local files, the files are all in the same directory, and that the files are always opened in read-only mode.

As for isolating decoders from video filters and chroma conversion, I'm not sure why that would be necessary, since those shouldn't require any additional privileges. I understand that retrofitting an existing program to use a multi-process sandboxing model is far from easy, and I'm definitely not volunteering to do it, but I don't think there is anything specific about a video player that is harder to sandbox than a web browser.

jbk · on May 24, 2017

> I understand that retrofitting an existing program to use a multi-process sandboxing model is far from easy, and I'm definitely not volunteering to do it,

Yes, that's the core of the issue.

johncolanduoni · on May 24, 2017

I don't think nobody has thought about it, but since you were apparently unaware that there was an alternative to performing several GB/s of IPC for moving buffers around there's obviously some options that haven't been considered. The Chromium sandbox has to deal with every issue you've listed (it's even calibrated to run ffmpeg inside the sandbox, since that's something Chromium needs to do).

jbk · on May 24, 2017

> but since you were apparently unaware that there was an alternative

I will refrain from answering to such attacks. As you seem to know better, I'm waiting for your patches.

johncolanduoni · on May 24, 2017

It's not an attack. Each platform's methods of GPU IPC are pretty sparsely documented. Two months ago I wouldn't have known about them; I only learned by working on integrating Chromium's sandbox into an application that needed to work with the GPU within a sandboxed process.

That doesn't change the fact that none of the things you listed are unsupported by Chrome's sandbox model, and if you only need to establish a barrier around the video pipeline (and not e.g. VLC's ability to notice device status or interact with webcams) you don't even need 3/4 of what Chromium's sandbox has implemented. Like I said, I've actually walked the walk when it comes to using their sandbox for Windows and Linux with a process that needed to access certain user files, the GPU, and even each platform's font server equivalent, so this isn't me just spitballing about some theoretical solution.

justin66 · on May 24, 2017

What features could the OS offer you (to help your program be "sandboxed") that it currently does not?

jbk · on May 24, 2017

I think we can do everything now for the majors OSes, but I'd guess this is a 50-100 man-month work for VLC.

fulafel · on May 24, 2017

You don't need special fast IPC. Even uncompressed video is fine over standard IPC.

jbk · on May 24, 2017

Blurays are 60Mbps.

Then with 40k60 + HDR, displaying is quite a lot of bandwidth.

fulafel · on May 24, 2017

A 10 year old PC carries 2.1 GiB/s (= 17 Gbps) over bog standard pipes without tuning or parallelism, as measured by "pv /dev/zero | cat > /dev/null". Uncompressed full HD is 1.5-3 Gbps. (Less actually, since codec output is going to be 4:2:2 or similar)

Yeah, you can come up with high bandwidth scenarios like stereo VR 144 Hz 4k HDR running on barely capable hardware. But 99% of users don't require such tricks and never see any upside from the performance-over-security compromise.

Even if you decide basic IPC is not fast enough, a shared memory buffer for raw frame data is reasonably secure too.

jbk · on May 25, 2017

Knowing that today we still see bandwidth issues in VLC, even without IPC, I kind of doubt it.

FrozenVoid · on May 24, 2017

All this means Linux is misdesigned for user apps, forcing low-level code instead of proper APIs. Maybe stuffing everything into the kernel isn't a good idea after all? All these things are exploit attack surfaces.

adgasf · on May 24, 2017

Interesting.