Programmer and sysadmin (DevOps?), wannabe polymath in tech, science and the mind. Neurodivergent, disabled, burned out, and close to throwing in the towel, but still liking ponies 🦄 and sometimes willing to discuss stuff.

  • 3 Posts
  • 54 Comments
Joined 1 year ago
cake
Cake day: June 26th, 2023

help-circle


  • There is some irony to be had, in discussing this stuff on a page that starts by asking me to login, then to be good and disable my ad blocker, only to proceed with keeping half the text of the article as images so you can’t copy+paste it… and even all the comments!

    Anyhow…

    https://www.boredpanda.com/tik-tok-reverse-engineered-data-information-collecting/?utm_source=twitter&utm_medium=social&utm_campaign=organic

    😈 Thanks for telling us where you got the link from, I didn’t really care. 😁

    Static backup (possibly): https://archive.is/UD2SA

    *Phone hardware (cpu type, number of course, hardware ids, screen dimensions, dpi, memory usage, disk space, etc)

    Check out: https://amiunique.org/fingerprint

    No app needed!

    Using that as a baseline… the CPU type, memory usage, disk space, etc. are some extra data points freely available to all apps.

    A developer can distribute an app with multiple versions, some targeting more modern and capable devices, some older and more limited. It’s a feature, not a bug!

    *Other apps you have installed (I’ve even seen some I’ve deleted show up in their analytics payload - maybe using as cached value?)

    This is overreaching for an app that has nothing to do with managing other apps. Still, you may want some app with those capabilities… so let’s call it “sus”.

    *Everything network-related (ip, local ip, router mac, your mac, wifi access point name)

    Your IP is… well, you’re using it to connect, they will see it, duh.

    The rest is overreaching and comes into PI violation terrain, but can be used for geo location… the OS does it, that’s the data it uses to fine-tune the GPS’s location.

    *Whether or not you’re rooted/jailbroken

    Typical feature for banking ad DRM protected apps. Nothing to see here.

    *Some variants of the app had GPS ping- ing enabled at the time, roughly once every 30 seconds - this is enabled by de- fault if you ever location-tag a post IIRC

    Best answered by a comment [1] (SEE BELOW).

    TL;DR: more DRM stuff.

    *They set up a local proxy server on your device for “transcoding media”, but that can be abused very easily as it has zero authentication

    This is somewhat sus, but a local proxy by itself, doesn’t mean any sort of risk, or that it could be exploited.

    For example, Tor can be accessed using a local proxy (although VPN mode is safer).

    The scariest part of all of this is that much of the logging they’re doing is remotely configurable,

    Not exactly. It’s how feature flags, and remote testing/debugging works too.

    and unless you reverse every single one of their native libraries (have fun reading all of that assembly, assuming you can get past their customized fork of OLLVM!!!) and manually inspect every single obfuscated function.

    This is worse (why do they use a custom OLLVM fork?), and obfuscation usually means they have something to hide. It’s the opposite of security for the user.

    They have several different protections ir. place to prevent you from reversing or debugging the app as well. App behavior changes slightly if they know you’re trying to figure out what they’re doing.

    Not good, but unfortunately allowed. That behavior is shared by both DRM protected software, and malware.

    There’s also a few snippets of code on the Android version that allows for the downloading of a remote zip file, unzipping it, and executing said binary. There is zero reason a mobile app would need this functionality legitimately.

    False.
    There are two legitimate reasons: plugins, and DLCs.

    It can be used for shady stuff, but is also a “feature, not a bug”.

    On top of all of the above, they weren’t even using HTTPS for the longest time. They leaked users’ email addresses in their HTTP REST API, as well as their secondary emails used for password resets. Don’t forget about users’ real names and birthdays, too. It was alllll publicly viewable a few months ago if you MITM’d the application.

    Well, that’s just stupid, there is zero reason to send data unencrypted.

    They encrypt all of the analytics requests with an algorithm that changes with every update (at the very least the keys change) just so you can’t see what they’re doing.

    Ehm… this is the correct behavior. See previous point.

    They also made it so you cannot use the app at all if you block com- munication to their analytics host off at the DNS-level.

    Sus… but see the introductory part of this comment. Should boredpanda also be banned?

    TikTok put a lot of effort into preventing people like me from figuring out how their app works. There’s a ton of obfuscation involved at all levels of the application, from your standard Android variable renaming grossness to them (bytedance) forking and customizing ollvm for their native stuff. They hide functions, prevent debuggers from attaching, and employ quite a few sneaky tricks to make things difficult. Honestly, it’s more complicated and annoying than most games I’ve targeted,”

    This is bad, and a reason to use FLOSS apps… but since it’s been an accepted behavior for Privative Software, along with DRM… don’t blame the player, blame the game.

    No, seriously, blame the DMCA and friends. There is no way to at the same time “enforce DRM, keep a copy of all keys at a trusted third party, and keep users secure”… so the current situation is “you get none of those”.


    [1]

    sr71Girthbird 39 points 1 day ago

    Not OP but I work at a company providing video infrastructure, and one of our products is an analytics suite. It provides all the data he men- tioned and ton more. Turner, Discovery, New York Times, Hulu, and everyone’s favorite company, MindGeek all use our Analytics, among hundreds of other large customers. Specifically where this guy says, “Some variants of the app had GPS pinging enabled at the time, roughly once every 30 seconds” that’s called a heartbeat. The app or video player within the app has to have a heart- beat so that the player can detect if a viewer is still watching video etc. Our analytics + video player services send a regular heartbeat every 8 seconds. It definitely pulls in your exact location.












  • Yeah… “problem” was kind of tongue in cheek.

    But it’s not exactly a “default”, it’s more of a “demographic with little data”… and I bet it’s small enough that the algorithm is showing exactly the content most of its members are looking for. It’s somewhat of a sad reflection on the state of privacy, when keeping things private becomes a segmenting parameter.


  • Interesting article, but in my experience it overstates the problem… at least for Facebook itself (I have zero interaction with Instagram, Threads, or VR).

    I’ve gone back to Facebook for the last few months, and out of what it mentions, I’ve only seen like half of it, mostly in the comment sections.

    Or to be more precise, for 2024 Q2, I’m seeing:

    • election disinformation - almost none
    • violent content
    • child sexual abuse material
    • hate speech - only in comments
    • fake news - almost none
    • crypto scams - a few
    • phishing - a few
    • hacking
    • romance scams - almost none
    • AI content - almost none
    • uncanny valley stuff

    The article however forgot to include:

    • science deniers - a lot in open comments, very few in groups
    • religious zealots - in comments
    • political trolls - a few in comments
    • state-sponsored propagandists - a few in comments
    • general trolls - a few in comments

    Still interesting how I get close to zero of these in my main feed.

    there’s a level of disinvestment in Facebook

    Disagree. Facebook has reached a “plateau of stability” where the current moderation tools keep enough people on the platform to make it profitable.

    I’ve been actively reporting+blocking problematic content, and while about 99% of my reports end up in “no action was taken”, it works wonders to keep my feed and group comments clean.


  • Because I have basically nothing in my feed on this account, Facebook backfills it with “recommended” posts and I was pretty shocked at how universally terrible they are. […] since I’ve provided very little in the way of alternative information or interaction for it to use

    There is your problem.

    When an information-hungry platform like Google or Meta asks you to fill out your preferences “to serve you more relevant content”… they are not lying. I mean, it’s also to select ads that will pay more for your attention, but the thing with the content algorithm is, if you don’t give it data, then it will ass-u-me that you’re statistically most likely to engage with content that is getting most engaged… by people who have also not provided it any data.

    The problem with that cohort, is it not only includes the few people with legitimate security concerns, but also those who got dark secrets to hide, and/or are using “incognito” browser mode to look for porn.

    I don’t like to give too much info about myself, but I also don’t want to get stuff intended for the “average horny fanatics” group, so I try to give enough data for the algorithm to put me into a group that makes more sense to me.

    And it works. The strongest signal you can send to the algorithm, is blocking content you don’t want to see. It’s amazing how quickly modern algorithms learn to avoid showing me most porn, politics, or religious content, and instead show me science and humor. They still send like 1% of trash my way, clearly checking whether I’ll maybe engage with it, but report+block works wonders.





  • Let me clarify: I’ve seen the sand bucket guy’s art featured twice on the news in the past few days, filmed at an art gallery, described as art, commented as being art. It’s not some random event, it’s the current publicly accepted definition of “art”.

    My statement, not insinuation, as to why AI art is comparable to “traditional” art, comes after that.

    What comes across as desperate however, is generalizing all AI output and disparaging it, without considering the quality of input from the person behind it. Reminds me of how photography used to not be art, how electric instruments couldn’t be art, or how using a computer couldn’t be art either. Tools don’t make or break an artist.