I’m a tech interested guy. I’ve touched SQL once or twice, but wasn’t able to really make sense of it. That combined with not having a practical use leaves SQL as largely a black box in my mind (though I am somewhat familiar with technical concepts in databasing).

With that, I keep seeing [pic related] as proof that Elon Musk doesn’t understand SQL.

Can someone give me a technical explanation for how one would come to that conclusion? I’d love if you could pass technical documentation for that.

  • vorb0te@lemmynsfw.com
    link
    fedilink
    arrow-up
    0
    ·
    4 months ago

    He could also refer to the mere possibility of having duplicates which does not mean there are duplicates. And even then it could be by accident. Of course db design could prevent this. But I guess he is inflating the importance of this issue.

  • GreenKnight23@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    5 months ago

    because he doesn’t understand that SSNs aren’t globally unique when you account for time.

    Barry Allen born in 1905 has SSN 123-44-5678. Barry died in 1983.

    Clark Kent was born in 1996 with SSN 123-44-5678.

    Musks assumption of a deduplicated DB table ignores the fact that SSNs were never designed to be GUIDs. he lacks the fundamentals of basic data modeling and critical thinking needed to understand a simple construct that a child could grasp.

    This is why he doesn’t understand SQL. He also doesn’t understand COBOL, which SSNs were built on top of.

    This isn’t because Elon Musk is an idiot.

    #Elon Musk is a criminal.

      • Maggoty@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        5 months ago

        But I was assured he was a materials engineer, rocket scientist, computer programmer, and businessman extraordinaire!

      • thatKamGuy@sh.itjust.works
        link
        fedilink
        arrow-up
        0
        ·
        5 months ago

        The SSN is 9 digits long; so technically they would have to start re-using them after the billionth one. Given the current population size, and how many people have been born/died since its implementation - it’s fair to say they haven’t had to re-use any figures yet.

    • Phoenixz@lemmy.ca
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      Elin musk is a (criminal) scammer, he always has been.

      He was fired for incompetence from his own company

      Pretty much everything he’s promised for every company he has headed had been a lie. Tesla full self driving? Lie. Hyperloop? All lies to successful kill high speed rail and start a movement that wasted billions of dollars including tax payer money. Even SpaceX, the least shit of all, is shit. Once you really look at it, its all promises with no results and lots of cheering when millions of tax payer dollars -yet again- blow up in the sky.

      The guy has one quality: convincing people that he’s smart even though he literally doesn’t know shit

    • bitchkat@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 months ago

      The sheer size of the federal government and its age would mean there are thousands of databases out there. Some may be so old that they predate RDBMS/SQL.

      That alone makes his comment come from a place of ignorance. Of course it’s confident ignorance. The worst kind.

      • lmmarsano@lemmynsfw.com
        link
        fedilink
        English
        arrow-up
        0
        ·
        edit-2
        5 months ago

        Some may be so old that they predate RDBMS/SQL.

        I don’t follow. Wouldn’t that lend credence to his assertion that it’s incorrect to assume that everything in government is SQL?

        People here are being irrationally obtuse about the possibility that an agency that’s existed since the 1930s may keep business-critical records on legacy systems predating relational databases. Systems serving a national agency may not migrate databases frequently.

          • lmmarsano@lemmynsfw.com
            link
            fedilink
            English
            arrow-up
            0
            ·
            5 months ago

            Were those his exact words? When words are ambiguous, are we selecting interpretations that serve best in the contention? Does the context suggest something obvious was left unstated? Yours seems like a forced interpretation.

            1. He complains about 1 specific database.
            2. Some rando assumes it’s SQL & retorts he doesn’t know it.
            3. He literally writes “This retard thinks the government uses SQL.”

            Always, sometimes, here? In typical Twitter fashion, it’s brief and leaves room for interpretation.

            In context, always or here makes the most sense as in “This dumbass thinks the government always uses SQL.” or “This dumbass thinks the government uses SQL here.” Does it matter some other database is SQL if this one isn’t? No. With your interpretation, he pointlessly claims that it does matter for no better reason than to discredit himself. With narrower interpretations, he doesn’t. In a contention, people don’t typically make pointless claims to discredit themselves. Therefore, narrower interpretations make more sense. Use context.

            All I did here was apply textbook guidelines for analyzing arguments & strawman fallacies as explained in The Power of Logic. I welcome everyone to do the same.

            The fact is there’s very little information here. We don’t know which database he’s referring to exactly. We don’t know its technology. Some of us have worked enough with local government & legacy enterprise systems to know that following any sort of common industry standards is an unsafe assumption. No one here has introduced concrete information on any of that to draw clear conclusions, though there’s an awful lot of conjecture & overreading.

            He seemed to use the word de-duplicated incorrectly. However, he also explained exactly what he meant by that, so the word hardly matters. Is there a good chance he’s wrong that multiple records with the same SSN indicate fraud? Without a clear explanation of the data architecture, I think so.

            I despise idiocy. Therefore, I despise what Musk is doing to the government. Therefore, I despise it when everyone else does it.

            Seeing this post keep popping up in the lemmy feed is annoying when it’s clear from context that there’s nothing there but weak reasoning.

            Wow! It's fucking nothing!

            We don’t have to become idiots to denounce idiocy.

            • bitchkat@lemmy.world
              link
              fedilink
              English
              arrow-up
              0
              ·
              5 months ago

              He literally writes “This retard thinks the government uses SQL.”

              That is all you need. He’s not saying “This retard thinks the SSA uses SQL”. He is saying “the government” which means all of it. Saying someone is a retard because they think the government uses SQL means Elon doesn’t think they do because we all know he doesn’t consider himself a retard.

              You are looking for ambiguity where there is none.

              • lmmarsano@lemmynsfw.com
                link
                fedilink
                English
                arrow-up
                0
                ·
                edit-2
                4 months ago

                Nah, that’s ignoring context irrationally. Context matters. I’ll show.

                He’s not saying “This retard thinks the SSA uses SQL”.

                Can SSA not be called “the government”?

                He is saying “the government” which means all of it.

                So, let’s try your suggested interpretation.

                This retard thinks all the government uses SQL.

                That seems to agree with mine.

                However, you denied ambiguity of language, and that context matters, so let’s explore that: which government? The Brazilian government? Your state government? Your local government? No? How do you know? That’s right: context.

                Why stop there? There’s more context: a Social Security database was specifically mentioned.

                Does “the government” always mean all of it? When a federal agent knocks someone’s door & someone gripes “The goddamn government is after me!” do they literally mean the entire government? I know from context I or anyone else can informally refer to any part of the government at any level as “the government”. I think you know this.

                Likewise, when people refer to the ocean or the sky or the people, they don’t necessarily mean all of it or all of them.

                Another way to check meaning is to test whether a proposition still makes sense when something obvious unstated is explicitly written out.

                This retard thinks the government uses SQL. Why assume they use SQL here?

                Still make sense? Yes. Could that be understood from context without explicitly writing it out? Yes.

                A refrain:

                Use context.

    • Snothvalpen@lemmy.blahaj.zone
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      Wait, SSNs weren’t designed to be GUIDs? I mean, I fully follow that they aren’t and we’ve had to reuse them when the circle of life does its thing, but I thought they were just designed poorly and we found out the hard eay they don’t work as GUIDs. What purpose were they designed for if not to act as GUIDs?

      • jonne@infosec.pub
        link
        fedilink
        arrow-up
        0
        ·
        5 months ago

        They were designed to be only used for the administration of social security. Since they were sending monthly checks, they needed a way to know that the person going to the office and saying their address changed was who they said they were. This was at a time before driver’s licences were common and they didn’t have any other type of ID, and there were just a lot fewer people.

        Later on the SSN started to be used by banks and other entities even though it was never meant for that, and the risks associated with the relatively insecure design just compounded, because instead of just fraudulently claiming someone else’s social security checks (which, unless the target died, would probably be figured out within a month), it opened up all sorts of extra avenues for fraud.

    • Dkarma@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      Lol talk about burying the lede… The issue here is that the government absolutely uses SQL to traverse a DB and anyone who thinks otherwise is an idiot.

      • DahGangalang@infosec.pubOP
        link
        fedilink
        arrow-up
        0
        ·
        5 months ago

        Naw, I definitely meant to be asking about duplication of data in databases (vs if the government actually uses SQL).

        Sorry to have communicated that so poorly. Everyone seems to be taking the angle you’re arguing though. Guess I’ll need to work on that.

    • Aeao@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      I’m not arguing that Elon musk is anything but an absolute tool.

      SS numbers have 999 million options. Are we already repeating them?

      • vonbaronhans@midwest.social
        link
        fedilink
        arrow-up
        0
        ·
        5 months ago

        We have over 300 million people in the US right now. Social security started in the US in 1935 with just over 127 million people then.

        Yeah, we probably have gone through 999 million options by now.

          • vonbaronhans@midwest.social
            link
            fedilink
            arrow-up
            0
            ·
            5 months ago

            Just read that, and it says they’ve only issued 453 million numbers so far. Huh. I really thought it would’ve been a lot more than that.

        • starman2112@sh.itjust.works
          link
          fedilink
          arrow-up
          0
          ·
          edit-2
          5 months ago

          I don’t think we’ve gone through 999 million options yet. Only about 350 million people have been born since 1933, so even if we add all 127 million US citizens alive in 1935, that’s just over half of the possible social security numbers.

          The reason we’ve likely reused numbers is because they weren’t randomly assigned until like 2011. Knowing that I was born in 1995 in Wichita, KS, you could make an educated guess at the first three digits of my SSN

          • vonbaronhans@midwest.social
            link
            fedilink
            arrow-up
            0
            ·
            edit-2
            5 months ago

            We have 335 million people in this country literally right now. I don’t think “350 million born since 1933” makes sense. There gotta be a lot of churn just from early deaths alone.

            Edit: number fixin

            • tempest@lemmy.ca
              link
              fedilink
              arrow-up
              0
              ·
              5 months ago

              Not every person in the United States was born in the United States and even temporary workers can get a SSN

            • starman2112@sh.itjust.works
              link
              fedilink
              arrow-up
              0
              ·
              edit-2
              5 months ago

              I mean you can check my math, I just added up all the births per year in this article

              https://www.usatoday.com/story/money/2020/06/12/how-many-people-were-born-the-year-you-were-born/111928356/

              Rounding to one significant figure, it’s 311.9 million people born in the US between 1933 and 2018. Adding an average of 4 million births per year since then, it’s 335.9. I rounded up to 350 to bring it to a nice round number

              A bit of research tells me that around 44.8 million of us are first generation immigrants, so 291.1 million were born here. Is it reasonable to assume that 291.1 out of the 335.9 million people born since 1933 have survived so far? I have absolutely no idea, I’m not a professional census taker

    • KillingTimeItself@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 months ago

      it seems that nobody really cares about the word retard anymore, it’s quite funny how it went from super common language, to being less common, to people just saying it again now.

      I’m curious how many people actually consider the word a slur, and how many people even care these days.

    • localhost443@discuss.tchncs.de
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      Because that’s really just to be expected at this point, and what his audience would want…

      Better to focus on constantly poking at him for being dumb, which he and his fans hate, rather than give them what they want, ie being upset at their hateful language

  • h4x0r@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 months ago

    I think a lot of comments here miss the mark, it’s not really just about stating the gov does not use SQL.

    Deduplication is generally part of a compression strategy and has nothing to do with SQL. If we’re being generous he may have been talking about normalization, but no one I have ever met has confused the two terms (they are distinctly different from an engineering perspective).

    There are degrees of normalization too, so it may make total sense to normalize 3NF (third normal form) rather than say 6NF.

    • whotookkarl@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      This is it, relational databases are normalized under forms, deduplicate is usually a term used when talking about a concrete data set from data sources like a database, not the relational data model in the database itself.

  • P00ptart@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    5 months ago

    Everything they don’t understand (which is nearly everything) is either God or fraud. Do with that information what you will.

  • SolidShake@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    5 months ago

    How come republicans keep saying that doggy is going to expose all the fraud in the government but yet the biggest fraud with 37 felonies is president? What the actual fuck to these people think?

  • rational_lib@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    edit-2
    5 months ago

    To me I’m not really sure what his reply even means. I think it’s some attempt at a joke (because of course the government uses SQL), but I figure the joke can be broken down into two potential jokes that fail for different, embarrassing reasons:

    Interpretation 1: The government is so advanced it doesn’t use SQL - This interpretation is unlikely given that Elon is trying to portray the government as in need of reform. But it would make more sense if coming from a NoSQL type who thinks SQL needs to be removed from everywhere. NoSQL Guy is someone many software devs are familiar with who takes the sometimes-good idea of avoiding SQL and takes it way too far. Elon being NoSQL Guy would be dumb, but not as dumb as the more likely interpretation #2.

    Interpretation 2: The government is so backward it doesn’t use SQL - I think this is the more likely interpretation as it would be consistent with Elon’s ideology, but it really falls flat because SQL is far from being cutting-edge. There has kind of been a trend of moving away from SQL (with considerable controversy) over the last 10 years or so and it’s really surprising that Elon seems completely unaware of that.

    • dnick@sh.itjust.works
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      My guess is that he thinks SQL is an app or implementation like MS-SQL. It would be pretty surprising if the government didn’t use SQL as in relational databases, but if it doesn’t it’s even more unlikely that he understands even the first part over whether having duplicate SS numbers is in any way unexpected or unreasonable. Most likely one of the junior devs somewhere along the lines misunderstood a query and said something uninformed and mocking, and he took that as a good dig to toss into a tweet.

    • DahGangalang@infosec.pubOP
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      Thanks for genuine response. Lol, most who interpret my question that way you did don’t seem interested in a good faith discussion. But ol’ boy is def tripping if he thinks SQL isn’t used in the government.

      Big thing I’m intending to pry at is whether there would be a legitimate purpose to have duplicated SSNs in the database (thus showing the First Bro doesn’t understand how SQL works).

  • RabbitBBQ@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    edit-2
    5 months ago

    It’s more than just SQL. Social Security Numbers can be re-used over time. It is not a unique identifier by itself.

    • KillingTimeItself@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 months ago

      i’ve heard conflicting reports on this, i have no idea to what degree this is true, but i would be cautious about making this statement unless you demonstrate it somehow.

        • KillingTimeItself@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          0
          ·
          4 months ago

          On June 25, 2011, the Social Security Administration changed the SSN assignment process to “SSN randomization”,[36] which did the following:

          The Social Security Administration does not reuse Social Security numbers. It has issued over 450 million since the start of the program, about 5.5 million per year. It says it has enough to last several generations without reuse and without changing the number of digits. https://www.ssa.gov/history/hfaq.html

          evidently they must be doing something else on the backend for this to be working, assuming there are quite literally 100M numbers, which is going to be static due to math, obviously, but they clearly can’t be reassigning numbers to 3 people on average at any given time, without some sort of external mechanism.

          There are approximately 420 million numbers available for assignment.

          https://www.ssa.gov/employer/randomization.html

          that certainly doesnt seem like it would support several generations, possibly at our current birth rate i suppose.

          DDG AI bullshit tells me that there are a billion codes. https://www.marketplace.org/2023/03/10/will-we-ever-run-out-of-social-security-numbers/ this article says it’s 1 billion

          https://www.ssn-verify.com/how-many-ssns

          this website also lists it as approximately 1 billion.

          • DacoTaco@lemmy.world
            link
            fedilink
            arrow-up
            0
            ·
            4 months ago

            I think i see the change. They are mentioning the ssn is 9 numbers long, which is 1 longer than the 3-3-2 format wikipedia mentions. That does mean its around 999mil numbers, which ye allows for a few generations ( like, 1 or 2 lol )

  • Garlicsquash@lemmings.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 months ago

    Having never seen the database schema myself, my read is that the SSN is used as a primary key in one table, and many other tables likely use that as a foreign key. He probably doesn’t understand that foreign keys are used as links and should not be de-duplicated, as that breaks the key relationship in a relational database. As others have mentioned, even in the main table there are probably reused or updated SSNs that would then be multiple rows that have timestamps and/or Boolean flags for current/expired.

    • werefreeatlast@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      Is this is true, then by this time we are all fucked. Like Monday someone checks their banking or retirement and it all gone. That’s gonna be a crazy day.

      I hope they’re not using the actual SSN as the primary key. I hope its a big ass number that is otherwise unrelated.

  • sneaky@r.nf
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 months ago

    Might seem like a stupid question, but I’m in nostupidquestions sooo… Did Elon really do this tweet with the word “retard” in it? Obviously am on Lemmy so don’t use Twitter.

  • GaMEChld@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    5 months ago

    Because of course the government uses SQL. It’s as stupid as saying the government doesn’t use electricity or something equally stupid. The government is myriad agencies running myriad programs on myriad hardware with myriad people. My damned computers at home are using at least 2-3 SQL databases for some of the programs I run.

    SQL is damn near everywhere where data sets are found.

    • DahGangalang@infosec.pubOP
      link
      fedilink
      arrow-up
      0
      ·
      5 months ago

      Yeah, obviously ol’ boy is tripping if he thinks SQL isn’t used in the government.

      Big thing I’m prying at is whether there would be a legitimate purpose to have duplicated SSNs in the database (thus showing the First Bro doesn’t understand how SQL works).

      • GaMEChld@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        5 months ago

        Oh, well another user pointed out that SSN’s are not unique, I think they are recycled after death or something. In any case, I do know that when the SSN system was first created it was created by people who said this is NOT MEANT to be treated as unique identifiers for our populace, and if it were it would be more comprehensive than an unsecure string of numbers that anyone can get their hands on. But lo and behold, we never created a proper solution and we ended up using SSN’s for identity purposes. Poop.

        • 【J】【u】【s】【t】【Z】@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          edit-2
          5 months ago

          I’m pretty sure there is a federal statute that says ONLY the SSA may collect or use SSNs, as to federal agencies. I argued it once when a federal agency court tried to tell me that it couldn’t process part of my client’s case without it. I didn’t care but my client was crotchety and would only even give me the last four.

          Edit. It’s a regulation:

          https://www.law.cornell.edu/cfr/text/28/802.23

          An agency cannot require disclosure of an SSN for any right or benefit if a specific federal statute requires it or the agency required the disclosure prior to 1975.

          In my case the agency got back to me with some federal statute that didn’t say what they said it said, and eventually they had to admit they were wrong.

      • aesthelete@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        edit-2
        5 months ago

        SSNs being duplicated would be entirely expected depending upon the table’s purpose. There are many forms of normalization in database tables.

        I mean just think about this a little bit, if the purpose is transactions or something and each row has a SSN reference in it for some reason, you’d have a duplicate SSN per transaction row.

        A tiny bit of learning SQL and you could easily see transactional totals grouped by SSN (using, get this, a group by clause). This shit is all 100% normal depending upon the normalization level of the schema. There are even – almost obviously – tradeoffs between fully normalizing data and being able to access it quickly. If I centralize the identities together and then always only put the reference id in a transactional table, every query that needs that information has to go join to it and the table can quickly become a dependency knot.

        There was a “member” table for instance in an IBM WebSphere schema that used to cause all kinds of problems, because every single record was technically a “member” so everything in the whole system had to join to it to do anything useful.

        • DahGangalang@infosec.pubOP
          link
          fedilink
          arrow-up
          0
          ·
          5 months ago

          had to join to it

          I don’t think I get what this means. As you describe it, that reference id sounds comparable to a pointer, and so there should be a quick look up when you need to de-reference it, but that hardly seems like a “dependency knot”?

          I feel like this is showing my own ignorance on the back end if databasing. Can you point me to references that explain this better?

          • aesthelete@lemmy.world
            link
            fedilink
            arrow-up
            0
            ·
            edit-2
            5 months ago

            I’m talking about a SQL join. It’s essentially combining two tables into one set of query results and there are a number of different ways to do it.

            https://www.w3schools.com/sql/sql_join.asp

            Some joins are fast and some can be slow. It depends on a variety of different factors. But making every query require multiple joins to produce anything of use is usually pretty disastrous in real-life scenarios. That’s why one of the basics of schema design is that you usually normalize to what’s called third normal form for transactional tables, but reporting schemas are often even less normalized because that allows you to quickly put together reporting queries that don’t immediately run the database into the ground.

            DB normalization and normal forms are practically a known science, but practitioners (and sometimes DBAs) often have no clue that this stuff is relatively settled and sometimes even use a completely wrong normal form for what they are doing.

            https://en.m.wikipedia.org/wiki/Database_normalization

            In most software (setting aside well-written open source), the schema was put together by someone who didn’t even understand what normal form they were targeting or why they would target it. So the schema for one application will often be at varying forms of normalization, and schemas across different applications almost necessarily will have different normal forms within them even if they’re properly designed.

  • KillingTimeItself@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 months ago

    TL;DR de-deuplication in that form is used to refer a technique where you reference two different pieces of data in the file system, with one single piece of data on the drive, the intention being to optimize file storage size, and minimize fragmentation.

    You can imagine this would be very useful when taking backups for instance, we call this a “Copy on Write” approach, since generally it works by copying the existing file to a second reference point, where you can then add an edit on top of the original file, while retaining 100% of the original file size, and both copies of the file (its more complicated than this obviously, but you get the idea)

    now just to be clear, if you did implement this into a DB, which you could do fairly trivially, this would change nothing about the DB operates, it wouldn’t remove “duplicates” it would only coalesce duplicate data into one single tree to optimize disk usage. I have no clue what elon thinks it does.

    The problem here, as a non programmer, is that i don’t understand why you would ever de-duplicate a database. Maybe there’s a reason to do it, but i genuinely cannot think of a single instance where you would want to delete one entry, and replace it with a reference to another, or what elon is implying here (remove “duplicate” entries, however that’s supposed to work)

    Elon doesn’t know what “de-duplication” is, and i don’t know why you would ever want that in a DB, seems like a really good way to explode everything,

    • valtia@lemmy.world
      link
      fedilink
      arrow-up
      0
      ·
      4 months ago

      i genuinely cannot think of a single instance where you would want to delete one entry, and replace it with a reference to another

      Well, there’s not always a benefit to keeping historical data. Sometimes you only want the most up-to-date information in a particular table or database, so you’d just update the row (replace). It depends on the use case of a given table.

      what elon is implying here (remove “duplicate” entries, however that’s supposed to work)

      Elon believes that each row in a table should be unique based on the SSN only, so a given SSN should appear only once with the person’s name and details on it. Yes, it’s an extremely dumb idea, but he’s a famously stupid person.

      • DacoTaco@lemmy.world
        link
        fedilink
        arrow-up
        0
        ·
        4 months ago

        Ssn being unique isnt a dump idea, its a very smart idea, but due to the us ssn format its impossible to do. Hence to implement the idea you need to change the ssn format so it is unique before then.

        Also, elons remark is stupid as is. Im sure the row has a unique id, even if its just a rowid column.

      • KillingTimeItself@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        0
        ·
        4 months ago

        Well, there’s not always a benefit to keeping historical data. Sometimes you only want the most up-to-date information in a particular table or database, so you’d just update the row (replace). It depends on the use case of a given table.

        in this case you would just overwrite the existing row, you wouldn’t use de-duplication because it would do the opposite of what you wanted in that case. Maybe even use historical backups or CoW to retain that kind of data.

        Elon believes that each row in a table should be unique based on the SSN only, so a given SSN should appear only once with the person’s name and details on it. Yes, it’s an extremely dumb idea, but he’s a famously stupid person.

        and naturally, he doesn’t know what the term “de-duplication” means. Definitionally, the actual identity of the person MUST be unique, otherwise you’re going to somehow return two rows, when you call one, which is functionally impossible given how a DB is designed.

        • valtia@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          4 months ago

          in this case you would just overwrite the existing row, you wouldn’t use de-duplication because it would do the opposite of what you wanted in that case.

          … That’s what I said, you’d just update the row, i.e. replace the existing data, i.e. overwrite what’s already there

          Definitionally, the actual identity of the person MUST be unique, otherwise you’re going to somehow return two rows, when you call one, which is functionally impossible given how a DB is designed.

          … I don’t think you understand how modern databases are designed

          • KillingTimeItself@lemmy.dbzer0.com
            link
            fedilink
            English
            arrow-up
            0
            ·
            4 months ago

            … That’s what I said, you’d just update the row, i.e. replace the existing data, i.e. overwrite what’s already there

            u were talking about not keeping historical data, which is one of the proposed reasons you would have “duplicate” entries, i was just clarifying that.

            … I don’t think you understand how modern databases are designed

            it’s my understanding that when it comes to storing data that it shouldn’t be possible to have two independent stores of the exact same thing, in two separate places, you could have duplicate data entries, but that’s irrelevant to the discussion of de-duplication aside from data consolidation. Which i don’t imagine is an intended usecase for a DB. Considering that you literally already have one identical entry. Of course you could simply make it non identical, that goes without saying.

            Also, we’re talking about the DB used for the social security database, not fucking tigerbeetle.

  • jerryh100@lemmy.world
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 months ago

    That dipshit does not even know how his dear friend at Oracle make tons of money in the past decades.