Data Cleaning

About a year ago, the other half had a computer that malfunctioned in a particularly nasty way.

All of the data from that computer was essentially wiped because the computer itself managed to rewrite data enough times on an SSD that it killed the drive. FYI: solid-state drives have a finite lifespan that is dictated by the number of times data is written to them.

The other half’s computer managed to kill not one but two SSDs in 6 months. Wow! That’s a whole lot of rewriting of data: 20 years’ worth in 3 months.

The upshot of all of this is that I ended up digging data up from backups and from places on the network where they’d stored data. This doesn’t sound like much of an issue until you realize that the other half stores data in a completely random way.

This is not surprising since hardcopy data is stored in exactly the same way in odd little places all around the house.

After I’d consolidated all the data I could recover into a single group of folders on the server, I said, “Okay that’s all I can do, you’re going to have to sort through and delete duplicates and what you don’t need.”

That was over a year ago. Guess what? Nothing has been touched since the important documents got transferred to the new computer.

Here’s a lovely chart showing how much data is needlessly occupying space on the server.

These numbers are astounding. Gigabytes of duplicate data? Really?

Well, since none of this has been touched in over a year, I’m going to clean house!

This is going to take a really long time. I’m not even going to bother to examine files; I’m going to use a utility to merge folders and delete duplicates.
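For illustration, here’s a minimal Python sketch of how a duplicate finder might work. It is not the utility used here, and the names `file_digest` and `find_duplicates` are made up for the example. The assumption is the common size-then-hash approach: bucket files by size first, then hash only the files that share a size.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Group files under root that have identical contents.

    Hypothetical helper: files are bucketed by size first (cheap),
    and only same-size files are hashed (expensive).
    """
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symlinks and anything that is not a regular file.
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    by_hash = defaultdict(list)
    for size, paths in by_size.items():
        if len(paths) < 2:
            continue  # a unique size cannot have a duplicate
        for path in paths:
            by_hash[(size, file_digest(path))].append(path)

    # Keep only groups with more than one file, sorted for stable output.
    return [sorted(group) for group in by_hash.values() if len(group) > 1]
```

Note that this only reports the groups. Deciding which copy in each group is the "original" is the part that still needs a human, or at least a rule.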

Whatever is lost will probably be of no consequence since it wasn’t important enough to look through in the first place.

The fact is, there’s no reason to have 5 folders of duplicate information indexed and stored on the server. It’s not that I’m all that worried about space; I’ve got tons available. It’s that, if a drive crashes, all of this crap would make recovering from a one- or two-drive malfunction in the RAID array really tedious.

I’m sure that I’ll hear about something being lost, sometime in the future, but I’ll deal with that when/if the time comes.

At this point, I’m curious to see just how the deletion of all of this crap affects the performance of the server. I’m betting that the reduction in size of the index files alone will get me a speed boost.

I just hope the utility can swallow and digest a mountain of redundant data.

I should know in another 12 hours or so.

Now off to deal with some other poop. This time it’s legitimate poo, from the dog. Strangely I don’t mind cleaning the poop from the back yard nearly as much as I mind cleaning up the poop on the server.

Have a great weekend.


Update:

260GB deleted. Probably 100GB more to sort through.

The cleaning application got lost in the mess. This happened several times.

I finally got annoyed with the utility. I was seriously considering asking for a refund. I went in manually to do the job. Initially I was thinking, “DAMN a human brain will still be needed to do the most basic things!”

Uh Huh…

I found that the application was having a problem with folders nested within folders that all had the same names.

It looked like the image to the right.

But in each one of the subfolders, there were files. Some of these files were originals and some of them were duplicates.

This diagram shows a recursion only 5 levels deep. That’s something like 63 directories, and what I was looking at was in some cases 7 levels deep. It was not uncommon for there to be thousands of files in each directory.
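To put numbers on how fast this grows, here’s a quick Python sketch. It assumes every folder contains exactly two subfolders, which is a guess on my part; it happens to be the branching that gives 63 folders at 5 levels below the top. The function name `count_directories` is just for illustration.

```python
def count_directories(branching, depth):
    """Total folders in a tree where every folder holds `branching`
    subfolders, counting the top folder as level 0."""
    return sum(branching ** level for level in range(depth + 1))


print(count_directories(2, 5))  # 63
print(count_directories(2, 7))  # 255
```

Under that same assumption, going from 5 levels to 7 roughly quadruples the number of folders to keep straight.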

As you can imagine, it got out of hand very quickly. Especially when you consider that many of the file names were duplications.

I’d been annoyed at the utility for not being able to keep it all straight. But I was far less annoyed when I realized that I (the human) had gotten lost more than once in this digital house of mirrors.

As an interesting aside, it took me two different applications to create even this simple representation. One of the applications flat out refused to engage in the illogic and crashed.

The second application was somewhat uncooperative but eventually allowed me to create it, then save it as a PNG.

I’m no longer annoyed at the duplicate finding utility that threw up its hands. Even trying to create the representation was harder than I thought it would be. I could picture in my head what I wanted to show, but translating that kind of irrationality to something clear was… Odd.

In the case of the utility, I very much doubt that the programmer who created it even considered that someone would do something like this. So I still think the utility was worth what I paid for it.

Looking at the recycle bin, it looks as though the utility handled 4 levels of recursion without a complaint.

It’s because of this kind of thing that I personally am a bastard when it comes to training computer users on the importance of reasonable directory structures. I’ve always said that if you need nested directories more than five levels deep, you’re doing something wrong.

Obviously, there are exceptions to this rule. Computers can keep track of much deeper nesting, but non-technical average human beings??? More than five levels down, you’re asking for files to be “lost” and setting yourself up to chew hard disk space with duplicate files that will never be accessed again.
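If you want to check a folder tree against the five-level rule, something like the following Python sketch would do it. The function name `directories_deeper_than` and the default of 5 are assumptions for the example, and depth is counted from the folder you start in.

```python
import os


def directories_deeper_than(root, max_depth=5):
    """Yield (depth, path) for every directory nested more than
    max_depth levels below root."""
    root = os.path.abspath(root)
    for dirpath, _dirnames, _filenames in os.walk(root):
        rel = os.path.relpath(dirpath, root)
        # The root itself is depth 0; "a" is 1, "a/b" is 2, and so on.
        depth = 0 if rel == os.curdir else rel.count(os.sep) + 1
        if depth > max_depth:
            yield depth, dirpath
```

Calling `list(directories_deeper_than("/path/to/share"))` (the path is a placeholder) gives you a list of the offenders to go argue about.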

Yet again… There’s usually a good reason for me doing things the way I do.

So that was my Saturday… I hope yours was more fun and less, uhh… interesting.

That’s it! I want to get the HELL out of California!

In addition to all of the absolute bullshit that is California…

Traffic, Lockdowns, Abridgment of rights, Water shortages, Electricity shortages, $500 Auto Registration Fees, Incompetence of the State Government and all the state’s government offices, State waste of tax dollars, Complete lack of planning, Crime, Outrageous Property Taxes, Outrageous Gas prices (.55 on every gallon of gas, really?), Shitty roads, Outrageous State Income Taxes, and on and on and on…

I mean, pick literally any subject and then look to California to see how to do it exactly wrong.

I hate this place and everything about it.

There was a time when the benefits outweighed the hassle. That time is long gone.

This is a losing proposition any way you cut it.

That’s not to say that there aren’t some good people here and even some good Representatives in the Legislature. The problem is there are too few people with any common sense and those numbers apparently are dwindling.

The straw that has broken this camel’s back is EDD.

I don’t think there’s ever been a more incompetent government entity short of perhaps the last of the Roman Empire, maybe King George III’s tax collectors or court.


First, EDD screwed up the modernization of their computer systems. They contracted with, and hired, a bunch more people to actually do the work. Apparently the work not only didn’t get done, but they bought a shit ton of equipment that sat in its boxes until the warranties expired. Then the equipment sat for another few years until it was literally obsolete and had to be scrapped.

They were completely unprepared for the number of unemployment claims that were being filed when the pandemic hit. How did they address this? By putting more people on the phones supposedly to answer phone calls, which of course still didn’t get answered.

Oh yeah, there were hearings and excuses and in the end, after the uproar died down nothing was done. No heads rolled, and EDD went right back to its usual incompetent self.

They screwed up who was being paid during the pandemic. Turns out they paid a lot of people that perhaps weren’t eligible for benefits.

What? You mean they threw taxpayers’ dollars at people that shouldn’t have received benefits??? Yep… in at least the hundreds of millions of dollars range, if not billions.


Now, EDD is sending out stupid emails that demand the people who received benefits prove they were eligible to receive those benefits and they’re threatening to charge 30% penalties on those benefits.

But of course the email they send you provides links to the EDD website that frankly is a hot steaming pile of shit. So even if you want to comply with their demand the odds are you can’t since the site sucks so bad.

Just connecting to their web site makes me and my computer morons. The intellect draining capacity of California’s EDD site should be harnessed and used to combat hackers and cyberterrorists the world over.

Come to think of it, the EDD site could be used as a firewall. It’s amazing at creating endless loops of login after login.

The funny parts of EDD’s demands are that they seem to think: A) the criminals who gamed the system are going to send documentation, and B) they’re going to get the money they threw away back from the criminals who gamed the system.

Hey California EDD, NEWSFLASH Those people are in the wind and you’re not going to find them!

D’Oh!

But those of us who had legitimate claims are having to jump through hoops to clean up EDD’s fuckup.

When you talk to EDD they predictably absolve themselves of responsibility by saying the Federal Government is who is requesting this information.

That is probably true; however, EDD should have all the data. They should not be asking for tax returns. After all, if EDD approved a person for unemployment benefits, that presumes that EDD knows who your previous employer was and verified that you were eligible for benefits in the first place. RIGHT?

It gets better: the EDD representatives are apparently unsure what exact documents you need to provide. Do you need to provide the entirety of your tax returns, or just a copy of the W2, or will pages from your California tax return suffice? They’re not sure.

It’s the Federal Government that’s asking, it goes into a computer to determine if the document is right…

Uh huh.

Having spent some time working for a Federal contractor, I can tell you that the Feds are pretty damn specific about what they want to see.

But wait, there’s more! EDD told a representative from a legislator’s office yesterday that they’d called me (they did). I wasn’t available to take their call, so they went to voice mail. This same person told the same legislative representative that they’d left a voice mail (they did not). I sent screenshots proving the point to the legislator’s representative. So I have a little bit of a trust issue with EDD.

I have a real problem with providing EDD documents of this sensitive nature.

They’re incompetent. They’ve reported at least one data breach, if I recall correctly. I’ve caught them literally lying to a California Legislator’s office.

Given their incompetence and laissez-faire attitude about what they seem to need, I’m concerned that by providing these documents in an electronic format I’m just setting myself up for identity theft.

After all what better hacking target than an organization with a proven track record of stupidity? Just imagine all the wonderful identities that will suddenly be available for the picking.

EDD appears to be auditing the citizens. EDD needs to be audited by a totally independent source. Maybe a group of accountants from Texas or Florida? Someone who’s not likely to sweep things under the carpet in the interest of not embarrassing California.

They made this mess. Why should the citizens be hassled, threatened, or further annoyed to help them clean it up?

I believe California’s corruption and incompetence go from the top all the way down to the local level.

I’m sick of it.

Voting is pointless because the people who’ve created this fucked up system greatly outnumber the people in this state who demand fiscal responsibility.

With that realization, I choose to work to get my ass out of here.

Even if that means divorce after 33 years and leaving the house and everything else behind. Honestly at this point I’m thinking about cutting my losses and doing anything to be free of this third world shit hole.

At least in another state I might have a shot at a job where I don’t have to worry about skin color quotas and layer upon layer of politically correct bullshit!

Fuck California!

p.s. In case you had any doubt… I’m not in a very good mood today.

To-Go California, You can order your meal but you’ll never eat.

Consider this a bit of a PSA.

I stumbled across this article in The Wall Street Journal about a new regulation under which fast food and dine-in restaurants no longer provide plastic utensils with your order unless you specifically ask for them.

Gone are the days when you can blow through a Taco Bell drive through and be assured that you’ll be able to eat all your meal.

You’d think, “No worries, I’ll use my hands.”

TRUE you could, but then you’ll find that the napkins you assumed would be in the bag, AREN’T.

Not to worry though, it’s not like you’ll have hot sauce in the bag either, because that too you’d have to ask for.

So, Taco Bell, and other fast food options will kinda be a no option without a checklist. I’ll attempt to help by providing my personal Ordering checklist. I’ve got it on my phone, but I’m thinking a post-it note stuck to my car dashboard might be better.

Fast Food ordering:

  1. Food
  2. Drink
  3. Condiments (Ketchup, Hot Sauce, etc.)
  4. Straw
  5. Napkins at least 5 (More depending on type of food)
  6. Necessary utensils (Spork, Knife, etc.)
  7. Pull up to window. Pay.
  8. CHECK that meal is correct and necessary Utensils, Napkins, and condiments are present.
  9. IGNORE HONKING of impatient people waiting behind you!

Yet again The State of California is working to make your life better, through unintended consequences.

Most of the time, if you’re a working stiff, perhaps hourly, you’ve maybe got a 30-minute lunch. If the company you work for is exceptionally generous, you might even have 45 minutes or a whole hour!

With traffic in most areas around California industrial parks, for a working stiff, it works out like this:

5 minutes to get out of the building, 5 minutes to get out of the parking lot (due to everyone else trying to leave for lunch), 10 minutes to navigate the rest of the lunch hour traffic from all the other companies in the industrial park.

5 minutes at traffic lights and turning into the nearest strip mall or gas station parking lot.

Pull into the shortest line for one of the fast food places (Wendy’s, McDonald’s, Taco Bell, Starbucks, Panda Express, etc.), wait in line 5 to 10 minutes. (By which time you’re already late if you’re on a 30-minute lunch.)

Place your order, 5-minute wait for food, then a mad 10-minute dash (you’re late at this point if you have a 45-minute lunch) back to the industrial park.

5 minutes waiting at lights, 5 minutes to get into parking lot and find parking space, 5 minutes to get back into building. (You’re on the raggedy edge of being late at this point if you have an hour lunch.)
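If you’d rather not do the math in your head, here’s a small Python sketch that adds up the steps above. The minute figures are copied from the breakdown; the step labels and the names `LUNCH_STEPS` and `lunch_timeline` are just mine for the example.

```python
# (step, fewest minutes, most minutes), copied from the breakdown above
LUNCH_STEPS = [
    ("out of the building", 5, 5),
    ("out of the parking lot", 5, 5),
    ("industrial park traffic", 10, 10),
    ("lights and turning in", 5, 5),
    ("waiting in line", 5, 10),
    ("waiting for food", 5, 5),
    ("dash back", 10, 10),
    ("lights again", 5, 5),
    ("parking", 5, 5),
    ("back in the building", 5, 5),
]


def lunch_timeline(steps):
    """Return (step, elapsed_low, elapsed_high) after each step."""
    timeline = []
    low = high = 0
    for name, fewest, most in steps:
        low += fewest
        high += most
        timeline.append((name, low, high))
    return timeline


for name, low, high in lunch_timeline(LUNCH_STEPS):
    elapsed = f"{low}" if low == high else f"{low}-{high}"
    print(f"{elapsed:>5} min  {name}")
```

The running total hits 30 to 35 minutes by the time you reach the speaker, 45 to 50 after the dash back, and 60 to 65 by the time you’re in the building, all before a single bite.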

You get back to your desk, ready to resume work and eat your meal while you’re working…

You open your bag of cold, soggy burger & fries, or tacos that started out as crunchy but which are now anything but. Voilà, you discover that the whole exercise was pointless because even though you could eat without the condiments, or perhaps even the utensils, you have nothing to wipe your hands with.

Your lunch sits in the bag, not getting any fresher until finally it smells disgusting and ends up in the trash. Thereby contributing to food waste and spewing CO2 into the air, in the dash to get lunch that was also a pointless waste of energy.

So tell me again how wonderful it is that you’ve eliminated basic necessities to protect the planet? Huuuummm?

I swear to God, all of these jackass politicians should be under mandatory orders to live with their proposed laws for six months before they can put them into action.

Not ONE of the political elites in any California city has ever had to punch a clock or be screamed at by an overbearing manager over their lunch break being 2 minutes too long.

All one need do is look at the distribution of restaurants in and around industrial parks in San Diego, Irvine, Victorville, Huntington Beach, The San Fernando Valley, Ventura, or Los Angeles to see that most of the “working class” lives what I’ve described above daily. Or they bring their lunch so that they can at least have a real few minutes of rest during their lunch break.

Now, thanks to the politician brain trust, this new anti utensil or condiment law will ultimately slow the food ordering process down even further.

Not that these Politicians give a shit. After all, if the workers aren’t spending money on fast food, after skipping breakfast to get the kids off to school, it means the workers will have more money that can be collected as taxes.

Obviously the low wage earner who collects $12 a week in unspent lunch money doesn’t need it, now do they?

Plus, the Arch Ministers of Public Health can chalk up a “Win” because obesity will be less of a problem. Who cares if the workers are starving while they toil away to earn enough to pay their tax burden? That just means they’ll die sooner and a whole new group can be imported from wherever.

Maybe the Politicians are hoping it’ll be mostly white people!