Who do you trust: your data, or your enemy’s?

Imagine, just imagine, that you’re head of an organisation, and you suspect that there’s been some sort of attack on your systems[1]. You don’t know for sure, but a bunch of your cleverest folks are pretty sure that there are sufficient signs to merit an investigation. So, you agree to allow that investigation, and it’s pretty clear from the outset – and becomes yet more clear as the investigation unfolds – that there was an attack on your organisation. And as this investigation draws close to its completion, you happen to meet the head of another organisation, which happens to be not only your competitor, but also the party that your investigators are certain was behind the attack. That person – the leader of your competitor – tells you that they absolutely didn’t perform an attack: no, sirree. Who do you believe? Your people, who have been laboring[2] away for months, or your competitor?

What a ridiculous question: of course you’d believe your own people over your competitor, right?

So, having set up such an absurd scenario[4], let’s look at a scenario which is actually much more likely. Your systems have been acting strangely, and there seems to be something going on. Based on the available information, you believe that you’ve been attacked, but you’re not sure. Your experts think it’s pretty likely, so you approve an investigation. And then one of your investigatory team come to you to tell you some really bad news about the data. “What’s the problem?” you ask. “Is there no data?”

“No,” they reply, “it’s worse than that. We’ve got loads of data, but we don’t know which is real.”

Our logs are our memory

There is a literary trope – one of my favourite examples is Margery Allingham’s The Traitor’s Purse – where a character realises that he or she is somebody other than who they think they are when they start to question their memories. We are defined by our memories, and the same goes for organisational security. Our security logs define our organisational memory. If you cannot prove that the data you are looking at is[5] correct, then you cannot be sure what led to the state you are in now.

Let’s give a quick example. In my organisation, I am careful to log every time I upgrade a piece of software, but I begin to wonder whether a particular application is behaving as expected. Maybe I see some traffic to an external IP address which I’ve never seen before, for instance. Well, the first thing I’m going to do is to check the logs to see whether somebody has updated the software with a malicious version. Assuming that somebody has installed a malicious version, there are three likely outcomes at this point:

you check the logs, and it’s clear that a non-authorised version was installed. This isn’t good, because you now know that somebody has unauthorised access to your system, but at least you can do something about it.
at least some of the logs are missing. This is less good, because you really can’t be sure what’s gone on, but you know have a pretty strong suspicion that somebody with unauthorised access has deleted some logs to cover up their tracks, which means that you have a starting point for remediation.
there’s nothing wrong. All the logs look fine. You’re now really concerned, as you can’t be sure of your own data – your organisation’s memories. You may be looking at correct data, or you may be looking at incorrect data: data which has been written by your enemy. Attackers can – and do – go into log files and change the data in them to obscure the fact that they have performed particular actions. It’s actually one of the first steps that a competent attacker will perform.

In the scenario as defined, things probably aren’t too bad: you can check the checksum or hash of the installed software, and check that it’s the version you expect. But what if the attacker has also changed the version of your checksum- or hash-checker so that, for packages associated with this particular application, they always return what you expect to see? This is not a theoretical attack, and nor is it the only way approach that attackers have to try to muddy the waters as to what they’ve done. And if it has happened, they you have no way of confirming that you have the correct version of the application. You can try updating the checksum- or hash-checker, but what if the attacker has meddled with the software installer to ensure that it always installs their version…?

It’s a slippery slope, and bar wiping the entire system and reinstalling[6], there’s little you can do to be certain that you’ve cleared things up properly. And in some cases, it may be too late: what if the attacker’s main aim was to exfiltrate some of your proprietary data, for example?

Lessons to learn, actions to take

Well, the key lesson is that your logs are important, and they need to be protected. If you can’t trust your logs, then it can be very, very difficult not only to identify the extent of an attack, but also to remediate it or, in the worst case, even to know that you’ve been attacked at all.

And what can you do? Well, there are many techniques that you can employ, and the best combination will depend on a number of questions, including your regulatory regime, your security posture, and what attackers you decide to defend against. I’d start with a fairly simple combination:

move your most important logs off-system. Where possible, host logs on different systems to the ones that are doing the reporting. If you’re an attacker, it’s more difficult to jump from one system to another than it is to make changes to logs on a system which you’ve already compromised;
make your logs write-only. This sounds crazy – how are you supposed to check logs if they can’t be read? What this really means is that you separate read and write privileges on your logs so that only those with a need to read them can do so. Writing is less worrisome – though there are attacks here, including filesystem starvation – because if you can’t see what you need to change, then it’s almost impossible to do so. If you’re an attacker, you might be able to wipe some logs – see our case 2 above – but obscuring what you’ve actually done is more difficult.

Exactly what steps you take are up to you, but remember: if you can’t trust your logs, you can’t trust your data, and if you can’t trust your data, you don’t know what has happened. That’s what your enemy wants: confusion.

1 – like that’s ever going to happen.

2- on this, very rare, occasion, I’m going to countenance[2] a US spelling. I think you can guess why.

3 – contenance?

4 – I know, I know.

5 – are?

6 – you checked the firmware, right? Hmm – maybe safer just to buy completely new hardware.

Author: Mike Bursell

Long-time Open Source and Linux bod, distributed systems security, etc.. Founder of P2P Consulting. マイク・バーゼル：オープンソースとLinuxに長く従事。他にも分散セキュリティシステムなども手がける。 View all posts by Mike Bursell

Our logs are our memory

Lessons to learn, actions to take

Share this:

Related

Author: Mike Bursell

Leave a comment Cancel reply