Is Manual Remediation with Repository Health Check as Good as It Gets?

April 25, 2018 By Daniel Sauble

8 minute read time

If you're a Sonatype Nexus Repository admin, you understand the importance of keeping a repository healthy. We recently made a few changes to Repository Health Check (RHC) to help you in this quest. It now surfaces often-used vulnerable components and gives the information you need to research and remediate those components.

Let's talk about what RHC knows, how to remediate those components manually, and how to save time and effort in the future by automating the research and remediation.

What Does RHC Know?

If enabled, RHC knows a few things about a given repository:

It knows what vulnerable OSS components exist in that repository.
It knows how many and how severe the vulnerabilities are.
It knows how often these components are downloaded over time.

The information above is enough to figure out which components need to be remediated. Let's see an example of what that might look like.

How Do I Remediate Vulnerable Components?

There are many ways to research and remediate vulnerabilities. The following is a common example of what this can look like for a Maven component.

I'm looking at the RHC summary report for one of my proxy repositories, and notice that several vulnerable components are listed. One of the most vulnerable components seems to be xerces:xercesImpl:2.9.1 (two vulnerabilities found).

I try to see what these vulnerabilities are about. First, I go to a public repository (such as Maven Central) and search for those coordinates. The search doesn't recognize the GAV format I provided, so I search using only the group name (xerces) instead.

Picture2-4

A list of components appears, including the one I'm after. I click "all (17)." The different versions of the xerces:xercesImpl components are listed, I click "2.9.1."

There seems to be no information about vulnerabilities here, though I see this version is quite old (2008). I click the back button to get back to the list of component versions, then click "2.11.0" to see how old it is for comparison.

Through this website, I've learned that 2.9.1 is old (2008), and the newest version 2.11.0 is much newer (2013). However, I've learned nothing about the vulnerabilities that affect my version, and am unsure whether the latest version has resolved those problems.

Next, I go to Google to see if I can get better information. Upon searching for "xerces vulnerabilities," I see many promising results, so I refine by adding the version "xerces vulnerabilities +2.9.1."

Picture4-2

I click what looks like the official release notes for the component and scroll down to my version (2.9.1). Nothing here is related to vulnerabilities, so I go back to my search page.

On the search page, I see another page which appears to list security vulnerabilities for xerces. Five vulnerabilities are listed, but it appears I have to click into each one to see version information. I open all vulnerabilities in separate tabs and look at them individually.

CVE-2016-4463	All versions before 3.1.4 are affected
CVE-2004-1575	All versions before 2.5.0 are affected
CVE-2009-1885	2.7.0 and 2.8.0 are affected
CVE-2008-4482	All versions before 3.0.0 are affected
CVE-2004-1575	2.5.0 is affected

Of these, the first and fourth vulnerabilities seem to relate to my current version (2.9.1). This matches the number of vulnerabilities RHC told me about, so I'm pretty confident I've found the correct details.

Unfortunately, it appears the newest version of xercesImpl (2.11.0) still has those vulnerabilities, so upgrading won't help. I'm not sure what the next course of action is. I send an email to the developer mailing list with my findings, and hope that the application owners can figure it out from here.

Your results may vary, but this is a common outcome of vulnerability research. Our security researchers at Sonatype go through a similar process when researching vulnerabilities for our own database.

This Seems Painful; There Must Be a Better Way

As seen above, using what RHC knows can help keep your repositories healthy. However, as awesome as RHC is, there are a few things it can't do that would make your life even easier.

It doesn't have a list of the vulnerabilities that affect your components (unless you have Sonatype Nexus Repository).
It can't contact the owners of an application that uses vulnerable components.
It can't keep those components out of your software development life cycle (build, test, deploy, etc.).

In each of these cases, Sonatype Lifecycle (in concert with Sonatype Firewall) can help. You can still use what RHC knows to do the research needed to remediate those vulnerable components. It's just often easier to use an automated solution with better data instead.

I'd like to talk through a few of the problems demonstrated in the story above to illustrate what I mean. In each case, the problem is due to a lack of information and could be addressed with better data and automation.

Problem: There isn't a version of a component that is vulnerability-free, and it's not immediately obvious what a suitable replacement would be.

Solution: Sonatype Lifecycle provides recommendations when there is no path forward. In the xercesImpl example, it suggests updating Java and switching to JAXP.
Problem: You don't know what applications use a vulnerable component, or who you should contact with this information, so you're left with spamming a developer mailing list or similar.

Solution: Sonatype Lifecycle or Sonatype Firewall would help by automatically notifying application owners, and keeping vulnerable components out of their development life cycle.
Problem: The vulnerability information on cvedetails.com was actually incomplete and misleading (C++ vulnerabilities usually don't apply to Java, and CVE-2013-4002 was missing from the list).

Solution: Sonatype Lifecycle data is more accurate than that found in public databases, because we manually vet the data, and fix it as needed. We also tie it to the specific ecosystem used by the component (e.g. Java) to eliminate false positives.

What's Right for Me?

RHC allows manually research and remediation of vulnerable components. This is a huge step forward in helping you maintain the health of your repositories.

For some people, manual remediation is enough. That said, it can be time-consuming and frustrating, especially if the public data you're using isn't good enough. If you're strapped for resources, an automated solution like Sonatype Lifecycle or Sonatype Repository Firewall might be a better fit. Alternatively, if you just want to see the full list of vulnerabilities that affect the components in your repository, Sonatype Nexus Repository is a great way to go.

In either case, we hope this was a helpful overview of how to use RHC to maintain the health of your repositories. If you have questions or feedback about any of our products, please let us know.

Written by Daniel Sauble

Daniel is a Product Owner at Sonatype. He enjoys building software tools for developers and sysadmins and has spent the last eight years in DevOps startups. He has experience in Product Management, UX Design, User Research, Software Development, and Data Science.

Is Manual Remediation with Repository Health Check as Good as It Gets?

What Does RHC Know?

How Do I Remediate Vulnerable Components?

This Seems Painful; There Must Be a Better Way

What's Right for Me?

Try Nexus Repository Free Today

Related Resources

Defining Community Open Source Is Harder Than It Looks

Walking the Walk on Package Registry Sustainability

AI Changes the Software Supply Chain and How We Secure It