How to find all duplicate assets in AEM dam? | Community
tarun11011991
New Participant
January 11, 2017
Solved

How to find all duplicate assets in AEM dam?


Hi,

I need to find all the duplicated assets in the DAM.

I followed https://docs.adobe.com/docs/en/aem/6-2/author/assets/managing-assets-touch-ui/duplicate-detection.html

This document does not help me search for all duplicates. It also doesn't tell me that I am uploading a duplicate asset; it just pops up a window asking whether to replace the asset or create a new version of it.

My priority is to get a list of all duplicated DAM assets. Please advise.

Thanks in advance.

Best answer by kautuk_sahni

Hi 

Duplicate detection is done by checking the SHA-1 checksum value.

As the documentation puts it: if the asset you upload already exists in AEM Assets, the Duplicates Detected dialog warns you that you are attempting to upload a duplicate asset. The dialog appears only if the SHA-1 checksum value of the binary of the existing asset matches that of the asset you upload. In this case, the names of the assets are immaterial. In other words, the dialog can even appear for assets that have different names if the SHA-1 values of their binaries are the same.

For each uploaded asset, the SHA-1 value is stored in the JCR on the asset's metadata node.
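To illustrate why the file name does not matter, here is a minimal Python sketch that computes a SHA-1 checksum over raw bytes. It only demonstrates the hashing idea; the file names and byte contents are made up, and it assumes (not confirmed here) that AEM hashes the original binary in the same way.

```python
import hashlib
import io


def sha1_of_stream(stream, chunk_size=65536):
    """Return the hex SHA-1 digest of a binary stream, read in chunks."""
    digest = hashlib.sha1()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


# Hypothetical uploads: two share the same bytes under different names.
uploads = {
    "best-season.jpg": b"example image bytes",
    "copy-of-banner.jpg": b"example image bytes",
    "other.jpg": b"different bytes",
}

# Map each (made-up) file name to the checksum of its content.
checksums = {
    name: sha1_of_stream(io.BytesIO(data)) for name, data in uploads.items()
}

for name, checksum in checksums.items():
    print(name, checksum)
```

The first two entries print the same checksum because their bytes are identical. Note that two files showing the same picture but saved or encoded differently have different bytes, so their checksums will differ.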

So for your use case, you would need to write a custom service that compares these values across all assets.

Where to find the SHA-1 value for an image asset: jcr:content -> metadata -> dam:sha1 (for example, /content/dam/geometrixx-outdoors/banners/best-season.jpg/jcr:content/metadata)
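The comparison such a service would perform can be sketched roughly as follows. This is only an outline of the grouping logic in Python, not an AEM implementation: it assumes you have already collected (asset path, dam:sha1) pairs by some means, and the paths, checksum strings, and the function name find_duplicates are all hypothetical.

```python
from collections import defaultdict


def find_duplicates(assets):
    """Group asset paths by checksum; keep checksums shared by 2+ assets."""
    by_checksum = defaultdict(list)
    for path, sha1 in assets:
        if sha1:  # skip assets that have no dam:sha1 value
            by_checksum[sha1].append(path)
    return {
        sha1: sorted(paths)
        for sha1, paths in by_checksum.items()
        if len(paths) > 1
    }


# Hypothetical (path, dam:sha1) pairs collected from the repository beforehand.
assets = [
    ("/content/dam/site/banners/best-season.jpg", "aaa111"),
    ("/content/dam/site/archive/summer.jpg", "aaa111"),
    ("/content/dam/site/logo.png", "bbb222"),
    ("/content/dam/site/broken.png", None),
]

# Print one line per group of assets that share a checksum.
for sha1, paths in find_duplicates(assets).items():
    print(sha1, paths)
```

Grouping by checksum in a dictionary needs a single pass over the assets, rather than comparing every asset against every other one.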

I hope this helps.

Thanks and Regards

Kautuk Sahni

15 replies

tarun11011991
New Participant
January 11, 2017

Hi,

Thanks for the reply.

The dam:sha1 value comes out different for the same image uploaded under a different name. Is there any way to calculate the SHA-1 of an asset and then compare it with other assets?

tarun11011991
New Participant
January 11, 2017

We have already uploaded the images into the DAM. Is there any way to identify the assets that are duplicated (the same asset under different names)?

smacdonald2008
New Participant
January 11, 2017

When you refer to duplicate assets, what do you mean? The same asset name, for example? If you have two assets in two locations, both named car.png, would that count as a duplicate for your requirement? Or do you want AEM to tell you whether two assets have the same data stream?

tarun11011991
New Participant
January 11, 2017

Hi,

I tried uploading an image asset in the Touch UI, and duplicate detection seems to be working there. I got a pop-up window telling me that duplicates are present.

Could you please tell me how I can generate a report (or a list) of all duplicate assets in one go?

Thanks in advance