-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Duplicate Product Format String #61
Comments
I'm seeing this in the Plus, it's trivial to use |
Further debugging seems to show that the releases where this is occurring actually do have two Markup for a result without the issue: <li class="buyItem digital">
<h3 class="hd">
<button class='download-link buy-link' type="button">
<span class="buyItemPackageTitle primaryText">Digital Album</span>
</button>
<div class="digitaldescription secondaryText"> Streaming + Download </div>
</h3>
...
</li> Markup returned for a result with the duplicate text issue: <li class="buyItem digital">
<h3 class="hd">
<button class='download-link buy-link' type="button">
<span class="buyItemPackageTitle primaryText">Digital Album</span>
</button>
<span class="buyItemPackageTitle primaryText you-own-this">Digital Album</span>
<div class="digitaldescription secondaryText"> Streaming + Download </div>
</h3>
...
</li> This is from a dump of the First, I don't own this. Second, how would the scraper know that if the request is being made from node? Seems like a weird edge case, but I am seeing this behavior consistently on specific URLs. Either way, I assume this is the cause of the duplicated text. I'm going to try to debug this further but I just wanted to post this as an update to my initial report that there wasn't duplicate text. Also, I'm not sure what's happening with this line {
products: [
{
imageUrls: [],
name: 'Digital AlbumDigital Album',
nameFallback: '',
format: 'Digital AlbumDigital Album',
formatFallback: '',
priceInCents: 350,
currency: 'EUR',
offerMore: true,
soldOut: false,
nameYourPrice: false,
description: 'Includes unlimited streaming via the free Bandcamp app, plus high-quality download in MP3, FLAC and more.'
}
]
} |
When using getAlbumProducts, some URLs return duplicated strings for the format prop.
For example:
This consistently returns "Digital AlbumDigital Album" as the format. I'm not sure how this is happening, since the
. buyItemPackageTitle
element only contains this text once.This seems to happen to certain URLs consistently, ex:
I'm using a random URL out of a set of 1000 for debugging in my app, and I'm seeing this ~5% of the time.
It also seems to happen to the
name
prop for some URLs, and I'm also seeing the string "Full Digital Discography" doubled.The text was updated successfully, but these errors were encountered: