A small tool to view real-world ActivityPub objects as JSON! Enter a URL
or username from Mastodon or a similar service below, and we'll send a
request with
the right
Accept
header
to the server to view the underlying object.
{
"@context": [
"https://www.w3.org/ns/activitystreams",
{
"ostatus": "http://ostatus.org#",
"atomUri": "ostatus:atomUri",
"inReplyToAtomUri": "ostatus:inReplyToAtomUri",
"conversation": "ostatus:conversation",
"sensitive": "as:sensitive",
"toot": "http://joinmastodon.org/ns#",
"votersCount": "toot:votersCount",
"litepub": "http://litepub.social/ns#",
"directMessage": "litepub:directMessage",
"Hashtag": "as:Hashtag"
}
],
"id": "https://tldr.nettime.org/users/remixtures/statuses/113617013704354136",
"type": "Note",
"summary": null,
"inReplyTo": null,
"published": "2024-12-08T11:40:46Z",
"url": "https://tldr.nettime.org/@remixtures/113617013704354136",
"attributedTo": "https://tldr.nettime.org/users/remixtures",
"to": [
"https://www.w3.org/ns/activitystreams#Public"
],
"cc": [
"https://tldr.nettime.org/users/remixtures/followers"
],
"sensitive": false,
"atomUri": "https://tldr.nettime.org/users/remixtures/statuses/113617013704354136",
"inReplyToAtomUri": null,
"conversation": "tag:tldr.nettime.org,2024-12-08:objectId=22607303:objectType=Conversation",
"content": "<p>"Now that the seal is broken on scraping Bluesky posts into datasets for machine learning, people are trolling users and one-upping each other by making increasingly massive datasets of non-anonymized, full-text Bluesky posts taken directly from the social media platform’s public firehose—including one that contains almost 300 million posts.</p><p>Last week, Daniel van Strien, a machine learning librarian at open-source machine learning library platform Hugging Face, released a dataset composed of one million Bluesky posts, including when they were posted and who posted them. Within hours of his first post—shortly after our story about this being the first known, public, non-anonymous dataset of Bluesky posts, and following hundreds of replies from people outraged that their posts were scraped without their permission—van Strein took it down and apologized."</p><p><a href=\"https://www.404media.co/bluesky-posts-machine-learning-ai-datasets-hugging-face/?ref=daily-stories-newsletter\" target=\"_blank\" rel=\"nofollow noopener noreferrer\" translate=\"no\"><span class=\"invisible\">https://www.</span><span class=\"ellipsis\">404media.co/bluesky-posts-mach</span><span class=\"invisible\">ine-learning-ai-datasets-hugging-face/?ref=daily-stories-newsletter</span></a></p><p><a href=\"https://tldr.nettime.org/tags/SocialMedia\" class=\"mention hashtag\" rel=\"tag\">#<span>SocialMedia</span></a> <a href=\"https://tldr.nettime.org/tags/Bluesky\" class=\"mention hashtag\" rel=\"tag\">#<span>Bluesky</span></a> <a href=\"https://tldr.nettime.org/tags/AI\" class=\"mention hashtag\" rel=\"tag\">#<span>AI</span></a> <a href=\"https://tldr.nettime.org/tags/ML\" class=\"mention hashtag\" rel=\"tag\">#<span>ML</span></a> <a href=\"https://tldr.nettime.org/tags/GenerativeAI\" class=\"mention hashtag\" rel=\"tag\">#<span>GenerativeAI</span></a> <a href=\"https://tldr.nettime.org/tags/AITraining\" class=\"mention hashtag\" rel=\"tag\">#<span>AITraining</span></a> <a href=\"https://tldr.nettime.org/tags/WebScraping\" class=\"mention hashtag\" rel=\"tag\">#<span>WebScraping</span></a></p>",
"contentMap": {
"pt": "<p>"Now that the seal is broken on scraping Bluesky posts into datasets for machine learning, people are trolling users and one-upping each other by making increasingly massive datasets of non-anonymized, full-text Bluesky posts taken directly from the social media platform’s public firehose—including one that contains almost 300 million posts.</p><p>Last week, Daniel van Strien, a machine learning librarian at open-source machine learning library platform Hugging Face, released a dataset composed of one million Bluesky posts, including when they were posted and who posted them. Within hours of his first post—shortly after our story about this being the first known, public, non-anonymous dataset of Bluesky posts, and following hundreds of replies from people outraged that their posts were scraped without their permission—van Strein took it down and apologized."</p><p><a href=\"https://www.404media.co/bluesky-posts-machine-learning-ai-datasets-hugging-face/?ref=daily-stories-newsletter\" target=\"_blank\" rel=\"nofollow noopener noreferrer\" translate=\"no\"><span class=\"invisible\">https://www.</span><span class=\"ellipsis\">404media.co/bluesky-posts-mach</span><span class=\"invisible\">ine-learning-ai-datasets-hugging-face/?ref=daily-stories-newsletter</span></a></p><p><a href=\"https://tldr.nettime.org/tags/SocialMedia\" class=\"mention hashtag\" rel=\"tag\">#<span>SocialMedia</span></a> <a href=\"https://tldr.nettime.org/tags/Bluesky\" class=\"mention hashtag\" rel=\"tag\">#<span>Bluesky</span></a> <a href=\"https://tldr.nettime.org/tags/AI\" class=\"mention hashtag\" rel=\"tag\">#<span>AI</span></a> <a href=\"https://tldr.nettime.org/tags/ML\" class=\"mention hashtag\" rel=\"tag\">#<span>ML</span></a> <a href=\"https://tldr.nettime.org/tags/GenerativeAI\" class=\"mention hashtag\" rel=\"tag\">#<span>GenerativeAI</span></a> <a href=\"https://tldr.nettime.org/tags/AITraining\" class=\"mention hashtag\" rel=\"tag\">#<span>AITraining</span></a> <a href=\"https://tldr.nettime.org/tags/WebScraping\" class=\"mention hashtag\" rel=\"tag\">#<span>WebScraping</span></a></p>"
},
"attachment": [],
"tag": [
{
"type": "Hashtag",
"href": "https://tldr.nettime.org/tags/socialmedia",
"name": "#socialmedia"
},
{
"type": "Hashtag",
"href": "https://tldr.nettime.org/tags/bluesky",
"name": "#bluesky"
},
{
"type": "Hashtag",
"href": "https://tldr.nettime.org/tags/ai",
"name": "#ai"
},
{
"type": "Hashtag",
"href": "https://tldr.nettime.org/tags/ml",
"name": "#ml"
},
{
"type": "Hashtag",
"href": "https://tldr.nettime.org/tags/generativeAI",
"name": "#generativeAI"
},
{
"type": "Hashtag",
"href": "https://tldr.nettime.org/tags/aitraining",
"name": "#aitraining"
},
{
"type": "Hashtag",
"href": "https://tldr.nettime.org/tags/webscraping",
"name": "#webscraping"
}
],
"replies": {
"id": "https://tldr.nettime.org/users/remixtures/statuses/113617013704354136/replies",
"type": "Collection",
"first": {
"type": "CollectionPage",
"next": "https://tldr.nettime.org/users/remixtures/statuses/113617013704354136/replies?min_id=113617027103419310&page=true",
"partOf": "https://tldr.nettime.org/users/remixtures/statuses/113617013704354136/replies",
"items": [
"https://tldr.nettime.org/users/remixtures/statuses/113617027103419310"
]
}
},
"likes": {
"id": "https://tldr.nettime.org/users/remixtures/statuses/113617013704354136/likes",
"type": "Collection",
"totalItems": 18
},
"shares": {
"id": "https://tldr.nettime.org/users/remixtures/statuses/113617013704354136/shares",
"type": "Collection",
"totalItems": 23
}
}