We can see that the majority of copies are coming from users with 0 reputation. To start, we wanted to see if our higher reputation users are copying more. Things started to become more interesting when we asked more detailed questions about who was copying and what they were copying. Finally, 86% of all copies came from anonymous users, aka users with 0 rep. Our largest geographies make up the majority of copies Asia 33%, Europe 30%, and North America 26%. Most copies occurred during the work week and during working hours. We also quickly realized that the overall copy behavior closely followed what we already knew about our site traffic. Questionsīen already mentioned some of the high-level stats that quickly proved what people had long joked about: everyone is copying from Stack Overflow. The following analysis is based on the behavior during that time. We collected data for two full weeks, from March 26th 2021 to April 9th 2021. We pretty much captured everything except the actual text being copied. With these events we were able to capture many different attributes tags, question answer or comment, code block or plain text, copier reputation and post score, region, and if the post was accepted or not. Using our homegrown web tracking tool, we created custom events to capture when a user copied from the site. What started as a joke has snowballed into a worthwhile exploration, producing new insights and sparking many internal conversations about how we can continue to innovate our public platform and bring more value to Stack Overflow for Teams. To add some direction to the analysis, the team and I came up with a list of questions that we wanted to answer. How many people really are copying from Stack Overflow? Are people just copying code? Are people more likely to copy the accepted answer? What did surprise me was the number of questions we could finally answer. If you want to hear about how we built the software modal and physical keyboard behind our April Fools joke, check out the podcast below.Īs someone who has been unapologetically copying from Stack Overflow for years, I was not surprised to see the millions of copy events rolling in. That’s the high level TL DR, but for folks who want a deep dive into all the things we learned while studying the copy data, please read on for some marvelous insights and charts from David Gibson, a data analyst on our product marketing team. Beyond that, we encourage everyone to share in the benefits of what the community has created. And of course, be aware that some code requires a certain license to use. You should still follow some basic best practices to prevent bugs or safety issues from sneaking into your code when copying, so make sure you educate yourself before grabbing and pasting. You can stand on the shoulders of giants and use their prior lessons learned to build new things of value. Our whole site runs on knowledge reuse – it’s the altruistic mentorship that makes Stack Overflow such a powerful community. Knowledge reuse isn’t a bad thing – it helps you learn, get working code faster, and reduces your frustration. So, if you’ve ever felt bad about copying code from our site instead of writing it from scratch, forgive yourself! Why recreate the wheel when someone else has done the hard work? We call this knowledge reuse – you’re reusing what others have already learned, created, and proven. People copy from code blocks more than ten times as often as they do from the surrounding text, and surprisingly, we see more copies being made on questions without accepted answers than we do on questions which are accepted. People copy from answers about ten times as often as they do from questions and about 35 times as often as they do from comments. That adds up to 40,623,987 copies across 7,305,042 posts and comments between March 26th and April 9th. One out of every four users who visits a Stack Overflow question copies something within five minutes of hitting the page. We were able to catalog every copy command made on Stack Overflow over the course of two weeks, and here’s what we found. Once we set up a system to react every time someone typed Command+C, we realized there was also an opportunity to learn about how people use our site. Ok, jokes over, hope everyone had a good laugh and no one got too freaked out. What would the world look like if we suddenly decided to monetize the act of copying code from Stack Overflow? Our company was inspired by the founders frustration with websites that kept answers to coding questions behind paywalls. We wanted to embrace a classic Stack Overflow meme and tweak one of our core principles. In the case of our recent April Fools gag, it might be more like an entire cob, perhaps a bushel of truth. They say there’s a kernel of truth behind every joke.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |