Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Hacker News (HN) – Part 1: analysis (osm-ai.net)
138 points by sebg on April 8, 2024 | hide | past | favorite | 119 comments


I think there could be an aspect of social cooling contributing to decreased per-user engagement. I find myself considering my online comments more carefully now than I did 10 years ago across all platforms.


Hard agree- a lot of people just opt not to at all- that's what I usually do (other than HN). And when you do decide to go ahead and participate, the energy it takes for each post is a lot higher- you have to consider every contingency you can think of and make sure they all check out. And none of us know if we're going to be wrong when they unearth these posts and pin them on us later, unfortunately.


> I find myself considering my online comments more carefully now than I did 10 years ago across all platforms.

I find myself doing this, but I chalked it up to aging and the reality that I may eventually have no choice but to work for one of the companies I have ridiculed online.


I know that participating on HN has made my comments on all forums more thoughtful, less hyperbolic, more serious, and generally better considered. The reasonable rules, excellent moderation and the community culture here have been excellent teachers. The voting mechanism helps me gauge if my comments are reasonable (I don't care about the magnitude but more the negative/positive score telling me if I'm off the rails).

Perhaps a similar effect eventually takes hold on many of the contributors, and for the ones who can't moderate themselves to the culture here, I guess they'll recess into lurkers or simply go away elsewhere.


> The voting mechanism helps me gauge if my comments are reasonable (I don't care about the magnitude but more the negative/positive score telling me if I'm off the rails).

This is my sense of why pg picked a balance number, not a + and - number separately.


The "decreased per-user engagement" evidence in this post is a bit thin. Mostly based on a kind of strange apparent outlier of accounts going idle (1+ year) in 2023 but the binning is also a full year and the 'idle year' is counted in a weird clippy (i.e. looking at calendar year rather than elapsed year) way. So it's one (aggregate) data point at the very end of the data. It might not be wrong but it feels somewhat iffy to draw conclusions from.


> Mostly based on a kind of strange apparent outlier of accounts going idle (1+ year) in 2023

Well, it is just outliers in 2023. This is an upward trend since 2020.

> but the binning is also a full year and the 'idle year' is counted in a weird clippy (i.e. looking at calendar year rather than elapsed year) way

Granted, and I acknowledge this limitation. My idea, however, is that when studying many users in the same manner, this will even out. Why? Because a full calendar year implies somewhere between 0-2 elapsed years. So the average elapsed year, over many users, is 1 year.


The upward trend is much smaller than what happens in 2023 so that looks worth looking into. When you have this one outlier and one year can actually mean two years, it's not completely clear how much of the outlier is actual outlieriness and how much is some accidental artifact.


Hey,

I double checked. I don't really see an issue. The only specific thing that affects 2023 is that I removed the users seen / last seen in 2024 (since it is not complete year). The aggregation is simple also: count the users first seen, grouped by year. count the users last seen, grouped by year.

There was a separate issue though (I didn't filter out the "dead" and "deleted" stories / comments). I fixed that and updated the article. Some values changed, but the patterns and conclusions stands.


Thanks for looking into this. I'll try to reproduce this myself (but with elapsed times) and see what happens.

Just to double check we're talking about the same thing: The red line is 'users who have been inactive for a year or more, at the time of the aggregate point'. So, for instance, for 2016 you'd have a point for 'users with a year+ inactivity, counted from 2016 back'.


> Thanks for looking into this. I'll try to reproduce this myself (but with elapsed times) and see what happens.

That will be great! Please don't hesitate to reach out if there is anything I can help with.

> Just to double check we're talking about the same thing: The red line is 'users who have been inactive for a year or more, at the time of the aggregate point'. So, for instance, for 2016 you'd have a point for 'users with a year+ inactivity, counted from 2016 back'.

Not quite. It is means the user has been last seen in that year (2016). By "last seen" I mean the user last shared story or comment (separate graphs) was that year.


I guess I don't exactly understand 'last seen, (not active from >= year)'. So to be part of the red value for a given year, you have to be seen in that year and then what? Be idle for a year after that? What's the connection between seen-ed-ness and idleness?


Perhaps I should have articulated this in a better way.

> So to be part of the red value for a given year, you have to be seen in that year and then what? Be idle for a year after that?

Exactly! Last seen: this is year of their last contribtuion (story / comment).

A user shared their first story in 2012, and last one in 2016: 2012 is when they were first seen, and 2016 is when they where last seen. So, on the blue line, they are part of 2012, and on the red line, they are part of 2012

> What's the connection between seen-ed-ness and idleness?

If I am last seen in 2016, then I am idle since then, no?


Aha ok, but if I am understanding this right, the future can change the past of this graph, right? Like our hypothetical user who first appeared in 2012 and last posted in 2016 - right now they appear in the 2016 red line but if they showed up again today and you made the graph again next year, they wouldn't be in the 2016 red line anymore. Or put another way and one that you can try: What happens if you cut off the data at 2022, 2021, 2020, 2019, 2018, etc and plotted those graphs? You'd see a different (rather than merely truncated) graph, no? Maybe even a different trend. So if my understanding is right, this is a pretty wiggly metric. The history of something you want to use as a historical trend line should not change as you append more data.


> Like our hypothetical user who first appeared in 2012 and last posted in 2016 - right now they appear in the 2016 red line but if they showed up again today and you made the graph again next year, they wouldn't be in the 2016 red line anymore

That is correct.

> What happens if you cut off the data at 2022, 2021, 2020, 2019, 2018, etc and plotted those graphs? You'd see a different (rather than merely truncated) graph, no? Maybe even a different trend. So if my understanding is right, this is a pretty wiggly metric. The history of something you want to use as a historical trend line should not change as you append more data.

I see your point, but I don't see how it is avoidable. From my knowledge, any user churn metric will suffer the same effect: If you consider a user is churned after two weeks of inactivity, then this will change if you change the cut-off (the last two weeks of the this month? the two weeks before them? ...etc).

Even if you measure the "elabsed time" instead of "last seen", the cut-off will change your curve.

Extreme example: If you assume a user is churned after 1 year of inactivity (elapsed time since last activitiy), then a user that shared one story in 2007, and then a second story in end of 2023, will apear as active. If you change the cut-off from 2023 to 2022, then the user will appear as inactive.


I don't see how it is avoidable.

You can define a metric such that future data doesn't affect past data. Here's a straightforward one: a user is inactive at time t if they haven't posted in the period between t and t - k where k some constant time period one picks. So let's say k is a year and you're looking at active users per year†. So in your last example, the user would be counted as active in 2007 and 2008, counted inactive in 2009 to 2022 and would count as active in 2023. If you truncate the data at 2022 nothing changes.

† year is probably too big of a window for this (I'd take something like a month) but let's stick with it for now


In a SaaS product, these are different issues:

1. I am making a proxy for churn (last seen == end of subscription in this product).

2. You are looking for active users (yearly / monthly / ...)

I think your suggestion (point 2) is definitely an important view, but it doesn't conflict with the need for point 1.

Sorry if this is self-evident, I will just leave this link for reference on such metrics: https://userpilot.com/blog/product-engagement-metrics/

In any case, I am happy to help: if you would like an export of the data, or the DB dump, let me know. And I very much looking forward for your analysis :)


I will double check. I see your point


> I find myself considering my online comments more carefully now than I did 10 years ago across all platforms.

I am curious, why is that? Is the medium less safe than before (e.g., unpleasant interactions)? Or is it a high effort to engage in a discussion, and perhaps there are other priorities now?

Perhaps something else


I have found myself doing the same.

1. I use my real name, which has a strong cooling effect on the kinds of stuff I write. I'm not going to get into a flame war or say career-ending things under my real name. I do this deliberately to force me to keep my responses as high quality as I can.

2. As I temper what I write based on today's norms and taboos, I also have to think about (guess) what might be taboo in 10, 20, 30 years. Assume HN will be searchable forever. People today have a habit of drudging up things that their opponents wrote decades ago, and measuring them against today's (more restrictive) yardstick. I've told jokes back in the 90s that were benign and funny then, but would get me fired today. Imagine people decades from now reading the innocent things I am writing today and how they will be offended by it!


Yeah that is sensible. It is one thing that I miss about the internet back in the early 2000s.

I am still a bit conflicted tbh about what to do concerning that. The accounts don't really get deleted, and it is not far-fetched to imagine tools that correlate the styles accross multiple accounts (so, one mistake, and the identity is unveiled).


The "users ceasing to post stories after one year" metric seems a bit wonky to me. I mean, I don't doubt it, but it's probably skewed by accounts created for self-promotion or spam. If you look at front-page stories, I think that most of them are from long-time members.

It would probably be useful to separate the churn of throwaway accounts and the ones that cross some engagement threshold - 50 comments or something like that.


Better yet just a graph of the ratio of comments vs posts over time.


That is an interesting point! It makes sense to have a criteria that identify real users / long-term members.

I will look into that. Cheers


We humans are emotional. Users' submissions didn't get the desired reach, and comments were thrashed to oblivion- they got emotional and left/lurked without participating anymore. Besides the usual promotional accounts that crop up, this is one of the reasons people do not participate.

Leave the emotion out of the equation, and HackerNews becomes way calmer for you.

The same story you submitted that failed gets talked about/discussed when submitted by another -- eh! All the smart people shred your comment -- ah, let's take a walk.


i mostly lurk HN, and don't post unless i feel like my contribution is useful in some way.

from my perspective, it's relieving to see that the number of users has remained mostly constant in recent years. of course it's selfish to think in this way, but almost no "social media" or UGC-based platform that i've used has actually become better or more useful to me as it became (much) larger.

this kind of fast growth in users (beyond some size) often leads to a shift in the culture that made preexisting users participate in the first place, leading to a loss in overall quality of the platform as a whole. if the growth is gradual enough, then new users eventually figure out how to fit into the culture or leave.

i guess i've said nothing that isn't obvious to people who have used a computer before, basically "yay no eternal september for HN yet", but i digress.


Growth assumes that older posters continue to post, which both according to TFA and my own analysis isn't true.

In my own analysis, comments in a thread have a recent-user bias and roughly 30-40% of an article's comments will come from the last 3-4 years of users. I found this to hold true for many large threads over the past 5 years, though I haven't yet exhaustively demonstrated this yet simply because I don't do HN analysis that often.

What that means practically is that the folks who post on this site are constantly changing. I also, generally, find that the most contentious threads on this site tend to have a relatively stronger recency bias among its posters.

By that regard, eternal September is ever present: the posts on this site weigh toward recent posters. I'd be curious to see if that effect can be explained by throwaways and sock puppet posters but I'm not sure of any reliable way to identify those especially as historic karma counts aren't kept for users.

While I have more robust models that work a lot better, a very simple method I've found is that the more recent the upper percentiles of posters are on a thread, the less I will like it: to me Eternal September is here. Of course my user here is ancient from 2009. The numbers just quantify that there's been a change in audience since I joined, a wholly unsurprising fact.


i did read the article but didn't come to the conclusion that "Eternal September" is here. although i don't argue that the quality of posts has gone up recently, i do believe that many of the new users of 2022/2023 have become - or will become - the high quality posters that attracted them here initially. as i said in the first post, i think this is true because there hasn't been an unprecedented spike in new users.

basically, quality(new user + time) = quality(old user), as long as the proportion of new users/old users remains small. of course it's subjective as to where you put this value though. i just think it hasn't been reached yet here.


What "Eternal September" means to anyone is different.

What I assert based on data (simply map the comments in to the join date of their authors and make a distribution) is that most large threads with a lot of discussion on this site have a recency bias: newly joined users post a disproportionate amount on the website. At any given time, any given thread will be more heavily commented on by newer users. Given that observation, Eternal September is largely personal: do you still value the content on the site given that recent posters create more of it. For me the answer, as borne our in data, is no. I find from ~ 2016 a change in the types of discussions on the site and find that the newer the poster distribution skews from that time period onward the less interesting the discussion becomes to me. That's really all. I'm glad that you still find value in the site but that makes sense: you're a newer user. Of course this may be your second or even nth account in which case I'm obviously wrong, but what I can be certain about is myself and my own preferences, and even those change over time.


Few points

> newly joined users post a disproportionate amount on the website.

What is your definition of a new user?

> For me the answer, as borne our in data, is no. I find from ~ 2016 a change in the types of discussions on the site and find that the newer the poster distribution skews from that time period onward the less interesting the discussion becomes to me.

I am curious: Can you elaborate on how such analysis is being made?


> What is your definition of a new user?

A poster that joined in the last 2-3 years.

> I am curious: Can you elaborate on how such analysis is being made?

Try doing analysis on the time at which posters in a thread joined this site. I weigh their dates based on the number of times they comment in a thread. If you do that you'll see a recency bias. The mass of the distribution is clumped more heavily in the last 3 years since the article's submission. If you think about it, that makes a lot of sense. People are probably much more energized to comment when they first join and eventually get bored and stop. You point out a really similar effect out in your own analysis about posters that stop posting after a year.


this is my first account, and i understand your perspective. i did sometimes come to HN from 2015 onwards, but not enough to really get a feel for the level of content and conversation that happened. 2015 is still much later than you first started using it, so i can see how your older first impressions of this site led you to the conclusion that present-day HN is a shadow of its former self.

so i think we agree on everything, except that you're comparing pre-2016 to now, vs. me comparing 2020 to now, which leads to the opposite conclusions. excuse me for being a part of the less interesting era of posters ;-)


Something that makes this all harder is that HN and other sites don't exist in a vacuum. There's an external world that shapes the opinions and ideas of people.

When I joined HN, hacking was mostly for nerds. The real important tech people worked on chips or devices and the money makers were in finance. This has changed. With it has come an interest in lurid tech coverage, like the existence of sites like "Techdirt", and something about software has led to the growth of a tabloid style news interest in it in a way that never happened for hardware (probably an artifact of the rise of tech being concomitant and convergent with the evolution of the Internet.) There's also been a lot of other changes going on in the greater world as the internet has become a larger part of our lives. I suspect those changes outside "in the real world" that eventually finding itself into here.

For me the antidote has been more focused conversations on other sites like Discord or BlueSky but this is all very personal.


Interesting point.

I was watching The Synanon Fix on HBO, this weekend, and towards the end of the first episode, we see the original addicts, being replaced by the more lucrative "lifestylers," and I suspect that the next episodes will be downhill, from here.

The app I just wrote and released, is picking up steam, but slowly. That's deliberate. We're not doing any promotion, and it's giving us a chance to make course corrections. By the time people start piling in (and they will, but it will never be more than a rounding error to many social media apps), I think it will be in extremely good shape.


I have long wondered about doing an analysis of HN comments to find correlations of opinions, instances where people have changed their mind over time, topics where the balance of opinion is out of proportion to the balance of comments due to a vocal minority.

I have made some casual observations that it would be nice to know whether the broader data supports what I have seen. For instance, one weird thing that I have noticed, is that antagonistic comments appear to me to be posted more often by users with PGP keys in their profile. It may just be by imagination.

Possibly more importantly, but also something should be approached with caution and sensitivity would be analysis of comments to identify people in mental distress. I know of a few HN users that have taken their own lives and have wondered if there was enough information in their comments to allow someone to provide pre-emptive support.


I would be really curious to explore such correlations.

Tbh, while I was working on this, I did struggle with forming hypotheses to test, or even a suitable manifold for the different items/users (e.g., I didn't know some of the users share the PGP keys). Unsupervised methods were not very satisfying in uncovering such hypotheses as well.

There is also the point you raised about how to share (or what to do with) some of these correlations. For example, concerning mental illness, as much as I would love to uncover that due to my sheer curiosity, I am really concerned about what I would see.


> antagonistic comments appear to me to be posted more often by users with PGP keys in their profile

While it's definitely preferable to encrypt private communications among all people including the "nothing to hide" types who generally behave well within societal norms, the types who don't behave as well feel a greater need to encrypt.


I should clarify that I don't think that most these type of comments are from people deliberately violating societal norms (like a troll would).

My hypothesis is broadly similar to yours, that comments like this come from individuals with a more adversarial view of the world.


> Even more interestingly, there was negative karma. I don’t even know how this came to be.

Brand new account commenting with very low value but not worth flagging. For example "I read the article yesterday" or "Why is this article even on HN?". I've seen a couple that had negative karma on the first days. Often though it's simply spam and the account soon gets shadow-banned though I don't understand the process that does that. Example user: https://news.ycombinator.com/user?id=gabriella4151


Well explained, Gracias!


Thank you for clarification.

I think part of the confusion that I had was that I didn't know about the downvoting option at the beginning.

I think it will be really great of HN releases the scores of the comments.


The karma analysis completely ignores (after mentioning it) the karma gained from comments. I post almost no stories to HN, but have a karma of over 18k. Any sane, full analysis of HN culture needs to take this sort of thing into account.


I think of it as the easy way and the hard way to gain karma, but I'm biased as a "commenter" user. In reality, some people work hard to supply HN with interesting stories and that's important, but it's pretty easy to have sizeable karma without really ever engaging in conversation.

I think the tradeoff there is that the HN submitter crowd are very fast to the draw. I'll often encounter a story elsewhere that I want to hear from HN about, so I'll do a search and find it was submitted 15 minutes ago, or even days ago (and I wonder what RSS fueled craze people must go through to find and post things so fast). Of course, for me those are the best cases, because I can go through and read what people think and reply without having to wait for the uptick. But no karma for the slow.


LOL! No RSS, no nothing for me. For me, it is just consistency at a specific timeslot -- wind-down evening for me (Europe's in Office, West Coast wakes up). I tend to fire and forget. I think the hit is less than 5%, but the consistency pays off in the long run. The best I remember is that 4 of my submissions were on the front page over a couple of hours, which is 13.33% of the front page stories.


Author here

I do agree with your point, but unfortunately I was limited by what is available from the APIs: the upvotes for the comments were not provided. I couldn't find a proxy for it either. Thus, I couldn't include it in the analysis


You can build an analysis based on partial ordering. The Axios API gives you relative ranking of comments on a given level. Based on partial order you can build lattices and make assertions but that is really difficult and the juice probably isn't worth the squeeze.


I thought about it, and you might be right that a proxy for the comments upvotes can be established here. But I just couldn't come up with anything satisfying tbh. Besides, with comments, there are replies, replies to the replies, ...etc.


Yes, this is the observation -- comments (when they are meaningful and upvoted) accelerate one's karma. This is also hard, consumes much of your core time, and is more challenging to deal with hyperboles.


If what they call 'consistency' is just the total number of stories posted, it's not really surprising to see a strong correlation with total karma points, is it? The more stories posted, the more karma gathered. Anything else would be a real head scratcher.

You could say, there might be people who keep posting stories even though the don't get upvoted, but that would be kind of irrational. If the community doesn't seem to be interested in what they post, they will stop doing so sooner or later.


> You could say, there might be people who keep posting stories even though the don't get upvoted, but that would be kind of irrational. If the community doesn't seem to be interested in what they post, they will stop doing so sooner or later.

Probably many people stop sharing in this case indeed. But for those who don't, I guess the idea is: eventually, if you keep sharing, you can collect upvotes here and there, and eventually the karma will go up


I didn't realize that the active user tenure is this low (one year) since I frequently come across veteran users.

On a similar note, sometimes i wonder where the high profile users who leave the platform move into (if anything), there doesn't seem to be a lot of options. Do they switch to alt accounts? turn into lurkers? maybe go off the grid?


The average user in the dataset will have lower tenure than the average user you encounter, because long-tenure users will have more encounters.


I’ve significantly reduced my commenting recently because I realized that I’ve been using the internet to hone my discussion/argumentation skills since I got on the web in 1996

At this point there’s too much noise on the internet for good long term, long form discussion like we had on irc and forums and frankly I’m not sure where that lives today.

HN seems to be the last place that had good regular conversation mostly cause of dang


These platforms are not designed for intellectual discourse, even when they're advertised to be, and even when they try to be.

You can't just be right. You also need to be careful. Some would say mindful, but that's not it. You need to be careful not to trigger anyone or trip any ideological wires. Not fun and for what, so you don't bother anymore.

And this is not to discredit the moderators or anything else here or on any other moderation dependent platform. Without them the average quality of comments would be worse, but there also is no denying that their primary role is censorship.


> In Comments

> Be kind. Don't be snarky. Converse curiously; don't cross-examine. Edit out swipes.

The guidelines want you to be kind, which is orthogonal to right.

https://news.ycombinator.com/newsguidelines.html


The guidelines are there to remind you where the site's incentives and culture don't. IME all upvote oriented sites turn into forums for "winning debate club" rather than enlightening discussion. The big difference between the sites is who the judges (upvoting user base) are. The guidelines and moderation here retard the trend but a trend it is nonetheless.


Being unkind actually means you're approaching a discussion with the intent of hurting another person, it's not a property of being factually correct or incorrect. Lots of assholes out there are "technically correct", and there are lots of people out there who are wrong, but still decent human beings.

However, you often can't retroactively determine someone's intent over the Internet. Therefore, you have to start somewhere. If you're a cynic, I guess you assume everyone is out to offend or hurt others. But as an optimist, I tend to assume good faith unless proven otherwise.


I like reading Marginal Revolution because it doesn't have this rule whilst also being a rather intellectually curious place.

(not suggesting HN ever takes this route)


How is being kind orthogonal to being right?


There are hundreds of ways to say something right, but you can phrase it meanly or kindly without changing how right you are. The definition of orthogonal is that one axis doesn’t influence the other.

I’d say it’s mostly orthogonal. It’s easier to be mean and right than kind, which is probably why there are so many more comments that are. A very rare some of the time, being right also requires a dose of meanness, which is a lot less fun for everyone involved.


> The definition of orthogonal is that one axis doesn’t influence the other.

Ok that’s interesting, a new definition on me. I see orthogonal as attributes at right angles, a measure of perpendicularity. I don’t see it as an indicator of attribute independence.

So if kind and right are orthogonal, this means you can (mostly) be one or the other but not (usually) both.

But a quick google shows your usage is common/normal. Hmm, lovely English. TIL something. Thanks for explaining.


Imagine X and Y axes, perpendicular as usual. If you become 3 units more X, that doesn't change how Y you are. Contrariwise, if we nudge our axes out of orthogonality, now moving along the X axes changes where I am on the Y. Obviously the use here is metaphorical, but that's the sense meant.

Edited to add: consider "independence" and orthogonality in vector spaces, if you want to get mathematically precise about it.


Downvoted without a counter argument. Which just proves my point. There is nothing more unintellectual than a plain downvote. I even had some guardrails in that comment too. But I am sure the downvote was warranted to the person who did it.

But also upvotes are similar. No one necessarily "likes" the truth. Nothing about correctness really warrants whatever an upvote means to that person. A "correct" vote maybe, but those are unavailable on these platforms. Not designed for intellectual discourse.

Just to be fair, Twitter is worse. How can you have intellectual discourse with character limits and contextless-ness as a feature? You get something, and something valuable to many, surely, but it isn't intellectual discourse.

Youtube comments are another great case study. Reddit also.


I often find that checking comment karma too frequently can come with an emotional rollercoaster. You post a take you think folks will find interesting or resonate with, and then the next time you look at it it's gone from 1 to 0.

But it's important to remember that it's a balance that you can't see. If your comment has a 1, that could have been 10 downvotes and 10 upvotes in the time since you posted the comment. And if you wait awhile, more often than not it will go back up. I've had comments hit a -2 and then end up being a 5, or even more. It's all organic.

It's best not to overreact because you happened to look when the balance was a little more negative than positive, and it's useful to consider the possibility that there are 9 upvotes and 10 downvotes to make your comment sit at a zero.

I empathize with what you are saying in the parent post about being cautious not to create a firestorm of people who are offended by your post. But this is really only an issue early on when your karma is low, and that's when it feels the most negative. Sometimes I post things that folks don't like, getting a -1 or a -2 or even -3- but ultimately the impact of that comment to your karma is capped. I don't think it's possible to, say, get a -57 (your current karma) and zero out your account.

EDIT: Oh, according to HN Undocumented, it actually does reduce your user karma even if the comment itself can't go past -4 [https://github.com/minimaxir/hacker-news-undocumented?tab=re...]. But it is important to remember that by that point the comment is dead and only users with showdead turned on can even see it anymore. And of course flamewar protection can disable the downvote button (at least the convenient one) when it is triggered, and as a story itself ages downvoting and even upvoting gets automatically turned off.

After you've been posting for a long time and your karma grows into the thousands, having a post be unpopular is basically a non-issue. Obviously you should still adhere to the guidelines and not get yourself removed due to misbehavior, but you just don't have to worry about it much.

Still, this community works best when we take HN's guidelines to heart. Everyone wants to be treated with kindness and the benefit of the doubt, and trying to build that into your own posts helps maintain that community.


I have had previous accounts. https://news.ycombinator.com/user?id=unabst

So now my parent comment is +5 and the comment you responded to is 0. Of course, though shall not mention downvotes is one of the cardinal rules of this club, so warranted again I am sure. Guilty as charged.

I appreciate the empathy. If more people would just lay out their counterarguments and objections, that would be discourse. But so long as downvotes are used the way they are, any discomforting or nonconforming facts will continue to simply be downvoted, which is no environment for challenging existing ideas. An environment for whatever these environments are.

And as long as any system relies on dictatorship and censorship, as all of these systems do, there will never be true intellectual discourse.

Yes, there needs to be a way to encourage responsible behavior. But that is not done through censorship. Not even through democracy really... as modern politics shows.

Then what would? Not a topic of discourse here. Sadly.


Downvotes as punishment for writing reasoned comments and arguments which are true to one's own sensibilities, are an indication that a point of view is "special".

One can start to feel a certain pride in that. (Of course, that shouldn't become the purpose of commenting.)


The internet being what it is, it’s a guarantee that you’ll always find someone who can dislike anything someone else posts, sometimes for no other reason than to be contrary.

Especially when the comment is either 100% factual, e.g. “1+1 does not equal ‘an Iguana’” or where someone clearly states a personal opinion “I’m not partial to eating Iguanas, myself”.


Yes I have done this upon occasion. You just know your going to get downvoted, so you really turn the rationale up to 10 but you know it's still not enough.


I did get to the point where I could tailor a comment for upvotes and guard them from downvotes, at least to some degree... But that's not free speech or free thinking. And all that effort to an audience that is also under the same constraints only makes for a safe but somewhat walking-on-eggshells conversation, at least when it comes to controversial ideas like... space vs tabs and apple vs microsoft. lol

HN is great for what it is. Gathering 2 cents from everyone. I've only had a great time adding my positive 2 cents, and I continue to enjoy learning from everyone else's.


My main issue with HN comments is that the threading display tends to cause focus on a very small number of high ranking comments along with their threads. This leads to potentially insightful comments on the original article being largely unread.

It's a problem where the solution is not obvious. There is perhaps an an opportunity to cycle through a series of candidate solutions.

I imagine something like placing a single comment at the top of comments selected from some formula of view-count and score, so that comments at least get some views to get the chance of being upvoted to the point of being seen by the wider community.


It’s better not to worry about it so much. Yes, top comments are exponentially more read than the rest. But HN is so big now that even the bottom comments get a large number most of the time.

It’s also up to the commenters to strategically place their comments if they care that much. No one forces you to write a toplevel comment when there are already 80. If you have something to say in that situation, and you really want a lot of people to hear it, and you’re pretty sure it’s a good fit for HN (the hardest part), then it usually pays off to post a reply instead. You’ll need to morph what you’re saying slightly so that it doesn’t sound weird, but there are tricks to do so.


Why not just minimize comments as you finish with a thread you find interesting? I start with a root comment, go down the chains as long as they remain interesting, then go back up the to the top level comment at which an interesting branch was and keep reading down it, minimizing branches as I go. Eventually the entire comment section is collapsed and I am done. Miss nothing, and no additional algorithms needed.


I doubt a comment section will ever be more than just that.

I still mourn the death of traditional internet forums. There you had long-running threads of proper discussions and exchange. After some time you got a sense of community and started to see the personalities of each individual. And if things ever got out of hand, you had moderation that would keep everybody in check.

But that was when the web was P2P.


You've put into words exactly what I feel.

"[..] frankly I’m not sure where that lives today"

I wish I knew.

Being a digital native (albeit an old one) and never having been very social in real life I notice that this trend drives me increasingly into real world interactions. I always found trade fairs, conferences, meet-ups and especially networking events a bit ridiculous and yet I attended more of them after the pandemic than in the decades before and very strangely I enjoyed it.

"HN seems to be the last place that had good regular conversation mostly cause of dang"

Amen to that!


Nowadays this is often filled by interest group Discord servers. The more niche the better, tbh. A lot of people think of Twitch streamers or video games when they think of Discord, but I'm in C++, Typescript, Minecraft mod development, Unreal development, and politics discords, just to name a few. There is a lot out there.


No different than the millions of specialty myphp boards back in the day

Reality is that those fall into the same buckets as well - and each over optimized on some emergent characteristics of their userbase

I think the answer for me is: be done with debating altogether, especially on the internet

My tools are sharp enough and at this point I don’t need to go to anonymous internet arguments to refine my philosophy


> Reality is that those fall into the same buckets as well - and each over optimized on some emergent characteristics of their userbase

Over optimized, or perfectly optimized? I'm sure that people debate whether to use #pragma or #define HEADER_H_ on these things, the debate will be better when done amongst two highly skilled C++ developers who care about the corner cases.

That's why I said niche. The broader a community, the less useful that debate will be, if it even qualifies. Having a fresh CS grad explain that #pragma once is supported in the latest Clang, MSVC and GCC and thus there's never a reason to avoid it are right for whatever small university project they are building, but a seasoned developer is annoyed they still need to target an ancient RHEL release on their mission critical open source project. (Disclaimer: I have no idea if this is actually a thing, I figure by now #pragma must be good to go even on older LTS distros)


Really I feel like HN is the worst for discussion. Look at all these long empty comments. There’s nothing of substance here.


Author here

Wow! I didn't expect that! I love these discussions and comments :)

I will do my best to go through all of them :)

Cheers everyone


If you have a chance could you talk a little about the decreased engagement part of the analysis? The 2023 datapoint of idle users especially looks unusual, I commented on it here https://news.ycombinator.com/item?id=39975875


This is super awesome and I spend my morning and breakfast reading it. Btw, you know how to hide real well. I would love to exchange emails but I failed to get a contact. Please email me, I would hopefully be able to learn more of your work. brajeshwar@oinam.com


Another curious data point: id =40000000 is up in a few hours. I thought it would be fun to try to make a submission pointing to that id, while trying to get that id right when it’s up. But it’s too late to start hacking on a script; I’m rather going to bed now. (Anybody else trying, please don’t kill the server or make dang mad).

Edit: oh I missed a 9 in looking at an earlier link, it’ll be a little more than a day from now, unless sama announces a new product, triggering a comment storm.


Damn! I honestly didn't think about it! That would have been really cool! I can't believe how this completely escaped me :D


> This finding suggests that the key to a high score isn’t necessarily landing viral hits, but rather consistently sharing stories.

This is interesting. I find it's very difficult to predict which links do and don't perform on Hacker News, resulting in quite a diverse (to me at least) series of stories. Thanks for the analysis.


It is a very good point, and it is the subject of part 2: I want to understand if the title / website / time of sharing has an impact on the score


I wonder if people rotate their accounts alot here at the one year mark. Which I should be doing...


if nothing else because they dont allow you to delete your account


You may want to use anti-stylometry tools in addition to account switching.

There's already been evidence that analysis to identify and attribute authorship across accounts can work surprisingly well.


the agents enjoy my unique typos though!


Would be cool to see the leaderboard of consistent users (with highest median net upvotes per story). Karma conflates comment karma and post karma, so that’s not really the same thing.


I think the leaderboard is a cool idea!

It doesn't need to be a leaderboard for "all the time", but over a recent window of time.

> Karma conflates comment karma and post karma, so that’s not really the same thing.

I agree, but the limitation that I faced is that I don't have access to the comments upvotes, and I didn't find a satisfying proxy for it either. That is why I dropped it from the analysis.


I always find it so strange that people here have such a high opinion of HN, but outside of this place, almost everyone I know thinks this place is terrible.


2-3 years ago, I thought HN was amazing. Today, I think it's pretty terrible, measured in terms of the accuracy of information shared in comment threads.


I still like the simple interfact tbh ^_^

But I think it can be improved: personalized feed, allow following certain users, ...etc.


> for those treating HN as a competitive platform where karma points serve as the score

I have to imagine that people for whom there’s a primary motivation to be a “winner” in points must quickly lose interest? But maybe I’m just naive to how motivating status could be…


I think there is a pronounced fall-off in the "gamification" interest too as a user progresses in karma. IIRC the last perk for high karma is downvoting comments at 501- no more fun perks to spend your Internet points on beyond that.

I also don't get the sense that people habitually check the karma of people they are reading or replying to anyway, so it doesn't in and of itself grant much in the way of sway or gravitas in the community. That's much more about being notable or recognizable (other than the obvious "post good comments and stories").

And sure, it does help the sense of "being part of the community" to get to 500 where you are trusted to downvote, but let's be real- it only takes a few good story submissions to get you there if you want to get there fast. I only have 3 submissions that aren't Show HN or my own stuff, and 1 of those is 194 points (https://news.ycombinator.com/item?id=38626454).

And I only posted that story because I was hoping Matthew would elaborate on his position here on HN (as well as wanting to hear what HN thought of his take), since he is a well known HN user.


You are doing that right now though because you want attention, don’t you see how ironic your comment is? You’re the noise.


Once new users gain the ability to downvote stuff, actively engaging in discussions increases the risk of losing karma, and lose the one ability that is not available to new users. So if users perceive the main value of Hacker News as a moderated news feed, it seems the best strategy is to earn their 501 points somehow and lurk after that. Maybe that's why user activity falls off after a few years.


There’s also active downvote bots that search for phrases like GOP and flag them immediately… watch.


Originally joined HN as a way to collect opinions for writing research. HN represents a certain slice of the tech world that's it's important to observe.

I still find HN amongst the top places for engaging discussion. It's within the comments, not the postings, that I find eclectic nuggets of gold and learn new things. Moderation is generally very good. At least 10 books I've read in the last year came from HN comments.

One thing I would change is the up-down voting. It adds little. Because I use a text only browser I (assume) don't see voting. So it never occurs to me and have never voted any comment up or down during my stay here. However I find gratuitous flagging and down-voting without the courage to venture a position to be very negative. It simply stymies more challenging discussion, and seems a knee-jerk reaction by an invisible cadre of anti-intellectuals.


I think that upvoting moves the post up in the list. So even with a text-only browser you get the messages sorted by their votes... right?


> Because I use a text only browser I (assume) don't see voting.

Well you would still see the effect of voting. As for the points on comments, only the poster can see it (other than the placement effects, and that low point comments become noticeably grayed out). If you enable showdead (in your settings), you can also see comments that have been flagged or downvoted to death, which I do leave on myself (cause I can take it!)

> However I find gratuitous flagging and down-voting without the courage to venture a position to be very negative. It simply stymies more challenging discussion, and seems a knee-jerk reaction by an invisible cadre of anti-intellectuals.

That's an easy stance to take, but I think it's more nuanced. Sure, when there's no explanatory replies at all it's frustrating to understand the manner in which other readers disagree. But when someone repeatedly posts statements that are very commonly posted by others, and with a basic look around at other interactions is obviously unpopular, disproven or even problematic, it's not exactly the responsibility of passers by to have to consistently regurgitate the same rationale for why it's unpopular or problematic every time the user posts- if the same debunked, disproven, or otherwise disagreeable arguments show up every time an article about Product A hits the front page, I don't think it's fair to measure how just, intellectual, or morally correct the two sides of the disagreement are just by how willing they are to expend energy arguing about it.

In fact, the person with the less popular opinion is going to have an incentive and desire to post assertions of those opinions a lot more than others will have the energy to challenge them.

Also there's no sense posting a sibling reply that says the same thing. If one person already has a reply capturing the essence of what's wrong with the comment and the comment is still being downvoted, it's probably because other readers agree with the reply.

Finally, as I posted elsewhere, a lot of folks see the first changes to their comment's score as indicative of the overall community's opinion but that's rarely the case. Sure you may get a bunch of downvotes in the first hour of posting it, but then over the next few hours that balances out and goes in the other direction. And, complaining about downvotes usually gets more downvotes.


> Well you would still see the effect of voting.

Yes I do. But I don't see any buttons to vote other's posts. Presumably those are one of the few things that use JavaScript on HN. I don't miss that. If I want to interact with another comment I reply, which I think is the right thing to do, or just move on.

> That's an easy stance to take

Do you mean it's simplistic? I don't find it "easy" to be left with a low opinion of fellow posters.

> when there's no explanatory replies at all it's frustrating

Exactly, that's what I dislike. But let's talk about why that's "frustrating".

> statements that are very commonly posted by others

I wouldn't say that characterises my comments. I always try to add something original.

> it's not exactly the responsibility of passers by

We disagree here. Yes it kind of is. Nobody is compelled to vote. It's much better to get a reply, even a negative one, than a vote either up or down. The only reason it even matters is that I learned downvoted posts become invisible, which seems like a form of censorship, no?

> I don't think it's fair to measure how just, intellectual, or morally correct the two sides of the disagreement are just by how willing they are to expend energy arguing about it.

Now we're getting interesting. I strongly believe in the principles of disputation - that is to say - conversations always have a deeper "Socratic/Hegelian" purpose to learn and change, not to "win" or "sway" some barometer. That's why I make the effort with people. I suppose I consider it good manners to reciprocate, and in my mind I equate voting but not commenting with laziness. Therefore I do make a moral judgement about that I feel comfortable with.

> changes to their comment's score as indicative of the overall community's opinion but that's rarely the case.

Sure. Though as I said, popularity contests are not what concern me. It is the deleterious effects of oblique censorship that accrue from casual, unreflective downvoting that seem like fault of HN.

The frustration comes from putting the effort and thought into a comment, only to see it hidden from others by a minority that dislike what is being said.

Like you I notice that after many days my most downvoted posts fly back up to very high values. But that's irrelevant because my focus on that subject has moved on and the opportunity was lost to interact with others because a comment was hidden from them. The vanity of "overall score" doesn't figure in my reasoning at all.


It's much better to get a reply, even a negative one, than a vote either up or down.

It's not, it would make for a much noisier and much less interesting forum. The whole 'receipts for (or instead of!) downvotes' thing is driven by the (entirely real) unpleasantness of getting downvoted. Nobody ever asks for receipts for upvotes.


> It's not

It would be better for me, and for the reasons I participate.

> much less interesting forum.

For me it would make it much more interesting

> Nobody ever asks for receipts for upvotes.

Then let me be the first. Most sincerely, I am equally dismayed by having a bunch of random upvotes and no explanation of why that particular point hit home.

Maybe I need to say it explicitly "Why is this being upvoted?" That could lead to some interesting discussions.


For me it would make it much more interesting

Right but the forum has been around for 15+ years and also aims for interestingness for the many users that are not you.


That's a snarky false dichotomy pvg, and you should know better. There's no reason it cannot be improved no matter how long things have been as they are, and there's no reason it can't be more interesting for me and for others.


It’s a dichotomy you got us into with your own snark. You haven’t described how what you’re suggesting (a very very common HN suggestion) would improve HN. I told you why it thought it wouldn’t.


> You haven’t described how what you’re suggesting (a very very common HN suggestion) would improve HN.

Well that's not true. I've said that I believe getting detailed feedback from a known respondent is better than anonymous up or downvotes, and that since downvotes affect visibility if ones aim is to improve the quality of posting/writing then it's a superior form of feedback.

You call it "noise".

I think that's just where we'll have to differ.

Might I also say that it being a "very, very common" suggestion might bolster my point rather than subtract from it. We're all believers in supply and demand here, right?

Anyway I must hush my tongue as Dang is wagging a finger at me for being too mouthy. :)

respects


It's a long dead thread so you have plenty of mouthiness latitude. As to very very common suggestion? It has an equally common and even more persuasive rebuttal which, again, you've not addressed on its merits at all. You have 'I'd find it interesting', 'I don't like you dismissing it as noise' and 'you say many other people have had this bad idea thus it's not a bad idea'. These are all a little indirect, meta and not really responsive to the counterargument.


There's no mechanism to return to view what you've upvoted, so such a request isn't likely to be fulfilled.

Just think of it like here's the balance of thumbs up and thumbs down, and go from there. We don't live in a world where we have perfect information, unfortunately. If you get downvoted on something, feedback that in to the next time you mention it. It may be bad form to complain about downvoting but it's not bad form to start with "I think this is an unpopular opinion (though I'm not sure why yet) but..." and hope that folks bite and tell you why.


There's no mechanism to return to view what you've upvoted

It's in your profile, unless I'm misunderstanding the mechanism you have in mind.


> > That's an easy stance to take > Do you mean it's simplistic?

I mean it's an easily justifiable take. You could call it simplistic, but I think that's not an exact synonym for "lacks nuance"

> I don't find it "easy" to be left with a low opinion of fellow posters.

My meaning is that it's easy to take the stance that a downvote without a comment is never justifiable, but yet I think there are valid cases where it is, which is what I was getting at.

> > statements that are very commonly posted by others

> I wouldn't say that characterises my comments. I always try to add something original.

I've only read the comment I'm replying to. I didn't go read your comment backlog and tailor my response to how I perceive your behavior. I'm sure your posts are just fine, after all the one I am replying to is a pretty good comment. And I don't see any signs of downvoting.

EDIT: Oh, no it's been downvoted a bit, my bad. It didn't stop a discussion from happening though. I do want to clarify that I'm coming in here to discuss what you've posted, not speaking specifically about whether this post should be downvoted. It's probably somewhat unfair that the original post here is downvoted, but that's almost certainly because folks have the same opinion that I do, that it is in fact warranted to downvote without comment sometimes. And, by luck of me coming in here, I have satisfied your request about getting context on why you're being downvoted!! But to help out, I gave you an upvote :-)

> > it's not exactly the responsibility of passers by

> We disagree here. Yes it kind of is. Nobody is compelled to vote. It's much better to get a reply, even a negative one, than a vote either up or down. The only reason it even matters is that I learned downvoted posts become invisible, which seems like a form of censorship, no?

Censorship comes from a position of power. No one user on HN has more power than you, save for dang, who is known to wield it very well.

It's kind of strange to declare the ability for the population to influence what words sit at the top as censorship. After all, downvotes have the inverse effect of pushing the more interesting stuff up, while acting as a way for the community to temper itself, and act as a signal that you're going to have to do better to prove your point than what you did. Because of HN's format, I do not want to have to see 400 comments saying "I do not agree." with no further commentary. If we eliminate the downvote, then that will be what happens.

And the reason I'm certain it will happen is the same reason that I mention the disparity between the desire for holders of unpopular opinions to flood spaces with those opinions and the relative lack of energy of people to continuously fight those unpopular opinions. If 10 people post the same uninformed comment, I can certainly respond to every one clarifying their ignorance, but a lot of people are going to read that comment and miss my clarification. I do think it's important for the rationale for downvoting to be present, but I just contend that there is not an absolute that says we should simply put a small asterisk next to each thing that is unreasonable and hope that everyone reads the full context.

> > I don't think it's fair to measure how just, intellectual, or morally correct the two sides of the disagreement are just by how willing they are to expend energy arguing about it.

> Now we're getting interesting. I strongly believe in the principles of disputation - that is to say - conversations always have a deeper "Socratic/Hegelian" purpose to learn and change, not to "win" or "sway" some barometer. That's why I make the effort with people. I suppose I consider it good manners to reciprocate, and in my mind I equate voting but not commenting with laziness. Therefore I do make a moral judgement about that I feel comfortable with.

This is all fair, and is very true in a Socratic dialog, but that's not the nature of online forums, let alone a tree structure like HN. This is not a 1:1 ordered structured dialog. If I want my opinion to have more sway, I can post it as a leaf node on every part of the tree. If I do that, should I expect you to repeat your rationale for why I'm wrong on each leaf post I make? And on top of that, ignorance is often more common than enlightenment. Especially for heated or otherwise impactful stories, it is very common for the same argument to be posted by tens or hundreds of independent commenters, all in good faith. While yes, in an ideal world each one of them deserves to know why they are being downvoted, it's also expected of them to read at least some other parts of the comment tree, where arguments similar or identical to theirs have indeed been responded to with Socratic fervor already, several hours ago.

> The frustration comes from putting the effort and thought into a comment, only to see it hidden from others by a minority that dislike what is being said.

This gets at a strategic thing that some don't realize about HN: you do not, and in many cases probably should not, post at the top level. Posts at the top level which are at the top of the feed usually have been there for awhile-- they have a gravity by way of having a lot of points. Often, there will be whole pages of subreplies on them. You don't need a single downvote for your toplevel comment to be at the top for 1 minute and immediately drop to page 3.

Instead, discuss on the leafs, or make your opinion known somewhere else in the tree. When you do that, your comment can only drop to the bottom of that tree node unless it is killed by flag or downvote. If you have a 1 point comment that gets no activity, it will just sit in that tree.

Furthermore, if you approach writing your comments gracefully, in a way that gives other participants the benefit of the doubt, and has humility in case your wrong, it is exceedingly rare to be downvoted to death. In my entire time on HN across thousands of posts, it has happened to me maybe once or twice-- and I am not always as agreeable as I'm being on this post.

> only to see it hidden from others by a minority that dislike what is being said.

I do also want to remind you that your comment points are a balance. If you have a 1, that can be 1+0-0 or 1+10-10. If 10 people like your comment and one person dislikes it, that's still 10 points. It is a balance system.

> Like you I notice that after many days my most downvoted posts fly back up to very high values. But that's irrelevant because my focus on that subject has moved on and the opportunity was lost to interact with others because a comment was hidden from them. The vanity of "overall score" doesn't figure in my reasoning at all.

Yeah that's true when it's a top level comment. Again I advise only doing top level comments strategically. When it's a leaf node, shown as one of 3 comments in reply to something else, you are not losing anything-- only when a comment becomes [dead] is it removed from the tree. And I for one will still see your comment, because I have showdead turned on. I recommend it! Truly TOS breaking comments are removed anyway, and it's a good way to understand what the overall community will push out of the tree.


I enjoyed reading all of that. Thank you. Maybe it's the nature of this thread, and it being "meta" but - well I'll just say it; there's a higher quality of commentary and individuals going on in here. You clearly make an effort and care enough about the discussion. Not that I don't like "tabloid" or superficial and playful ones too. Those Slashdot-like runs of jokes, wit and memes are rare here but very enjoyable.

Not to get too philosophical but it comes back to the question of "why we're here".

What I said when I started this discussion remains true. I take the responses people give here seriously (not personally), as in I'll file them in my org notes with extra commentary - the kind I wish I could distil by analysis like; "dude seems really delighted with this idea and is the third person who likes it" or "fell on deaf ears again; reformulate".

Clearly everyone comes to a forum for different reasons. Some to have their ideas validated. Some to look for inspiration. Some to just pass the time idly. Some to pick a fight. Some to spread their views. Some to heal and comfort....

So the machinery will not work perfectly for everyone. My complaints then, are about the machinery and how it (mis)shapes behaviours, not the people. People will be active or lazy as the tools afford them.

But then, I'm probably a weirdo outlier in my reason/use of forums like this - because I use it instrumentally to "people watch" and get to the bottom of their emotional and ethical relation to technology. (And a little confessionally - to share/sympathise)

The more mundane and selfish use I have is to promote the Cybershow (blog/podcast) I work on. Generally I find that well received because "self promotion" is something everybody shares these days.

Anyway thanks for the tips. My feeling remains that the down-voting as currently formulated harms discussion. It needs some tweaking.


Everything needs tweaking, but it's always a tradeoff. Tweak in one direction to optimize variable A and seemingly unrelated variable B becomes unoptimized. If there was a perfect system we'd have found it by now; I'm actually very confident there is not one that exists, unfortunately.

Anyway, yeah it's been enjoyable for me too. Thanks for the good discussion :-D


Here is a dead comment in this thread: https://news.ycombinator.com/item?id=39974997

It got downvoted almost certainly because they implied that doomer is the opposite of productive. Would this have been the right place to start a whole debate about whether folks who are realistic about the climate crisis are productive members of society? Probably not.

Sure, they might not have meant it that way, and sure, it's one thing at the end of their post that honestly shouldn't have killed it, but human nature is such, and that probably tells you people are tired of hearing that kind of take.


> Doomer Vs Productive

You see, that's fascinating!

At least for me. Because it acts like a sonar ping revealing the hidden landscape of cultural assumptions and prejudices beneath the surface here.

Also, maybe you're completely wrong. Maybe the reason the comment was attacked is something else.

That's why it amplifies the gripe about blind-downvoting. Surely it would be much better to be able to click on the [dead] link and see "Reason: I dislike the implication that a pessimistic outlook correlates to non-productivity".


> Also, maybe you're completely wrong. Maybe the reason the comment was attacked is something else.

Absolutely, total guess here. It's reading the tea leaves of group behavior.


The author "found out" that sama = Sam Altman nad pg = Paul Graham, by his surprise.

It is well known that Hacker News owner company, Y Combinator had Altman as his CEO before he joined OpenAI.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: