The function of repeatability in scientific experiments

Deleted User March 30, 2021 at 13:10 #516542 0 likes

This user has been deleted and all their posts removed.

bert1 March 30, 2021 at 13:14 #516543 0 likes

Quoting tim wood

The only improvement on these just being silent.

Do you mean I should not have posted? Mods can delete if they want.

javi2541997 March 30, 2021 at 13:29 #516546 0 likes

Reply to bert1

I also think that repeatability in science is there to check the results we got previously in our analysis. But, beyond this, repeatability could also help us improve the hypothesis itself. If you want to make a solid statement, I guess you should repeat the experiment until you believe it is sufficiently proven.

bert1 March 30, 2021 at 13:32 #516547 0 likes

Quoting javi2541997

I also think that repeatability in science is there to check the results we got previously in our analysis. But, beyond this, repeatability could also help us improve the hypothesis itself. If you want to make a solid statement, I guess you should repeat the experiment until you believe it is sufficiently proven.

I see, thanks.

bert1 March 30, 2021 at 13:32 #516548 0 likes

I can give an example if that would help.

Deleted User March 30, 2021 at 13:33 #516549 0 likes

This user has been deleted and all their posts removed.

bert1 March 30, 2021 at 13:40 #516552 0 likes

Quoting tim wood

Not at all. Had you been silent we would not have had from you such a gem.

Oh! Thanks. I didn't understand. I'll process what you said when I can.

T Clark March 30, 2021 at 14:47 #516564 0 likes

Quoting bert1

The function of repeatability in experiments is NOT to confirm a hypothesis.

The function of repeatability is to check the reliability of the experimental result.

What's the difference? At least under most circumstances.

Quoting bert1

A single unrepeated experiment, if reliable, is enough to refute a hypothesis. You don't have to do it again.

So, how do you demonstrate that an experiment is reliable? I can think of two ways: 1) review the premises, procedures, data, calculations, and conclusions for errors and unsound interpretations, and 2) rerun the experiment. How far you have to go to show an experiment is reliable depends on the consequences of being wrong. If people's lives or millions of dollars are on the line, maybe you do need to rerun the experiment after all.

fdrake March 30, 2021 at 15:29 #516571 0 likes

Quoting bert1

The function of repeatability in experiments is NOT to confirm a hypothesis.

:up:

If you repeat a measurement under the same conditions in an experiment, the goal of that is usually to take an average; establishing concordance and forming a variance reduced estimate of the true value you're measuring.
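To make the "variance reduced estimate" point concrete, here is a minimal simulation sketch. The true value, the noise level, and the function names are all made up for illustration; the only claim is the textbook one that the spread of an average of n independent readings shrinks roughly like 1/sqrt(n).

```python
import random
import statistics


def mean_of_repeats(true_value, noise_sd, n_repeats, rng):
    """Average n_repeats noisy readings of the same underlying quantity."""
    readings = [rng.gauss(true_value, noise_sd) for _ in range(n_repeats)]
    return statistics.fmean(readings)


def spread_of_estimate(n_repeats, trials=2000, true_value=10.0, noise_sd=2.0, seed=0):
    """Standard deviation of the averaged estimate across many simulated experiments.

    true_value and noise_sd are arbitrary illustrative numbers, not real data.
    """
    rng = random.Random(seed)
    estimates = [
        mean_of_repeats(true_value, noise_sd, n_repeats, rng) for _ in range(trials)
    ]
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Each fourfold increase in repeats should roughly halve the spread.
    for n in (1, 4, 16, 64):
        print(n, round(spread_of_estimate(n), 3))
```

The averaged estimate gets tighter with more repeats, which is the sense in which repeating under the same conditions buys precision rather than confirmation.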

If you repeat a measurement under different conditions in an experiment, in part that's trying to find out how the measured response varies with the stimulus/treatment, in part that's trying to find out how that response varies with contextual factors, in part (nowadays) that's trying to assess whether and how the stimulus/treatment's response itself varies with contextual factors. On this level, "repeating a measurement" is pretty much the core of a controlled experiment.
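A rough sketch of the "response itself varies with contextual factors" case, using a toy 2x2 design (treated or not, context 0 or 1). Every number in it is invented for illustration, and comparing cell means is just one simple way to read off an interaction, not a claim about how any particular study does it.

```python
import random
import statistics


def simulate_cell(treated, context, n, rng):
    """Noisy responses for one (treatment, context) cell. All effect sizes are made up."""
    base = 5.0
    treatment_effect = 1.0 * treated
    context_effect = 0.5 * context
    interaction = 2.0 * treated * context  # the treatment does more in context 1
    mean = base + treatment_effect + context_effect + interaction
    return [rng.gauss(mean, 1.0) for _ in range(n)]


def estimate_effects(n=5000, seed=1):
    """Return (treatment effect in context 0, in context 1, their difference)."""
    rng = random.Random(seed)
    cell_mean = {
        (t, c): statistics.fmean(simulate_cell(t, c, n, rng))
        for t in (0, 1)
        for c in (0, 1)
    }
    effect_in_c0 = cell_mean[(1, 0)] - cell_mean[(0, 0)]
    effect_in_c1 = cell_mean[(1, 1)] - cell_mean[(0, 1)]
    return effect_in_c0, effect_in_c1, effect_in_c1 - effect_in_c0


if __name__ == "__main__":
    print(estimate_effects())
```

The third number is the interaction: how much the treatment's effect changes between contexts. You can only see it by repeating the measurement under different conditions.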

If you're repeating an entire experiment, there's some wiggle room in practice regarding what counts as a repeat. There's the hypothetical "exact replication", where you do literally everything the same, and the "conceptual replication", where you try to ape the experimental conditions but can't match them exactly. I doubt those form an exhaustive typology of replications, but the purpose of both isn't easily reducible to confirming or testing a previously held hypothesis in most cases, simply because the overall setup in the initial experiment isn't identical, or necessarily even equivalent in all relevant respects, to the replication attempt.

That "lack of identity" (arguably) shows up in the difference in replication rates between papers where the initial researcher group is represented in the reproduction team and where they are not.

Quoting bert1

The function of repeatability is to check the reliability of the experimental result.

:up: Largely, I think so, at least up to what's intended by "reliability".

I would make the claim that the function of reproduction attempts/replication attempts in science isn't to check the reliability of any individual result; most results are false and over-simplifications and everyone knows this; the overall function is to make the process of scientific discovery in the aggregate not spend too long on "clear" falsehoods and inaccuracies, it's a quality control thing. What counts as a "clear falsehood" only makes sense in light of reproducibility.

Another angle on repeatability is that if you're repeating the experiment, manage it exactly, and the effect doesn't show up the same as before, that doesn't necessarily mean the conclusions of the initial experiment were false - it might be that the response is contextually variable, it might be a contextual interaction - both experiments could be samples of a distribution associated with the "true effect" indexed by contexts and their variables. The latter approach, to my understanding, is the one favoured by Gelman and his group.
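A toy version of the "samples of a distribution indexed by contexts" idea, as a sketch only. Each experiment first draws its own context-specific true effect and then estimates it from noisy units. The parameter values and function names are assumptions chosen for illustration, and this is not meant to represent any specific model from Gelman's group.

```python
import random
import statistics


def run_experiment(mean_effect, context_sd, noise_sd, n, rng):
    """One experiment: draw its context-specific true effect, then estimate it from n noisy units."""
    true_effect_here = rng.gauss(mean_effect, context_sd)
    observations = [rng.gauss(true_effect_here, noise_sd) for _ in range(n)]
    return statistics.fmean(observations)


def replication_gaps(context_sd, pairs=500, n=200, mean_effect=0.3, seed=2):
    """Average absolute gap between an original estimate and its replication.

    mean_effect, n, and the unit noise_sd of 1.0 are arbitrary illustrative values.
    """
    rng = random.Random(seed)
    gaps = []
    for _ in range(pairs):
        original = run_experiment(mean_effect, context_sd, 1.0, n, rng)
        replication = run_experiment(mean_effect, context_sd, 1.0, n, rng)
        gaps.append(abs(original - replication))
    return statistics.fmean(gaps)


if __name__ == "__main__":
    print("no contextual variation:", round(replication_gaps(0.0), 3))
    print("contextual variation:   ", round(replication_gaps(0.5), 3))
```

With contextual variation switched on, originals and replications disagree far more often, even though every run is an honest, well-powered estimate of the effect in its own context.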

Quoting bert1

A single unrepeated experiment, if reliable, is enough to refute a hypothesis. You don't have to do it again.

I think that depends too, on the role of the non-repeat. If you see it in the context of a contextually variable interaction, it's not a refutation but evidence that the effect is contextual if it exists (and that starts a process of compensation, of making it smaller compared to context-induced imprecision: "exaggeration factors", "the garden of forking paths", and analysing the true power of the study/broader scientific endeavour). If you see it in the context of everything really being set up exactly the same, the effect's probably not there as it was theorised. But what if the "exact replication" must reproduce the contextual ambiguities of the initial one? Then it still doesn't mean the effect's not there/is 0* if the second one comes out differently; it could be that the ambiguities were realised differently in the two experiments.

In that kind of case, if the ambiguities are enough to swamp the signal, it's reasonable to say the treatment as intended or the effect as theorised has little to no evidence that it exists... Probably.

Edit*: its expectation could be 0, but it could still have high contextual variability...

fdrake March 30, 2021 at 15:39 #516574 0 likes

So:

Quoting bert1

The function of repeatability is to check the reliability of the experimental result.

:up: (IMO)

Up to precisely what's meant by "reliability", anyway.

bert1 April 18, 2021 at 20:44 #524439 0 likes

Reply to fdrake Many thanks for your excellent answer. It's taken me a while to get back to it.

Quoting fdrake

If you repeat a measurement under the same conditions in an experiment, the goal of that is usually to take an average; establishing concordance and forming a variance reduced estimate of the true value you're measuring.

:nod:

Quoting fdrake

If you repeat a measurement under different conditions in an experiment, in part that's trying to find out how the measured response varies with the stimulus/treatment, in part that's trying to find out how that response varies with contextual factors, in part (nowadays) that's trying to assess whether and how the stimulus/treatment's response itself varies with contextual factors. On this level, "repeating a measurement" is pretty much the core of a controlled experiment.

Indeed.

Quoting fdrake

If you're repeating an entire experiment, there's some wiggle room in practice regarding what counts as a repeat. There's the hypothetical "exact replication", where you do literally everything the same, and the "conceptual replication", where you try to ape the experimental conditions but can't match them exactly. I doubt those form an exhaustive typology of replications, but the purpose of both isn't easily reducible to confirming or testing a previously held hypothesis in most cases, simply because the overall setup in the initial experiment isn't identical, or necessarily even equivalent in all relevant respects, to the replication attempt.

I hadn't thought of that specifically, thanks.

Quoting fdrake

That "lack of identity" (arguably) shows up in the difference in replication rates between papers where the initial researcher group is represented in the reproduction team and where they are not.

That's interesting.

Quoting fdrake

I would make the claim that the function of reproduction attempts/replication attempts in science isn't to check the reliability of any individual result; most results are false and over-simplifications and everyone knows this; the overall function is to make the process of scientific discovery in the aggregate not spend too long on "clear" falsehoods and inaccuracies, it's a quality control thing. What counts as a "clear falsehood" only makes sense in light of reproducibility.

Sure. I think by 'results' you mean conclusions/interpretations rather than data?

Quoting fdrake

Another angle on repeatability is that if you're repeating the experiment, manage it exactly, and the effect doesn't show up the same as before, that doesn't necessarily mean the conclusions of the initial experiment were false - it might be that the response is contextually variable, it might be a contextual interaction - both experiments could be samples of a distribution associated with the "true effect" indexed by contexts and their variables. The latter approach, to my understanding, is the one favoured by Gelman and his group.

Another very good point I hadn't thought of.

Quoting fdrake

I think that depends too, on the role of the non-repeat. If you see it in the context of a contextually variable interaction, it's not a refutation but evidence that the effect is contextual if it exists (and that starts a process of compensation, of making it smaller compared to context-induced imprecision: "exaggeration factors", "the garden of forking paths", and analysing the true power of the study/broader scientific endeavour). If you see it in the context of everything really being set up exactly the same, the effect's probably not there as it was theorised. But what if the "exact replication" must reproduce the contextual ambiguities of the initial one? Then it still doesn't mean the effect's not there/is 0* if the second one comes out differently; it could be that the ambiguities were realised differently in the two experiments.

In that kind of case, if the ambiguities are enough to swamp the signal, it's reasonable to say the treatment as intended or the effect as theorised has little to no evidence that it exists... Probably.

Yes, I completely glossed over all that nuance. My initial post popped into my head by considering vague memories of my philosophy of science on falsificationism and confirmationism. I found all that really interesting at the time but forgot most of it.

Thank you, that was really interesting and helpful.
