# "Pivotal questions" initiative

## The Pivotal Questions project in brief

[The Unjournal](http://unjournal.org) commissions public evaluations of impactful research in quantitative social sciences fields. We are seeking ‘pivotal questions’ to guide our choice of research papers to commission for evaluation. We are reaching out to organizations that aim to use evidence to do the most good, and asking: Which open questions most affect your policies and funding recommendations? For which questions would research yield the highest ‘value of information’?<br>

Our main approach has been to search for papers and then commission experts to publicly evaluate them. (For more about our process, see [here](https://globalimpact.gitbook.io/the-unjournal-project-and-communication-space/policies-projects-evaluation-workflow)). Our field specialist teams search and monitor prominent research archives (like [NBER](https://www.nber.org/papers?page=1\&perPage=50\&sortBy=public_date#listing-77041)), and consider [agendas from impactful organizations](https://airtable.com/applDG6ifmUmeEJ7j/shrQkVhLlJSpRKOGY), while keeping an eye on forums and social media. Our approach has largely been to look for research that seems relevant to impactful questions and crucial considerations. We're now exploring turning this on its head and identifying pivotal questions first and evaluating  a cluster of research that informs these. This could offer a more efficient and observable path to impact.  (See our[ ‘logic model’ flowchart for our theory of change](https://globalimpact.gitbook.io/the-unjournal-project-and-communication-space/benefits-and-features/global-priorities-theory-of-change) for context.)<br>

## The process

#### Elicit questions

The Unjournal will ask impact-focused research-driven organizations such as GiveWell, Open Philanthropy, and Charity Entrepreneurship to identify specific [quantifiable questions](#user-content-fn-1)[^1]. that impact their funding, policy, and research-direction choices. For example, if an organization is considering whether to fund a psychotherapeutic intervention in a LMIC, they might ask “How much does a brief course of non-specialist psychotherapy increase happiness, compared to the same amount spent on direct cash transfers?” We’re looking for the questions with the highest value-of-information (VOI) for the organization’s work over the next few years. We have some requirements — the questions should relate to The Unjournal’s coverage areas and engage rigorous research in economics, social science, policy, or impact quantification. Ideally, organizations will identify at least one piece of publicly-available research that relates to their question. But we are doing this mainly to help these organizations, so we will try to keep it simple and low-effort for them.

#### Select, refine, and get feedback on the target questions

The Unjournal team will then discuss the suggested questions, leveraging our field specialists’ expertise. We’ll rank these questions, prioritizing at least one for each organization. We’ll work with the organization to specify the priority question precisely and in a useful way. We want to be sure that 1. evaluators will interpret these questions as intended, and 2. the answers that come out are likely to be actually helpful. We’ll make these lists of questions public and solicit general feedback — on the relevance of the questions, on their framing, on key sub-questions, and on pointers to relevant research.

Where practicable, we will operationalize the target questions as a claim on a prediction market (for example, Metaculus) to be resolved by the evaluations and synthesis below.

**Where feasible, post these on public prediction markets (such as Metaculus)**&#x20;

If the question is well operationalized, and we have a clear approach to 'resolving it' after the evaluations and synthesis, we will post it on a reputation-based market like [Metaculus](https://metaculus.com/) or [Manifold](https://app.gitbook.com/s/scEoiIiYYQByE1FaibWQ/tools-and-examples/cole_haus-modeling). Metaculus is offering 'minitaculus' platforms such as [this one on Sudan](https://www.metaculus.com/project/Sudan/) to enable these more flexible questions. &#x20;

#### Elicit stakeholder beliefs

We will ask (and help) the organizations and interested parties to specify their own beliefs about these questions, aka their 'priors'. We may adapt the Metaculus interface for this.

#### Source and prioritize research informing the target questions

Once we’ve converged on the target question, we’ll do a variation of our usual evaluation process.

For each question we will prioritize roughly two to five [relevant research papers](#user-content-fn-2)[^2]. These papers may be suggested by the organization that suggested the question, sourced by The Unjournal, or discovered through community feedback ([see note](#user-content-fn-3)[^3]).&#x20;

#### Commission expert evaluations of research, informing the target questions

As we normally do, we’ll have ‘evaluation managers’ recruit [expert evaluators to assess each paper](#user-content-fn-4)[^4]. However, we’ll ask the evaluators to [focus on the target question](#user-content-fn-5)[^5], and to consider the target organization’s priorities.&#x20;

We’ll also [enable phased deliberation and discussion among evaluators](#user-content-fn-6)[^6]. This is inspired by the[ repliCATS project](https://replicats.research.unimelb.edu.au/), and some evidence suggesting that the (mechanistically aggregated) estimates of experts after deliberations [perform better](#user-content-fn-7)[^7] than their independent estimates (also mechanistically aggregated). We may also facilitate collaborative evaluations and ‘live reviews’, following the examples of [ASAPBio](https://asapbio.org/crowd-preprint-review), [PREreview](https://prereview.org/live-reviews), and others.

#### Get feedback from paper authors and from the target organization(s)

We will contact both the research authors (as per our standard process) and the target organizations for their responses to the evaluations, and for follow up questions. We’ll foster a productive discussion between them (while preserving anonymity as requested, and being careful not to overtax people’s time and generosity)

#### Prepare a “Synthesis Report”

[We’ll commission one or more](#user-content-fn-8)[^8] evaluation managers to write a report as a summary of the research investigated.&#x20;

These reports should synthesize “What do the research,  evaluations, and responses say about the question/claim?” They should provide an overall metric relating to the truth value of the target question (or similar for the parameter of interest).  If and when we integrate prediction markets, they should decisively resolve the market claim.

Next, we will share these synthesis reports with authors and organizations for feedback.

#### (Where applicable) Resolve the prediction markets&#x20;

#### Complete and publish the ‘target question evaluation packages’

We’ll put up each evaluation on our[ Unjournal.pubpub.org](http://unjournal.pubpub.org) page, bringing them into academic search tools, databases, bibliometrics, etc. We’ll also curate them, linking them to the relevant target question and to the synthesis report..

We will produce, share, and promote further summaries of these packages. This could include forum and blog posts summarizing the results and insights, as well as interactive and visually appealing web pages. We might also produce less technical content, perhaps submitting work to outlets like[ Asterisk](https://asteriskmag.com/), [Vox](https://www.vox.com/future-perfect), or [worksinprogress.co](https://worksinprogress.co/).&#x20;

### ‘Operationalizable’ questions

At least initially, we’re planning to ask for questions that could be definitively answered and/or measured quantitatively, and we will help organizations and other suggesters refine their questions to make this the case. These should approximately resemble questions that could be posted on forecasting platforms such as [Manifold Markets](https://manifold.markets/) or [Metaculus](https://www.metaculus.com/home/).  These should also somewhat resemble the ['claim identification'](https://docs.google.com/document/d/1mBkAmCVomcUt0Ks7hsxShTsjAbx3WVtFfMCnasGQxns/edit) we currently request from evaluators.&#x20;

We give detailed guidance with examples below:

{% content-ref url="pivotal-questions-initiative/operationalizable-questions" %}
[operationalizable-questions](https://globalimpact.gitbook.io/the-unjournal-project-and-communication-space/~/changes/536/pivotal-questions-initiative/operationalizable-questions)
{% endcontent-ref %}

*Why do we want these pivotal questions to be 'operationalizable'?*

{% content-ref url="pivotal-questions-initiative/why-operationalizable-questions" %}
[why-operationalizable-questions](https://globalimpact.gitbook.io/the-unjournal-project-and-communication-space/~/changes/536/pivotal-questions-initiative/why-operationalizable-questions)
{% endcontent-ref %}

### How you can help us&#x20;

#### Give us feedback on this proposal

We’re still refining this idea, and looking for your suggestions about what is unclear, what could go wrong, what might make this work better, what has been tried before, and where the biggest wins are likely to be. We’d appreciate your feedback! (Feel free to email <contact@unjournal.org> to make suggestions or arrange a discussion.)

#### Suggest organizations and people we should reach out to&#x20;

#### Suggest target questions

If you work for an impact-focused research organization and you are interested in participating in our pilot, please reach out to us at <contact@unjournal.org> to flag your interest and/or complete [this form](https://coda.io/form/Expression-of-Interest-for-The-Unjournal-Pilot_dUpq6ZxNtdC).  We  would like to see:

* A brief description of what your organization does (your ‘about us’ page is fine)
* A specific, [operationalized](https://docs.google.com/document/d/1rOp9_7g7wG_0gEGKWEL_dCgZE4tlrjYhfZTTUlZcmBs/edit#heading=h.lmscceyw2s4z), high-value claim or research question you would like to be evaluated, that is within our scope (\~quantitative social science, economics, policy, and impact measurement)
* A brief explanation of why this question is particularly high value for your organization or your work, and how you have tried to answer it&#x20;
* If possible, a link to at least one research paper that relates to this question&#x20;
* Optionally, your current beliefs about this question (your ‘priors’)

Please also let us know how you would like to engage with us on refining this question and addressing it. Do you want to follow up with a 1-1 meeting? How much time are you willing to put in? Who, if anyone, should we reach out to at your organization?

Remember that we plan to make all of this analysis and evaluation public.

If you don’t represent an organization, we still welcome your suggestions, and will try to give feedback.&#x20;

([Note on 'bounties](#user-content-fn-9)[^9]'.) &#x20;

Please remember that we currently focus on quantitative \~social sciences fields, including economics, policy, and impact modeling (see [here](https://globalimpact.gitbook.io/the-unjournal-project-and-communication-space/policies-projects-evaluation-workflow/considering-projects/what-specific-areas-do-we-cover) for more detail on our coverage). Questions surrounding (for example) technical AI safety, microbiology, or measuring animal sentience are less likely to be in our domain.                                          &#x20;

If you want to talk about this first, or if you have any questions, please send an email or [schedule a meeting](https://calendly.com/daaronr) with David Reinstein, our co-founder and director. <br>

[^1]: We may later expand this to somewhat more open-ended and general questions; see discussion in later sections.

[^2]: Or dynamic ‘projects’, or non-academic rigorous work — see[ discussion here](https://globalimpact.gitbook.io/the-unjournal-project-and-communication-space/benefits-and-features/dynamic-documents-vs-living-projects), and notes on our ‘[applied stream](https://globalimpact.gitbook.io/the-unjournal-project-and-communication-space/policies-projects-evaluation-workflow/considering-projects/applied-and-policy-track-trial)’.

[^3]: We discuss how this relates to our typical rules for ‘what we need permission to evaluate’ [here](https://coda.io/d/_ddIEzDONWdb/Evaluating-Pivotal-Questions_suamu#_luJvW).

[^4]: Naturally, we may ask some experts to evaluate multiple papers within the same question or theme.

[^5]: This could be integrated with the “claim evaluation” section we’re[ introducing](https://docs.google.com/document/d/1mBkAmCVomcUt0Ks7hsxShTsjAbx3WVtFfMCnasGQxns/edit#heading=h.ljcrdyqus3l8) to our evaluation forms (see [here](https://coda.io/form/Unjournal-evaluation-form-applied-stream_dkjUPyzvHoH)). We’ll also ask them to evaluate the paper according to The Unjournal’s [standard](https://globalimpact.gitbook.io/the-unjournal-project-and-communication-space/policies-projects-evaluation-workflow/evaluation/guidelines-for-evaluators) or [applied stream](https://coda.io/form/Unjournal-evaluation-form-applied-stream_dkjUPyzvHoH) guidelines. But we’ll cut them some slack here, and offer additional compensation for the extra work.

[^6]: We have plans to do this in general (see[ sketch here](https://coda.io/d/_ddIEzDONWdb/_sujIB#_luRE_)). This seems particularly promising for this pivotal questions project, as we have a more well-defined and measurable task.

[^7]: Here, we’re relying on Anca Hanea, a member of our Advisory Board who focuses on aggregating expert judgment. Academic work such as[ Rowe and Wright 2001](https://www.semanticscholar.org/paper/Expert-Opinions-in-Forecasting%3A-The-Role-of-the-Rowe-Wright/e315327ee3c6eebbb18152b9d9d97c1e31006b58) (“Delphi groups are somewhat more accurate than statistical groups (which are made up of noninteracting individuals whose judgments are aggregated)”) also seems to support this point.

[^8]: See details [here](https://coda.io/d/_ddIEzDONWdb/Evaluating-Pivotal-Questions_suamu#_luNnx).

[^9]: As noted above, we may offer bounties in the future for suggestions that we engage with. Any such bounty will also apply retroactively, to suggestions made in response to this post.