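Get peer review feedback in minutes, not months

Jenni scores your draft against standard peer review criteria and leaves specific, actionable suggestions directly in your document. Fix problems before reviewers see them.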

Loved by over 6 million academics

Beyond Detection: A Framework for Ethical AI Integration in Academic Research

The proliferation of generative AI in academic contexts has revealed a fundamental truth that institutions have been reluctant to acknowledge:

The detection paradigm has failed.

AI detection tools achieve accuracy rates often below 80% in independent testing (Wakjira et al., 2025). Their false positive rates can be as high as 50% across widely-used platforms (Weber-Wulff et al., 2023). There is also documented systematic bias, with over 61% of non-native English writing flagged as AI-generated (Liang et al., 2023). The current approach of "detect and punish" thus creates more harm than it prevents. Studies indicate that 13.5% to 22.5% of academic papers now show evidence of AI assistance (Kobak et al., 2025).

The path forward requires abandoning unreliable surveillance in favor of transparency architectures: tools and policies designed from inception to make AI contributions visible, auditable, and appropriately constrained.

Part I: The epistemological limits of AI detection

Contemporary AI detection rests on a brittle assumption: that the statistical fingerprints of machine-generated prose remain stable, distinguishable from human writing, and resistant to even modest paraphrase. Each of these premises dissolves under sustained scrutiny. Modern generative systems are trained on the same authoritative corpora that high-quality human writing draws from, and their outputs converge on precisely the registers detectors are calibrated to flag as natural (Sadasivan et al., 2024). The result is a moving target that detectors cannot follow without retraining on every new model generation — a posture that is neither operationally nor epistemologically sustainable.

Empirical work over the past eighteen months has documented this drift in granular detail. When evaluated on out-of-distribution writing — graduate theses, technical manuscripts, translated passages — detector accuracy collapses well below the threshold required for any high-stakes adjudication (Liang et al., 2023; Sadasivan et al., 2024). A meta-analysis of fourteen commercial detectors found a median accuracy of 39.5% on lightly paraphrased text — a figure that is not merely poor but actively misleading. Institutions deploying these systems are operating below the level of a coin flip while presenting their judgments as forensic evidence.

1.1 The base-rate fallacy in detection deployment

Even a hypothetical detector with 95% sensitivity and 95% specificity — performance no current system approaches — produces an unacceptable error rate when applied across populations where undisclosed AI use is rare. If 5% of submissions involve a genuine policy violation, applying such a detector to a class of 400 students correctly flags 19 of the 20 actual cases while wrongly accusing roughly 19 honest students. Real detectors operating below 80% accuracy push the false accusation rate beyond what any educational institution can ethically sustain (Fleckenstein et al., 2024).
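The arithmetic in this paragraph can be checked with a short sketch. The function name `screening_outcomes` and its dictionary keys are illustrative choices, not part of any detector's API, and the second scenario (80% sensitivity and 80% specificity) is an assumed reading of "below 80% accuracy" used only to show the direction of the effect.

```python
def screening_outcomes(n_students, prevalence, sensitivity, specificity):
    """Expected counts when a detector screens a whole class.

    prevalence  : share of submissions with a genuine violation
    sensitivity : share of real violations the detector flags
    specificity : share of honest submissions the detector clears
    """
    violators = n_students * prevalence
    honest = n_students - violators
    true_positives = violators * sensitivity
    false_positives = honest * (1 - specificity)
    flagged = true_positives + false_positives
    # Positive predictive value: chance that a flagged student actually violated policy.
    ppv = true_positives / flagged if flagged else 0.0
    return {
        "violators": violators,
        "true_positives": true_positives,
        "false_positives": false_positives,
        "ppv": ppv,
    }


# Scenario from the text: 400 students, 5% prevalence, 95% / 95% detector.
ideal = screening_outcomes(400, 0.05, 0.95, 0.95)
print(f"true positives:  {ideal['true_positives']:.1f}")
print(f"false positives: {ideal['false_positives']:.1f}")
print(f"PPV:             {ideal['ppv']:.2f}")

# Assumed scenario (not from the text): the same class with an 80% / 80% detector.
weaker = screening_outcomes(400, 0.05, 0.80, 0.80)
print(f"weaker detector PPV: {weaker['ppv']:.2f}")
```

Under the stated figures, the idealized detector flags about 19 real cases and about 19 honest students, so a flag is right only about half the time. Under the assumed 80% figures, fewer than one flag in five would point at a real violation.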

These statistical realities are compounded by a recursive contamination problem. As model output increasingly populates the open web, the next generation of detectors trains on a corpus in which human and machine are no longer cleanly distinct categories — they are interleaved, cross-cited, and mutually shaping (Shumailov et al., 2024). Detection at that point ceases to identify a meaningful boundary; it merely reproduces the priors encoded during its last training cycle.

1.2 Disparate impact and the linguistic monoculture

The harms of unreliable detection are not distributed evenly. Independent audits repeatedly show that detectors penalize writers whose first language is not English at rates three to four times higher than native speakers (Liang et al., 2023), and that lower-perplexity prose — the very prose that structured academic training tends to produce — registers as "machine-like" to most commercial models. A system that punishes linguistic care while rewarding idiosyncrasy is not measuring authorship; it is measuring stylistic distance from a narrow Anglophone norm. The pedagogical consequences are severe: students learn to write worse on purpose to evade the detector, inverting every signal a writing program is meant to cultivate.

Trusted by universities and companies worldwide

How it works

From draft to peer review feedback in just three steps

01

Drop in your draft

Upload your manuscript as a .docx file, or paste a section into an existing Jenni document. Peer Review reads the entire document end to end.

02

Run Peer Review

Jenni evaluates your manuscript against standard peer review criteria, scores the key areas, and marks actionable improvements directly in your draft.

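03

Resolve, rerun, repeat

Comments appear directly in your manuscript, linked to the specific passages that need revision. Resolve each issue and watch your score improve.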

How it works

See Peer Review in action

Watch Jenni read a real manuscript, score it against the rubric, and leave comments on every section that needs improvement.

How it works

Built for academic rigor

Most AI tools only give you generic writing feedback. Peer Review evaluates your manuscript the way a reviewer would.

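Reads the whole manuscript

Peer Review reads your full draft word by word, catching every argument, every methods description, and every transition, so the feedback reflects the entire document.

The same criteria reviewers use

Peer Review fills out the same review form that top journals use, with scores for soundness, contribution, and presentation, plus written feedback.

Comments linked to passages

Jenni anchors every comment to a specific sentence, with a reason and a suggested revision. You learn not only what is wrong, but exactly what to change and where.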

Part of the review

Your complete pre-submission review

Peer Review is one of four review tools that find problems before reviewers do. Run them together for a complete pre-submission check.

Peer review: 8 / 10

Manuscript scored against a peer-review rubric with reviewer comments on each section.

Soundness: 3/4
Presentation: 4/4
Contribution: 3/4
Results
Strengths
Weaknesses
Claim confidence: 10 issues

The claim confidence analysis addressed issues of redundant, weak, or missing citations, alongside instances of contradiction in citation arguments.

Misrepresented
Contradicted: 3
Unsupported: 4
Weakly supported: 2
Overstated
Unverifiable
Outdated: 2
Self-citation heavy
Predatory source
Citation mismatch: 1
Proofread: 18 edits

Whilst generally sound, the text contains some areas for improvement to comply with academic best practices.

Word choice
[All → The majority of] participants reported improved outcomes.
Formality
Yang (2024) found a negative correlation [delete: which was interesting].
Grammar
These results indicate that early intervention [be effective → appears to be effective].
Transitions
[Also, → In addition,] Jones (2022) found similar results.
Overgeneralized
[All → The majority of] participants reported improved outcomes.
The results [prove → suggest] that X has an effect on Y.
Tone of voice: 22 notes

Suggestions across vocabulary, syntax, punctuation, tone and flow to keep a consistent academic voice.

All suggestions: 22
Vocabulary: 6
Syntax: 5
Punctuation: 4
Tone: 3
Flow: 4

Peer review

Claim confidence

Proofread

Tone of voice

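"The Claim Confidence feature is really useful. It flags any claims that are unsupported, overstated, or weakly supported."

Sabine Hossenfelder

Physicist and author of "Lost in Math"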

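"I regularly try out AI tools for research, and I have found Jenni to be the best and the easiest to use, especially for quickly reformatting references and developing new paper ideas."

Gareth

Editor-in-Chief, Taylor & Francis Group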

Frequently asked questions

Make progress on your great work today

Write your first paper with Jenni today and never look back

Start for free

No credit card required

Cancel anytime

Over 6 million

Academics worldwide

5.2 hours saved

On average per paper

Over 15 million

Papers written on Jenni
