Full text

Turn on search term navigation

Introduction

Microfossils such as foraminifers, coccolithophores, radiolaria, and diatoms, have been used to constrain depositional ages and environments of various kinds of seafloor sediments, as well as to provide high-resolution and detailed records of evolutionary processes (Armstrong & Brasier, 2005). Among them, microfossil fish teeth and denticles, referred to as ichthyoliths, are composed of calcium phosphate, which is resistant to dissolution on the deep seafloor (Doyle & Riedel, 1985; Sibert et al., 2017). Therefore, ichthyoliths are observed from almost all types of seafloor sediments, including pelagic clay, where other siliceous and calcareous microfossils are rarely observed. Taking advantage of this, ichthyoliths have provided key constraints for depositional ages (Doyle & Riedel, 1979, 1985; Ohta et al., 2020) and marine environments and/or ecosystems (Britten & Sibert, 2020; Sibert et al., 2014, 2016; Sibert & Rubin, 2021) especially in pelagic realms. In addition, ichthyoliths preserve a variety of geochemical systems, including strontium and neodymium isotopes, which can provide additional age constraints on sediments (e.g., Gleason et al., 2002; Ingram, 1992) and insights into deep water circulation patterns and origin of sedimentary components (e.g., Huck et al., 2016; Martin & Haley, 2000; Scher & Martin, 2004; Tanaka et al., 2022; Thomas et al., 2014). Oxygen isotopes in ichthyoliths have also been used to reconstruct changes in ocean temperature (e.g., MacLeod et al., 2018). However, traditional observation methods rely on “handpicking,” in which an observer picks fossils individually under a stereomicroscope (Ohta et al., 2020; Sibert et al., 2017; Tanaka et al., 2022). This process is time-consuming and can only be conducted by a skilled observer, making it difficult to analyze large numbers of ichthyoliths from various sediment samples.

Computer vision technologies are developing rapidly. In particular, image processing using deep learning has been applied to various fields, including earth science (Hoeser & Kuenzer, 2020; Mimura, Nakamura, Takao, et al., 2023). Automating previous manual observation processes saves time and provides opportunities for discoveries by increasing the number of fossils that can be observed and processed. The application of deep learning techniques for the classification of foraminifers (Hsiang et al., 2019) and radiolarians (Carlsson et al., 2022, 2023; Itaki, Taira, Kuwamori, Saito, et al., 2020; Tetard et al., 2020), and coccolithophores (Beaufort et al., 2022) is enhancing the resolution in paleoenvironmental studies. These studies detect particles by thresholding and recognize their classes using classification models. However, this method is difficult to directly apply to ichthyoliths because it is sometimes challenging to identify the outline of ichthyoliths by thresholding method (Figure 1). To solve this problem, we have proposed an automated detection of ichthyoliths in microscopic images by combining the object detection model “Mask R-CNN” (He et al., 2020) and image classification model “EfficientNet-V2,” both of which are based on deep learning techniques (Mimura et al., 2022). Although the system showed a good performance, two problems remained. First, due to the scarcity of the learning data set, the system could only detect triangular teeth, leaving denticles and saw-toothed ichthyoliths undetected (Figure 1). Second, there was a time loss in the combined system, as a well-trained object detection model can distinguish classes without using the classification model.

[IMAGE OMITTED. SEE PDF]

Recently, we compared the performances of object detection models “Mask R-CNN” and “YOLOv5” (Jocher et al., 2022) in detecting signals of hydrothermal activity in echo sounder images (Mimura, Nakamura, Takao, et al., 2023) and showed that the YOLOv5 model achieved much higher performance than that of the Mask R-CNN model. Here, with reference to this, we applied “YOLOv7” (Wang et al., 2022), one of the latest versions of You Only Look Once (YOLO, Redmon et al., 2016), to solve the problem of ichthyolith detection. To overcome the problems associated with the previous system developed by us, we aimed to detect teeth, denticles, and irregular shapes of teeth in a single step.

Materials and Methods

Sample Description

We used pelagic clay samples obtained from the Deep Sea Drilling Project (DSDP) Site 576, Ocean Drilling Program (ODP) Site 1149, Integrated Ocean Drilling Program (IODP) Sites U1366 and U1370, and piston cores KR13-02 PC04 and MR14-E02 PC11. All cores were recovered from the Pacific Ocean at water depths of more than 5,000 m (Table S1 in Supporting Information S1). We aimed to cover a variety of depositional ages from the late Cretaceous to the present using DSDP/ODP/IODP samples and to enhance the number of irregular teeth called Rectangular saw-toothed (Figure 1) by collecting images from specific horizons of the two piston cores.

Slide Preparation and Imaging

Glass slides were prepared from the samples as described by Mimura et al. (2022). Approximately 3–10 g of the sample was mixed well with deionized water and sieved using a 62-μm mesh. Larger particles were collected in a centrifuge, mixed with sodium polytungstate with a specific gravity of approximately 2.8 g/cm³, and centrifuged at 1,000–1,500 rpm to collect heavier particles, which were proposed by Sibert et al. (2017). The collected particles were washed with deionized water, moved onto glass slides using a pipette, dried at 40°C, and sealed with a cover glass using a light-curing adhesive.

Imaging of glass slides was also performed as described previously (Mimura et al., 2022). Using a digital microscope RX-100 (Hirox Co., Ltd.), the whole part of the observation realm (∼24 × 36 mm) was divided into ∼1,000 squares (∼1.15 × 1.15 mm/1,200 × 1,200 pixels). The z-stack images were automatically acquired using motorized x, y, and z stages. To capture as many ichthyoliths as possible in a complete form, each image overlaps with adjoining images by 20%.

Generation of Data Sets

Out of more than 1 million (M) images of the microscopic field of view, 12,219 were selected for “original” data sets. The locations and classes of the ichthyoliths within the images were annotated manually. Ichthyoliths were classified into three classes (Figure 1): triangular tooth (class name: “tooth”), denticle (“denticle”), and forms similar to Rectangular saw-toothed (“saw-toothed”).

Two data sets were generated from these images and annotations. The data set “original_selected” comprised 6,945 images with ichthyoliths, and the data set “original_all” comprised 6,945 images with ichthyoliths and 5,274 images without ichthyoliths (Mimura, Nakamura, Yasukawa, et al., 2023). The data sets contained 7,705 triangular teeth, 533 denticles, and 103 saw-toothed shapes. The images and corresponding annotation files were randomly split into three subsets: 80% for training, 10% for validation, and 10% for testing. We note here that images in each subset are the same between the two data sets, except for the image that does not contain ichthyoliths. This enabled us to conduct performance tests on the same data set (i.e., models trained on the training subset of data set original_selected can be tested by the testing subset of the data set original_all).

Tuning of Hyperparameters

We conducted hyperparameter tuning by training the “YOLOv7” model under different initial learning rates (“lr0” in YOLOv7's parameter file) and the final one-cycle learning rates (“lrf”). A stochastic gradient descent algorithm with a momentum fixed at 0.937 was applied for training. The image size was fixed at 640 × 640 pixels and the batch size at 8. The models were trained on a local Windows PC with a single graphic board with 16 GB of memory (GeForce RTXTM 3080 Ti, NVIDIA Inc.).

Training Conditions

YOLOv7 provides several models with various numbers of trainable parameters. In this study, we compared five models, “YOLOv7-tiny,” “YOLOv7,” “YOLOv7-X,” “YOLOv7-W6,” and “YOLOv7-E6,” each having 6.2M, 36.9M, 71.3M, 70.4M, and 97.2M parameters, respectively. Training of YOLOv7-tiny and YOLOv7 models was conducted on the local Windows PC, while training of the higher models was conducted on the cloud computing platform “Google Colaboratory” (Carneiro et al., 2018). The image size was basically set to 640 × 640 pixels. However, we also trained YOLOv7-W6 models with a larger image size set to 1,280 × 1,280 pixels, as Wang et al. (2022) proposed for larger models. In all training cases, the batch size was fixed at 8. The models were trained on either the local Windows PC, a local Linux PC with two graphic boards having 24 GB memory (GeForce RTXTM 3090 Ti, NVIDIA Inc.), or Google Colaboratory (see Table 2).

Following YOLOv7's online augmentation method, the images were randomly flipped vertically and/or horizontally, and the colors, scales, and shear of the images were randomly changed every time the training images were loaded.

Practical Test

In the data sets described in Section 2.3, more than half of the images contained at least one ichthyolith, whereas only tens to one hundred ichthyoliths are observed from ∼1,000 images in actual observation. We, therefore, conducted a practical test to evaluate the performance of the trained models under more practical conditions.

Three samples at DSDP Site 576, not used in the original data sets described in Section 2.3 or the extended data set described in Section 3.3, were selected for the practical test. The models detected ichthyoliths from the whole field-of-view images (30,826 in total) taken from 28 slides. Since microscopic images were taken with overlap, duplicated detections were excluded by calculating absolute coordinates in the entire slide (Figure 2). The slides were also observed manually under a polarization microscope. We tested the practical applicability of the trained models by comparing the number of ichthyoliths counted by the models with that observed manually.

[IMAGE OMITTED. SEE PDF]

Results and Discussion

Hyperparameter Tuning and Iteration Test

F1 scores of YOLOv7 models trained with different hyperparameters on data set “original_all” are presented in Table 1. The initial learning rate of 0.0007 and final one-cycle learning rate of 0.05 were the most suitable conditions in this study. Under the same condition, we then conducted and evaluated five training iterations and observed that one standard error (1 SE) of the F1 score was 0.008 (Table S2 in Supporting Information S1). When comparing the performance of the models in the following discussion, a difference in F1 scores greater than 2 SE (0.016) was considered significant.

Table 1 F1 Scores of the Models Trained on Different Hyperparameters of Initial Learning Rate (“lr0”) and Final One-Cycle Learning Rate (“lrf”)

	lrf
lr0	0.1	0.05	0.01
0.001	0.08	0.29	0.75
0.0007	0.46	0.82	0.55
0.0004	0.68	0.13	0.21

Comparison of Performances Under Different Training Conditions

The performance of the models trained on different model sizes and data sets is detailed in #1 to #12 of Table 2. We evaluated the performance of models based on averaged F1 scores of the three classes (macro-F1 score). Comparing the number of parameters (Figure 3a), models with ∼70M trainable parameters (YOLOv7-X, YOLOv7-W6) exhibited the highest F1 score, suggesting that these models are suitable for this study. Comparing the image sizes (Figure 3b), we observed that the models trained with the input image size set at 640 exhibited higher F1 scores than those trained with an image size of 1,280. Although the difference in the data set “selected” is less than 2SE, we suggest that the suitable input image size is 640, as larger input size increases the risk of overfitting (e.g., Sabottke & Spieler, 2020). Finally, comparing the data set type (Figure 3c), the results exhibited a variety of trends. However, following the discussion above, if we focus on the cases with a number of parameters around 70M and input image size at 640, models trained on the data set “all” showed higher F1 scores than those trained on the data set “selected.” Thus, we concluded that the suitable training condition in this study is (a) to use models with ∼70M parameters (YOLOv7-X or YOLOv7-W6), (b) to set the input image size at 640, and (c) to train on a data set “all,” which is composed of both images containing ichthyoliths and images that do not contain ichthyoliths.

Table 2 Performances of the Training With Different Models and Data Sets

Case	1	2	3	4	5	6	7	8	9	10	11	12	13	14
Condition	Data set	original_selected	original_all	extended_all
Environment	Windows	Windows	Colab	Ubuntu	Ubuntu	Ubuntu	Windows	Windows	Colab	Ubuntu	Ubuntu	Ubuntu	Ubuntu	Ubuntu
Model	YOLOv7-tiny	YOLOv7	YOLOv7-X	YOLOv7-W6	YOLOv7-W6	YOLOv7-E6	YOLOv7-tiny	YOLOv7	YOLOv7-X	YOLOv7-W6	YOLOv7-W6	YOLOv7-E6	YOLOv7-X	YOLOv7-W6
#Param. (M)	6.2	36.9	71.3	70.4	70.4	97.2	6.2	36.9	71.3	70.4	70.4	97.2	71.3	70.4
Image size	640	640	640	640	1,280	640	640	640	640	640	1,280	640	640	640
Precision	Tooth	0.814	0.885	0.885	0.882	0.778	0.760	0.641	0.868	0.857	0.778	0.791	0.768	0.910	0.931
Denticle	0.671	0.829	0.859	0.803	0.712	0.702	0.672	0.852	0.853	0.756	0.668	0.653	0.832	0.895
Saw-toothed	0.778	0.875	0.799	0.888	0.817	0.714	0.694	0.800	0.900	0.833	0.727	0.833	0.778	0.887
Average	0.754	0.863	0.848	0.858	0.769	0.725	0.669	0.840	0.870	0.789	0.728	0.751	0.840	0.904
Recall	Tooth	0.789	0.832	0.857	0.850	0.907	0.862	0.880	0.859	0.896	0.908	0.916	0.870	0.843	0.813
Denticle	0.755	0.837	0.869	0.776	0.857	0.755	0.837	0.776	0.831	0.918	0.857	0.776	0.878	0.878
Saw-toothed	0.699	0.698	0.800	0.800	0.893	0.500	0.682	0.800	0.899	0.999	0.797	0.500	0.700	0.800
Average	0.748	0.789	0.842	0.888	0.886	0.706	0.800	0.811	0.875	0.942	0.857	0.715	0.807	0.830
f1 score	Tooth	0.801	0.858	0.871	0.866	0.838	0.808	0.742	0.863	0.876	0.838	0.849	0.816	0.875	0.868
Denticle	0.711	0.833	0.864	0.789	0.778	0.728	0.745	0.812	0.842	0.829	0.751	0.709	0.854	0.886
Saw-toothed	0.736	0.777	0.799	0.842	0.853	0.588	0.688	0.800	0.899	0.908	0.760	0.625	0.737	0.841
Average	0.749	0.822	0.845	0.832	0.823	0.708	0.725	0.825	0.872	0.859	0.787	0.717	0.822	0.865

[IMAGE OMITTED. SEE PDF]

Efficient Production of Training Data Set Using Detection Results

YOLOv7 can output results as text files in the same format as the training labels. Taking advantage of this, we enhanced the sizes of data sets by first predicting a trained model and then checking the result manually. Using the YOLOv7-X model trained on the data set “all” with an image size of 640 (#9 of Table 2), the existence of ichthyoliths was predicted from ∼1,100,000 images generated from the six sites considered in this study. Images from three samples at Site 576 used for the practical test were excluded. We collected 4,463 images in which the model predicted the existence of the class “denticle” or “saw-toothed,” which were relatively small compared to the class “tooth.” After the manual check of detection results for the 4,463 images, 2,528 images contained ichthyoliths, and 1,935 did not have ichthyoliths; of those containing ichthyoliths, 1,657 teeth, 1,282 denticles, and 108 saw-toothed ichthyoliths were identified. Notably, the “denticle” was more than twice the number in the original data set, and the “saw-toothed” was almost the same as the number in the original data set. As well as the original data sets, images, and annotation information were randomly split into training (80%), validation (10%), and testing (10%) subsets.

The data set “extended_all” was generated by combining the data set collected by the above process and the data set “original_all” (Mimura, Nakamura, Yasukawa, et al., 2023). Considering the discussion in Section 3.2, we trained the two models, YOLOv7-X and YOLOv7-W6, on the data set “extended_all” with an input image size set at 640. The performances of the trained models are shown in #13 and #14 of Table 2.

Practical Test

We conducted a practical test for the four models: YOLOv7-X trained on the data sets “original_all” (#9 of Table 2) and “extended_all” (#13), YOLOv7-w6 trained on the data sets “original_all” (#10) and “extended_all” (#14). The number of ichthyoliths detected by these models and manually counted are shown in Table S3. We also calculated the root mean square percentage error (RMSPE), using the following equation: 1 $\text{RMSPE}=\sqrt{\frac{1}{n}\sum\limits _{i=1}^{n}{\left(\frac{{\widehat{y}}_{i}-{y}_{i}}{{y}_{i}}\right)}^{2}}\times 100\,[\%]$ where n, ${\widehat{y}}_{i}$ , and y_i indicate the number of samples, the predicted ichthyoliths, and the manually observed ichthyoliths, respectively.

Comparing the models trained on the data set “original_all” (#9, #10) and “extended_all” (#13, #14), models trained on “extended_all” showed trends closer to y = x for classes tooth and denticle (Figures 4a and 4b). The high performance of the model trained with the “extended_all” data set may be attributed to the high variation of false patterns in practical conditions. We realized that models trained on the original data set confused various triangular particles or patterns with teeth (Figure S1 in Supporting Information S1). Since the “extended_all” data set contains many images that the preliminary model misdetected, the model trained with this data set is considered to learn false positives efficiently. RMSPEs suggest that using the v7-w6_extended_all model (#14), the number of teeth and denticles from a sample can be estimated with ∼7% and ∼24% error rates, respectively. On the other hand, RMSPEs for the “saw-toothed” class are >70%. Furthermore, no clear trend was observed (Figure 4c), indicating that the number of “saw-toothed” cannot be accurately estimated based solely on the model's detection result.

[IMAGE OMITTED. SEE PDF]

We also manually checked the images detected by models #13 and #14 and removed false positives and duplications that could not be excluded by the algorithm described in Figure 2. After checking model #13's detection, we observed a trend closer to y = x (Figures 4d–4f), indicating that combining manual review with model #13 is preferable. Model #13, with manual check, achieved an RMSPE of ∼3%, ∼9%, and almost no error for counting the number of teeth, denticles, and saw-toothed ichthyoliths, respectively (Table S3).

Advantages of Object Detection Method Using YOLO-v7

The application of deep learning to microfossil observations has attracted increasing attention recently (Carlsson et al., 2022, 2023; Hsiang et al., 2019; Itaki, Taira, Kuwamori, Maebayashi, et al., 2020; Marchant et al., 2020; Mitra et al., 2019; Romero et al., 2020; Salonen et al., 2019; Tetard et al., 2020). A commonly used method in particle detection is to apply rule-based thresholding to detect each particle and subsequently classify them using an image classification model. Although these methods require less work to prepare a data set, deep learning-based detection has advantages over traditional methods in finding “challenging” particles. While traditional rule-based thresholding methods struggle to detect particles that overlap, have drastic changes in brightness, or have almost similar brightness to the background (Figure 1) in ichthyolith slides, deep learning-based methods can accurately detect them. Therefore, we propose that object detection would broaden the range of deep learning applications in microfossil studies.

Compared to our previous method (Mimura et al., 2022), which required two steps, object detection by Mask R-CNN and image classification by EfficieneNet-V2, the new method can detect ichthyolith in a single step, which enhances the efficiency of observation. We measured the detection times for processing 10,884 slide images using the two methods on Google Colaboratory. While the previous method required 11,250 s in total, 7,230 s for detection using Mask R-CNN, and 4,020 s for classification using EfficientNet-V2, the new method required only 1,040 s in total process, indicating that the new method is approximately 10 times faster than the previous method.

Implications for Biostratigraphic and Paleoecological Studies Using Ichthyoliths

We expect the new observation method to make the biostratigraphy of ichthyoliths more precise, advancing progress in paleoceanography and resource geology related to pelagic (red) clay. Pelagic clay covers over one-third of the global ocean (Dutkiewicz et al., 2015) and has huge variation in bulk geochemistry (Dunlea et al., 2015; Mimura et al., 2019). Therefore, pelagic clay is a good recorder of long-term and global/regional environmental changes (Kyte et al., 1993; Tanaka et al., 2022; Yasukawa et al., 2023; Zhou & Kyte, 1992). Moreover, pelagic clay is also attracting attention as a promising resource for rare-earth elements (Kato et al., 2011; Ren et al., 2021; Takaya et al., 2018; Yasukawa et al., 2014). However, the scarcity of microfossils except for ichthyoliths has hampered making precise age models of pelagic clay. Letting machines perform much of the time-consuming observations, substantial amounts of ichthyoliths can be observed, and more accurate age models will be established. This should provide numerous insights into the evolution of pelagic environments from paleoceanographic viewpoints, as well as the ore genesis and potential distributions of the prospective deep-sea mineral resource.

We also expect that this tool will improve our understanding in biological and ecological studies. As a demonstration, we show a downhole variation of denticle/tooth (D/T) ratios at DSDP Site 576 in the western North Pacific Ocean (Table 3, Figure 5), which were generated from the detection results of model #13 combined with manual check. D/T ratio is an index for relative ratios of shark and ray-fined fish, an indicator of marine vertebrate community stability (Sibert et al., 2016). By manual counting in a previous study (Sibert et al., 2016), three stages in the D/T ratios from the late Cretaceous to the present were proposed. Cretaceous ocean (i.e., older than 66 Ma) was characterized by high D/T ratios, reflecting a relatively small number of ray-fined fishes compared to the present ocean. Subsequently, Paleogene ocean (from 66 to ∼20 Ma) showed moderate D/T ratios, reflecting the evolution of ray-finned fish after the K/Pg boundary (Sibert & Norris, 2015). Finally, the modern ocean (from ∼20 Ma to the present) is characterized by low D/T ratios, which may reflect an extinction event of sharks in the early Miocene (Sibert & Rubin, 2021) and the consequent predominance of ray-finned fish. In the previous study, the trend was clearly exhibited from the South Pacific (DSDP Site 596), but the evidence from the North Pacific (ODP Site 886) was somewhat limited due to the huge hiatus in the Paleogene (Figure 5). Using our deep learning-based image processing method, we found D/T ratios results that were consistent with the previous study from DSDP Site 576 in the North Pacific site that has continuous Paleogene sedimentation, supporting the pelagic vertebrate community structure proposed in Sibert et al. (2016). While this method is still developing, high throughput data collection provides the opportunity for elucidating the interaction between environmental change and the marine vertebrate community.

Table 3 The Total Count of Ichthyoliths in the Three Classes Detected by Model #13 (YOLOv7-X)

Sample	Depth (mbsf)	Age	Weight (g)	v7x_extended_all	v7x_extended_all + manual check	Outlier
Tooth	Denticle	SawToothed	Tooth	Denticle	SawToothed
576B_01_02_77	2.28	P-Q^a	2.53	2	1	0	2	0	0	1^b
576B_01_06_52	8.03	P-Q^a	3.45	16	0	1	13	0	1
576B_02_03_125	12.46	P-Q^a	2.99	44	4	2	36	1	0
576B_02_07_23	17.44	P-Q^a	6.40	89	3	8	76	1	1
576B_03_03_125	21.96	Miocene	2.19	12	1	0	9	0	0	1^b
576B_03_06_81	26.02	Miocene	3.81	123	5	5	101	3	3
576B_04_01_75	27.96	Oligocene	2.72	38	3	2	31	2	1	1^b
576B_04_02_25	28.96	Oligocene	3.26	151	5	6	124	4	2
576B_04_03_25	30.46	Oligocene	3.65	191	9	6	164	9	1
576B_04_03_75	30.96	Oligocene	3.75	194	7	8	165	4	3
576B_04_04_75	32.46	Oligocene	5.71	526	15	7	472	9	3
576B_04_05_75	33.97	Oligocene	4.24	542	32	4	472	24	2
576B_04_06_21	34.93	Oligocene	4.04	1,099	78	17	969	61	3
576B_04_07_27	36.485	Oligocene	4.84	517	46	3	451	39	1
576B_05_01_25	36.96	Eocene	3.71	455	44	3	391	32	2
576B_05_02_75	38.96	Eocene	5.29	497	40	1	417	35	1
576B_05_03_75	40.47	Eocene	5.26	635	55	4	519	43	4
576B_05_04_75	41.97	Eocene	4.43	115	13	0	96	12	0
576B_05_05_25	42.97	Eocene	5.52	294	31	1	223	26	0
576B_05_06_75	44.97	Eocene	4.63	413	21	0	345	13	0
576B_06_02_75	48.47	Paleocene	2.85	373	16	0	330	13	0
576B_06_04_23	50.95	Paleocene	3.89	28	10	0	23	8	0	1^b
576B_06_04_140	52.11	Cretaceous	3.42	438	52	1	357	47	0
576B_06_06_140	55.11	Cretaceous	4.16	486	82	0	392	68	0
576B_07_03_139	60.105	Cretaceous	4.51	444	53	0	308	46	0
576B_07_07_30	65.01	Cretaceous	2.48	239	21	0	164	18	0

[IMAGE OMITTED. SEE PDF]

Conclusions

In this study, we proposed a new and efficient method for the observation of ichthyoliths, which is approximately 10 times faster than our previous method. Using this method, we expect that studies using ichthyoliths, including biostratigraphy, geochemistry, paleoecology, and the evolution of fishes, will become more precise due to improved sample throughput and identification. Conventional studies on ichthyolith stratigraphy have focused mainly on the presence or absence of each ichthyolith species. In contrast, ratios of the species were hardly considered, possibly due to the enormous amount of manual work required to count the total number of fossils in a discrete sediment sample under a microscope. Since the object detection method is capable of counting the total number of ichthyoliths in a sample, as well as classifying them to a particular type (here, teeth, denticles, or saw-toothed teeth), it can rapidly calculate a ratio of each ichthyolith species within an entire sample slide glass. This tool enables research focusing on quantitative changes in the occurrence of each ichthyolith morphotype, which in turn will provide more accurate depositional ages on pelagic clays, improve geochemical reconstructions, and open the possibilities for high-resolution ecological and evolutionary studies of fish and sharks at significantly increased spatiotemporal resolution. Finally, while we focused here on ichthyoliths, which are understudied compared to other microfossil groups, the automated deep learning methods presented here can be applied broadly to a wide array of microfossil groups, increasing the throughput of data across many fields of study.

Acknowledgments

The authors thank Prof. Richard Norris at the University of California San Diego for preliminary discussions on ichthyolith analysis. This research used drill core samples collected by the Deep Sea Drilling Project (DSDP), Ocean Drilling Program (ODP), and Integrated Ocean Drilling Program (IODP). This research was funded by the Japan Society for the Promotion of Science (JSPS) KAKENHI Grant 20H05658 to YK, 17H01361 to KN, 19J14560, 21K20354, and 23K13192 to KM, and JST FOREST Program (Grant JPMJFR227A, Japan) to KY, as well as National Science Foundation (NSF) Grant 2403839 to ECS.

Conflict of Interest

The authors declare no conflicts of interest relevant to this study.

Data Availability Statement

We named a series of program codes “yolov7-slideObservation” and made it available on GitHub (). The data sets for this study can be accessed at figshare (Mimura, Nakamura, Yasukawa, et al., 2023).

References

Armstrong, H. A., & Brasier, M. D. (2005). Microfossils (2nd ed., p. 296). Blackwell Publishing. [DOI: https://dx.doi.org/10.1017/S001675680621238X]

Word count: 4317

Show less

© 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

Microfossils of fish teeth and denticles, referred to as ichthyoliths, provide critical information for depositional ages, paleo‐environments, and marine ecosystems, especially in pelagic realms. However, owing to their small size and rarity, it is time‐consuming and difficult to analyze large numbers of ichthyoliths from sediment samples, limiting their use in scientific studies. Here, we propose a method to automatically detect ichthyoliths from microscopic images using a deep learning technique. We applied YOLO‐v7, one of the latest object detection architectures, and trained several models under different conditions. The model trained under appropriate conditions with an original data set achieved an F1 score of 0.87. We then enhanced the data set efficiently using the pre‐trained model. We validated the practical applicability of the model by comparing the number of ichthyoliths detected by the model with those counted manually. This revealed that the best model can predict the number of triangular teeth, denticles and irregularly shaped teeth with minimal human intervention. This object detection method can extend the applicability of deep learning to a wider array of microfossils and has the potential to dramatically increase the spatiotemporal resolution of ichthyolith records for applications across disciplines.

Details

Title

Applicability of Object Detection to Microfossil Research: Implications From Deep Learning Models to Detect Microfossil Fish Teeth and Denticles Using YOLO‐v7

Author

Mimura, K.¹

; Nakamura, K.²

; Yasukawa, K.³

; Sibert, E. C.⁴

; Ohta, J.⁵; Kitazawa, T.⁶; Kato, Y.⁷

¹ Ocean Resources Research Center for Next Generation, Chiba Institute of Technology, Narashino, Japan, Department of Systems Innovation, School of Engineering, The University of Tokyo, Bunkyo‐ku, Japan
² Ocean Resources Research Center for Next Generation, Chiba Institute of Technology, Narashino, Japan, Department of Systems Innovation, School of Engineering, The University of Tokyo, Bunkyo‐ku, Japan, Frontier Research Center for Energy and Resources, School of Engineering, The University of Tokyo, Bunkyo‐ku, Japan
³ Department of Systems Innovation, School of Engineering, The University of Tokyo, Bunkyo‐ku, Japan, Frontier Research Center for Energy and Resources, School of Engineering, The University of Tokyo, Bunkyo‐ku, Japan
⁴ Department of Geology & Geophysics, Woods Hole Oceanographic Institution, Woods Hole, MA, USA
⁵ Ocean Resources Research Center for Next Generation, Chiba Institute of Technology, Narashino, Japan, Frontier Research Center for Energy and Resources, School of Engineering, The University of Tokyo, Bunkyo‐ku, Japan, Volcanoes and Earth's Interior Research Center, Research Institute for Marine Geodynamics, Japan Agency for Marine‐Earth Science and Technology (JAMSTEC), Yokosuka, Japan
⁶ Department of Systems Innovation, School of Engineering, The University of Tokyo, Bunkyo‐ku, Japan
⁷ Ocean Resources Research Center for Next Generation, Chiba Institute of Technology, Narashino, Japan, Department of Systems Innovation, School of Engineering, The University of Tokyo, Bunkyo‐ku, Japan, Submarine Resources Research Center, Research Institute for Marine Resources Utilization, Japan Agency for Marine‐Earth Science and Technology (JAMSTEC), Yokosuka, Japan

Section

Research Letter

Publication year

2024

Publication date

Jan 1, 2024

Publisher

John Wiley & Sons, Inc.

e-ISSN

2333-5084

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1029/2023EA003122

ProQuest document ID

2919481761

Applicability of Object Detection to Microfossil Research: Implications From Deep Learning Models to Detect Microfossil Fish Teeth and Denticles Using YOLO‐v7

Jump to:

Full text

Abstract

Details

Suggested sources